Automatic Speaker Verification

Cargado por

Subhangi Dutta

0% encontró este documento útil (0 votos)

11 vistas10 páginas

Cepstral method

Derechos de autor

Formatos disponibles

PPT, PDF, TXT o lea en línea desde Scribd

Compartir este documento

Compartir o incrustar documentos

Opciones para compartir

¿Le pareció útil este documento?

¿Este contenido es inapropiado?

Denunciar este documento

Cepstral method

Copyright:

Formatos disponibles

Descargue como PPT, PDF, TXT o lea en línea desde Scribd

Marcar por contenido inapropiado

0% encontró este documento útil (0 votos)

11 vistas10 páginas

Automatic Speaker Verification

Cargado por

Subhangi Dutta

Cepstral method

Copyright:

Formatos disponibles

Descargue como PPT, PDF, TXT o lea en línea desde Scribd

Marcar por contenido inapropiado

Saltar a página

Está en la página 1de 10

Buscar dentro del documento

SUBHANGI DUTTA

ECE SECTION C
ROLL 137
Speaker verification is a process to accept or reject the
identity claim of a speaker by comparing a set of
measurements of the speakers utterances with a reference
set of measurements of the utterance of the person whose
identity is claimed.

Features selected for analysis may include pitch, intensity,
the first three formants, and selected prediction
coefficients.

This presentation focuses on the cepstral analysis method
as proposed by Furui.
Block diagram indicating principal operations as proposed by
Furui.
There are two inputs to the system, the identity claim
and the sample utterance. The identity claim causes
reference data corresponding to the claim to be
retrieved.

The second input is the sample utterance. This speech
wave is bandlimited from 100 Hz to 3.0 kHz and
sampled at a 6.67 kHz rate.

The recording interval is scanned to find the
endpoints of the utterance. The endpoint detection is
accomplished by means of an energy calculation.

A 30 ms Hamming window is applied to the
emphasized speech signal every 10 ms.

The utterance is then analyzed to extract the first to
tenth-order Linear predictor coefficients successively
from each frame by the auto-correlation method.

These coefficients are transformed into cepstrum
coefficients using the following recursive relationship

where ci and ai are the ith-order cepstrum
coefficient and linear predictor
coefficient, respectively.
The cepstrum coefficients are averaged over the
duration of the entire utterance and the average values
are subtracted from the cepstrum coefficients of every
frame.

This compensates for frequency-response distortions
introduced by the transmission system.
The time functions of the normalized cepstrum
coefficients are expanded by an orthogonal polynomial
representation over short time segments (90 ms intervals
every 10 ms).

Then the utterance is represented by the time functions of
coefficients of the orthogonal polynomial representation.

A part of the set of these coefficients which are most
effective in separating the overall distance distribution of
customer and impostor sample utterances are selected for
speaker verification. The selection is made based on the
inter-to-intra speaker variability ratio.
The sample utterance is brought into time registration with
the reference template to calculate the distance between
them using a dynamic programming technique.

We denote two contours as R(n), 1 <= n <= N, and T(m),
1 <= m <= M. We denote R(n) and T(m) as guide contour
and slave contour, respectively.

The purpose of the time warping algorithm is to provide a
mapping between the time indexes n and m such that a
time registration between the two utterances is obtained.
We denote the mapping w, between n and m as m = w(n).
A similarity measure or distance function D must be
defined for every pair of points (n, m).
The distance of each element is weighted by
intraspeaker variability and summed to produce the
overall distance.
Finally, the overall distance is compared with a
threshold distance value to determine whether the
identity claim should be accepted or rejected.

También podría gustarte

Digital Signal Processing Speech Recognition Paper
Documento12 páginas
Digital Signal Processing Speech Recognition Paper
Siri Sreeja
Aún no hay calificaciones
Recognizing Voice For Numerics Using MFCC and DTW
Documento4 páginas
Recognizing Voice For Numerics Using MFCC and DTW
International Journal of Application or Innovation in Engineering & Management
Aún no hay calificaciones
Accurate Calculation of Harmonics/Inter-Harmonics of Time Varying Load
Documento4 páginas
Accurate Calculation of Harmonics/Inter-Harmonics of Time Varying Load
IOSRJEN : hard copy, certificates, Call for Papers 2013, publishing of journal
Aún no hay calificaciones
An Automatic Speaker Recognition System
Documento11 páginas
An Automatic Speaker Recognition System
Niomi Golrai
100% (1)
DSP Techniques Power Spectrum Estimation Lab
Documento3 páginas
DSP Techniques Power Spectrum Estimation Lab
Jai Gaizin
Aún no hay calificaciones
Ijaiem 2014 11 17 52
Documento5 páginas
Ijaiem 2014 11 17 52
International Journal of Application or Innovation in Engineering & Management
Aún no hay calificaciones
Automatic Speech Recognition using Correlation Analysis
Documento5 páginas
Automatic Speech Recognition using Correlation Analysis
Tra Le
Aún no hay calificaciones
Classifying Dysfluent & Fluent Speech Using MFCC & ML
Documento9 páginas
Classifying Dysfluent & Fluent Speech Using MFCC & ML
Padmini Palli
Aún no hay calificaciones
Voice Analysis Using Short Time Fourier Transform and Cross Correlation Methods
Documento6 páginas
Voice Analysis Using Short Time Fourier Transform and Cross Correlation Methods
prasad
Aún no hay calificaciones
Makerere University voice operated door lock project
Documento9 páginas
Makerere University voice operated door lock project
Calvin Elijah
Aún no hay calificaciones
A Comparative Study of Different Entropies For Spectrum Sensing Techniques
Documento15 páginas
A Comparative Study of Different Entropies For Spectrum Sensing Techniques
suchi87
Aún no hay calificaciones
A Low Complexity Channel Estimation Algorithm For Massive MIMO System
Documento12 páginas
A Low Complexity Channel Estimation Algorithm For Massive MIMO System
ilbahi
Aún no hay calificaciones
SPAWC 2013 Liao Etal HandoverOptimization
Documento5 páginas
SPAWC 2013 Liao Etal HandoverOptimization
mimbrillito
Aún no hay calificaciones
A New Silence Removal and Endpoint Detection Algorithm For Speech and Speaker Recognition Applications
Documento5 páginas
A New Silence Removal and Endpoint Detection Algorithm For Speech and Speaker Recognition Applications
Eng Smsma
Aún no hay calificaciones
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
Documento6 páginas
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
International Journal of computational Engineering research (IJCER)
Aún no hay calificaciones
High SNR Analysis For MIMO Broadcast Channels: Dirty Paper Coding vs. Linear Precoding
Documento33 páginas
High SNR Analysis For MIMO Broadcast Channels: Dirty Paper Coding vs. Linear Precoding
kumaar1943
Aún no hay calificaciones
Instantaneous Pitch Estimation Algorithm Based On Multirate Sampling
Documento5 páginas
Instantaneous Pitch Estimation Algorithm Based On Multirate Sampling
Raiatea Moeata
Aún no hay calificaciones
New Techniques For Automatic Speaker Verification
Documento8 páginas
New Techniques For Automatic Speaker Verification
shamzaam
Aún no hay calificaciones
Speaker Recognition Using Vocal Tract Features
Documento5 páginas
Speaker Recognition Using Vocal Tract Features
International Journal of Engineering Inventions (IJEI)
Aún no hay calificaciones
Performance of Diversity Combining Techniques For Antenna Arrays
Documento4 páginas
Performance of Diversity Combining Techniques For Antenna Arrays
Tushar Saxena
Aún no hay calificaciones
Euro97 Warp
Documento4 páginas
Euro97 Warp
Rishabh Mishra
Aún no hay calificaciones
Estimating Time Delay Using GCC For Speech Source Localisation
Documento7 páginas
Estimating Time Delay Using GCC For Speech Source Localisation
International Journal of Application or Innovation in Engineering & Management
Aún no hay calificaciones
WR Iee2
Documento6 páginas
WR Iee2
Sayed Ahmed Ali Alqallaf
Aún no hay calificaciones
On Using Transmission Overhead Efficiently For Channel Estimation in OFDM
Documento6 páginas
On Using Transmission Overhead Efficiently For Channel Estimation in OFDM
Anonymous uspYoqE
Aún no hay calificaciones
Simulation of The Ultrasonic Inspection of Composite Materials
Documento4 páginas
Simulation of The Ultrasonic Inspection of Composite Materials
LiebstandarteSSAH
Aún no hay calificaciones
3D CNN-Based SER
Documento17 páginas
3D CNN-Based SER
Ashwani Rathee
Aún no hay calificaciones
Multichannel Linear Equalisers
Documento4 páginas
Multichannel Linear Equalisers
Ibrahim Özkan
Aún no hay calificaciones
Speaker Recognition System Using MFCC and Vector Quantization
Documento7 páginas
Speaker Recognition System Using MFCC and Vector Quantization
lambanaveen
Aún no hay calificaciones
EEL6586 Final Project:: A Speaker Identification and Verification System
Documento16 páginas
EEL6586 Final Project:: A Speaker Identification and Verification System
Ramana Reddy
Aún no hay calificaciones
Quality Control For UMTS-AMR Speech Channels
Documento4 páginas
Quality Control For UMTS-AMR Speech Channels
payam12
Aún no hay calificaciones
10 1 1 149
Documento5 páginas
10 1 1 149
suspended22
Aún no hay calificaciones
Voice Recognition
Documento6 páginas
Voice Recognition
Vuong Luong
Aún no hay calificaciones
Implementation of Speech Recognition Using Artificial Neural Networks
Documento12 páginas
Implementation of Speech Recognition Using Artificial Neural Networks
Harman Singh Somal
Aún no hay calificaciones
Novel Approach For Detecting Applause in Continuous Meeting Speech
Documento5 páginas
Novel Approach For Detecting Applause in Continuous Meeting Speech
Manjeet Singh
Aún no hay calificaciones
MFCC Feature Extraction
Documento9 páginas
MFCC Feature Extraction
Abhishek Kushwaha
Aún no hay calificaciones
Modelling and Visualising the Human Vocal Tract in 3D
Documento4 páginas
Modelling and Visualising the Human Vocal Tract in 3D
thyagomend
Aún no hay calificaciones
Power System Harmonics Estimation Using Linear Least Squares Method and
Documento6 páginas
Power System Harmonics Estimation Using Linear Least Squares Method and
Vladimir Becejac
Aún no hay calificaciones
Speaker Identification E6820 Spring '08 Final Project Report Prof. Dan Ellis
Documento16 páginas
Speaker Identification E6820 Spring '08 Final Project Report Prof. Dan Ellis
Rakhi Ramachandran
Aún no hay calificaciones
Muffler Modeling by Transfer Matrix Method and Experimental Verification
Documento9 páginas
Muffler Modeling by Transfer Matrix Method and Experimental Verification
Supatmono NAI
Aún no hay calificaciones
Ma Kale
Documento3 páginas
Ma Kale
salihyucael
Aún no hay calificaciones
IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control
Documento5 páginas
IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control
Mostafa Abdelrahman
Aún no hay calificaciones
Vector Quantization Approach For Speaker Recognition Using MFCC and Inverted MFCC
Documento7 páginas
Vector Quantization Approach For Speaker Recognition Using MFCC and Inverted MFCC
Manish Pradhan
Aún no hay calificaciones
Speech Feature Extraction
Documento9 páginas
Speech Feature Extraction
Georgy Abraham
Aún no hay calificaciones
Digital Representation of Speech Waveform: Sampling of Speech Signals Review of The Statistical Model For Speech
Documento7 páginas
Digital Representation of Speech Waveform: Sampling of Speech Signals Review of The Statistical Model For Speech
profmns
Aún no hay calificaciones
Chen1993 tecnicaPulso-ecoBandaAncha
Documento5 páginas
Chen1993 tecnicaPulso-ecoBandaAncha
Fabian Torres Robles
Aún no hay calificaciones
Fusion of Spectrograph and LPC Analysis For Word Recognition: A New Fuzzy Approach
Documento6 páginas
Fusion of Spectrograph and LPC Analysis For Word Recognition: A New Fuzzy Approach
susasuresh
Aún no hay calificaciones
NVH Fallstudie Frequency Analysis
Documento5 páginas
NVH Fallstudie Frequency Analysis
reek_bhat
Aún no hay calificaciones
CM Kaltfl 080709
Documento5 páginas
CM Kaltfl 080709
Võ Quy Quang
Aún no hay calificaciones
Department of E.C.E.: Digital Communications Lab Manual
Documento29 páginas
Department of E.C.E.: Digital Communications Lab Manual
మొక్కపాటి మాధవి
Aún no hay calificaciones
Analysis of How Antenna Size Impacts MIMO System Capacity and Performance
Documento19 páginas
Analysis of How Antenna Size Impacts MIMO System Capacity and Performance
Nebiye Solomon
Aún no hay calificaciones
Fast Fourier Transform of Frequency Hopping Spread Spectrum in Noisy Environment
Documento5 páginas
Fast Fourier Transform of Frequency Hopping Spread Spectrum in Noisy Environment
Naveed Ramzan
Aún no hay calificaciones
Ultrasonic Signal De-Noising Using Dual Filtering Algorithm
Documento8 páginas
Ultrasonic Signal De-Noising Using Dual Filtering Algorithm
vhito619
Aún no hay calificaciones
Queueing Model
Documento54 páginas
Queueing Model
Bikram Kumar
Aún no hay calificaciones
GSM Speech Quality Measures Based on Transmission Parameters
Documento5 páginas
GSM Speech Quality Measures Based on Transmission Parameters
Grecu Sorin
Aún no hay calificaciones
Dynamic Compressive Spectrum Sensing Algorithm for Cognitive Radio Networks
Documento6 páginas
Dynamic Compressive Spectrum Sensing Algorithm for Cognitive Radio Networks
Jayaraman Tamilvendhan
Aún no hay calificaciones
A Low Cost TDOA Localization System: Setup, Challenges and Results
Documento4 páginas
A Low Cost TDOA Localization System: Setup, Challenges and Results
cuteabhi47
Aún no hay calificaciones
2000 - Data-Driven Temporal Filters and Alternatives To GMM in Speaker Verification
Documento20 páginas
2000 - Data-Driven Temporal Filters and Alternatives To GMM in Speaker Verification
Susanta Sarangi
Aún no hay calificaciones
Software Radio: Sampling Rate Selection, Design and Synchronization
De Everand
Software Radio: Sampling Rate Selection, Design and Synchronization
Elettra Venosa
Aún no hay calificaciones
Error-Correction on Non-Standard Communication Channels
De Everand
Error-Correction on Non-Standard Communication Channels
Edward A. Ratzer
Aún no hay calificaciones
Hamiltonian Monte Carlo Methods in Machine Learning
De Everand
Hamiltonian Monte Carlo Methods in Machine Learning
Tshilidzi Marwala
Aún no hay calificaciones
Report Bravo L Per00 4371
Documento90 páginas
Report Bravo L Per00 4371
Subhangi Dutta
Aún no hay calificaciones
India Needs An Energy Security Doctrine - Livemint
Documento9 páginas
India Needs An Energy Security Doctrine - Livemint
Subhangi Dutta
Aún no hay calificaciones
Cochlear Implant
Documento1 página
Cochlear Implant
Subhangi Dutta
Aún no hay calificaciones
Ban Bossy Leadership Tips For Girls Shared by Glenn Laken Barbara Laken
Documento11 páginas
Ban Bossy Leadership Tips For Girls Shared by Glenn Laken Barbara Laken
Glenn Laken
Aún no hay calificaciones
Hotels in Manipal
Documento1 página
Hotels in Manipal
Subhangi Dutta
Aún no hay calificaciones
Jibanananda Das Dhusar Pandulipi
Documento48 páginas
Jibanananda Das Dhusar Pandulipi
Arkava Das
100% (1)
Weekend Places
Documento87 páginas
Weekend Places
Subhangi Dutta
Aún no hay calificaciones
Suprava Profile
Documento9 páginas
Suprava Profile
Chetna Shah
Aún no hay calificaciones
Cambridge University Engineering Department Trumpington Street, Cambridge, England, CB2 1PZ
Documento4 páginas
Cambridge University Engineering Department Trumpington Street, Cambridge, England, CB2 1PZ
Bhuvan Gupta
Aún no hay calificaciones
Speech Recognition For Bangla Digit Using DTW & CNN in Matlab
Documento13 páginas
Speech Recognition For Bangla Digit Using DTW & CNN in Matlab
Mehedi Hossen Limon
Aún no hay calificaciones
Real-Time Time-Domain Pitch Tracking Using Wavelets
Documento12 páginas
Real-Time Time-Domain Pitch Tracking Using Wavelets
youdiez
Aún no hay calificaciones
Classifying speech using K-NN model
Documento7 páginas
Classifying speech using K-NN model
Naga Raju G
Aún no hay calificaciones
Product Data Sheet: VIBRO Condition Monitoring 3 (VCM-3 and VCM-3 Ex)
Documento7 páginas
Product Data Sheet: VIBRO Condition Monitoring 3 (VCM-3 and VCM-3 Ex)
Amir Habib
Aún no hay calificaciones
Speaker Verification in Java
Documento35 páginas
Speaker Verification in Java
Phat Luong
100% (1)
MFCCs
Documento12 páginas
MFCCs
Vineeth Bhaskara
Aún no hay calificaciones
C14 - Speech Emotion Recognition Using Machine Learning
Documento118 páginas
C14 - Speech Emotion Recognition Using Machine Learning
Perinban D
Aún no hay calificaciones
3.home Automation
Documento6 páginas
3.home Automation
I.k. Neena
Aún no hay calificaciones
Gear Noise
Documento26 páginas
Gear Noise
Waqar Siddiqui
100% (1)
Synthetic Speech Detection Through Short Term and Long-Term Prediction Traces
Documento14 páginas
Synthetic Speech Detection Through Short Term and Long-Term Prediction Traces
Mohsin Sarfraz
Aún no hay calificaciones
Seminar Report On Voice Morphing
Documento36 páginas
Seminar Report On Voice Morphing
Naresh Mahor
Aún no hay calificaciones
MFCC CZT
Documento10 páginas
MFCC CZT
Sufyan Nikanin
Aún no hay calificaciones
Seewave Analysis
Documento17 páginas
Seewave Analysis
Javier Toledo
Aún no hay calificaciones
Envelope Analysis
Documento9 páginas
Envelope Analysis
rfriosEP
Aún no hay calificaciones
Audio Signal Processing Basics
Documento11 páginas
Audio Signal Processing Basics
SRINIVASAN
Aún no hay calificaciones
Automatic Speaker Recognition Using LPCC and MFCC
Documento4 páginas
Automatic Speaker Recognition Using LPCC and MFCC
Editor IJRITCC
Aún no hay calificaciones
A Practical Handbook of Speech Coders
Documento15 páginas
A Practical Handbook of Speech Coders
Vany Braun
Aún no hay calificaciones
MFCC Features: Appendix A
Documento19 páginas
MFCC Features: Appendix A
LaZeR
Aún no hay calificaciones
Debabala Swain Machine Learning and Information 2020
Documento533 páginas
Debabala Swain Machine Learning and Information 2020
MAHMOUD CERAY
Aún no hay calificaciones
Mel-Scaled Filter Bank: Mel (F) 2595 Log10 (1+f/700)
Documento3 páginas
Mel-Scaled Filter Bank: Mel (F) 2595 Log10 (1+f/700)
amrithageorge
Aún no hay calificaciones
Voice Morphing Final Report
Documento33 páginas
Voice Morphing Final Report
rahuljindal
Aún no hay calificaciones
MATLAB Functionality For Digital Speech Processing
Documento51 páginas
MATLAB Functionality For Digital Speech Processing
Osman Khan
Aún no hay calificaciones
Loudspeaker Minimum Phase Estimation Methods
Documento5 páginas
Loudspeaker Minimum Phase Estimation Methods
Corina Popa
Aún no hay calificaciones
Quran Recognition Jurnal
Documento10 páginas
Quran Recognition Jurnal
suhailan_safei
Aún no hay calificaciones
Voice Morphing Seminar Report
Documento36 páginas
Voice Morphing Seminar Report
Vinay Reddy
Aún no hay calificaciones
Speaker Diarization
Documento47 páginas
Speaker Diarization
Avinash Pandey
Aún no hay calificaciones
Voice Activation Using Speaker Recognition For Controlling Humanoid Robot
Documento6 páginas
Voice Activation Using Speaker Recognition For Controlling Humanoid Robot
Dyah Ayu Anggreini
Aún no hay calificaciones
Automatic Speech Recognition
Documento45 páginas
Automatic Speech Recognition
rahul
Aún no hay calificaciones