91% LM-dGMM meilleure itération (it. 1) 8.40% dernì ere itération (it. 30) 12.80% (a) ModèlesModèlesà 256 gaussiennes ,
74% LM-dGMM meilleure itération (it. 1) 9.66% dernì ere itération (it. 9) 10, p.47 ,
90% LM-dGMM meilleure itération (it. 1) 8, 85% dernì ere itération (it. 9) 9.66% (b) ModèlesModèlesà 512 gaussiennes ,
22: EER de systèmes GMM et LM-dGMM, avec et sans une T-normalisation des scores ,
le puissant formalisme SFA permet d'´ eliminer efficacement la variabilité inter-sessions, sans faire appeì a la T-normalisation des scores. Nous n'allons donc plus utiliser cette technique de normalisation dans les prochaines expériences, Les tableaux Tab. 4.23 et Tab. 4.24 rappellent les performances du système-GMM et du système-LM-dGMM sans compensation, et affichent ceux des système-GMM-SFA et système-LM-dGMM-SFA o` u la compensation est utilisée, avec respectivement M = 256, 2007. ,
nous avons proposé d'utiliser une nouvelle approche discriminante pour la reconnaissance automatique du locuteur qui consistè a utiliser des modèles GMMàGMMà grande marge, appelés LM-dGMM. Nos modèles reposent sur une récente approche discriminante pour la séparation multi-classes, qui a ´ eté appliquée en reconnaissance de la parole ; les modèles LM-GMM. Les LM-dGMM sont définis par un vecteur centro¨?decentro¨?de, une matrice de covariance diagonale et un offset ,
l'apprentissage des LM-dGMM est beaucoup moins complexe : l'algorithme d'apprentissage simplifié est plus rapide et moins demandeur de mémoire, De plus, ces modèles donnent de bien meilleurs résultats que les modèles LM-GMM originaux ,
Modeling prosodic dynamics for speaker recognition, Sous-système 5 : Proc. of ICASSP, pp.788-791, 2003. ,
Gender-dependent phonetic refraction for speaker recognition, IEEE International Conference on Acoustics Speech and Signal Processing, pp.149-152, 2002. ,
DOI : 10.1109/ICASSP.2002.1005698
A novel speaker binary key derived from anchor models, Proc. of INTERSPEECH, pp.2118-2121, 2010. ,
Robust Bayesian clustering, Neural Networks, vol.20, issue.1, pp.129-138, 2007. ,
DOI : 10.1016/j.neunet.2006.06.009
Neural models for extracting speaker characteristics in speech modelization systems, Proc. of EUROSPEECH, pp.2263-2266, 1993. ,
Automatic Speaker Recognition Based on Pitch Contours, The Journal of the Acoustical Society of America, vol.52, issue.6B, pp.1687-1697, 1972. ,
DOI : 10.1121/1.1913303
Score Normalization for Text-Independent Speaker Verification Systems, Digital Signal Processing, vol.10, issue.1-3, pp.1-342, 2000. ,
DOI : 10.1006/dspr.1999.0360
Feature and score normalization for speaker verification of cellular data, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., pp.49-52, 2003. ,
DOI : 10.1109/ICASSP.2003.1202291
Prosodic parameter for speaker identification, Proc. of ICSLP, pp.1197-1200, 2002. ,
Learning the decision function for speaker verification, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.425-428, 2001. ,
DOI : 10.1109/ICASSP.2001.940858
A connectionist approach for automatic speaker identification, International Conference on Acoustics, Speech, and Signal Processing, pp.265-268, 1990. ,
DOI : 10.1109/ICASSP.1990.115619
Nonlinear programming, Athena Scientific, 1999. ,
Recherche du rôle des intervenants et de leurs interactions pour la structuration de documents audiovisuels, 2011. ,
Standard and target driven AR-vector models for speech analysis and speaker recognition, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.5-8, 1992. ,
DOI : 10.1109/ICASSP.1992.226134
Pattern recognition and machine learning, 2006. ,
Speaker recognition using syllable-based constraints for cepstral frame selection, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4525-4528, 2009. ,
DOI : 10.1109/ICASSP.2009.4960636
ALIZE/SpkDet : a state-of-the-art open source software for speaker recognition, Proc. of Odyssey -The Speaker and Language Recognition Workshop, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01312982
Tools for Fusion and Calibration of automatic speaker detection systems, Online, 2005. ,
Application-independent evaluation of speaker detection, Computer Speech & Language, vol.20, issue.2-3, pp.230-275, 2006. ,
DOI : 10.1016/j.csl.2005.08.001
Discriminatively trained Probabilistic Linear Discriminant Analysis for speaker verification, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4832-4835, 2011. ,
DOI : 10.1109/ICASSP.2011.5947437
Text-dependent speaker verification using vector quantization source coding, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.35, issue.2, pp.133-143, 1987. ,
DOI : 10.1109/TASSP.1987.1165110
Fusing high-and low-level features for speaker recognition, Proc. of EUROSPEECH, pp.2665-2668, 2003. ,
Generalized linear discriminant sequence kernels for speaker recognition, Proc. of ICASSP, pp.161-164, 2002. ,
Weighted Nuisance Attribute Projection, Proc. of Odyssey -The Speaker and Language Recognition Workshop, pp.97-102, 2010. ,
Phonetic Speaker Recognition with Support Vector Machines, Advances in Neural Information Processing Systems 16, pp.1377-1384, 2004. ,
Support vector machines for speaker and language recognition, Computer Speech & Language, vol.20, issue.2-3, pp.210-229, 2006. ,
DOI : 10.1016/j.csl.2005.06.003
Speaker Comparison with Inner Product Discriminant Functions, Advances in Neural Information Processing Systems 22, pp.207-215, 2009. ,
Fusing discriminative and generative methods for speaker recognition : experiments on switchboard and NFI/TNO field data, Proc. of Odyssey -The Speaker and Language Recognition Workshop, pp.41-44, 2004. ,
Support vector machines using GMM supervectors for speaker verification, IEEE Signal Processing Letters, vol.13, issue.5, pp.308-311, 2006. ,
DOI : 10.1109/LSP.2006.870086
Multigrained modeling with pattern specific maximum likelihood transformations for text-independent speaker recognition, IEEE Transactions on Speech and Audio Processing, vol.11, issue.1, pp.61-69, 2003. ,
DOI : 10.1109/TSA.2003.809121
Speaker verification using fundamental frequency, Proc. of ICSLP, pp.161-164, 1998. ,
Probabilistic anchor models approach for speaker verification, Proc. of INTERSPEECH, pp.2005-2008, 2005. ,
Support-vector networks, Machine Learning, pp.273-297, 1995. ,
DOI : 10.1007/BF00994018
Statistical Voice Activity Detection Using Low-Variance Spectrum Estimation and an Adaptive Threshold, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.2, pp.412-424, 2006. ,
DOI : 10.1109/TSA.2005.855842
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.28, issue.4, pp.357-366, 1980. ,
DOI : 10.1109/TASSP.1980.1163420
YIN, a fundamental frequency estimator for speech and music, The Journal of the Acoustical Society of America, vol.111, issue.4, pp.1917-1930, 2002. ,
URL : https://hal.archives-ouvertes.fr/hal-01106271
Comparison of hidden Markov model techniques for automatic speaker verification in real-world conditions, Speech Communication, vol.17, issue.1-2, pp.81-90, 1995. ,
DOI : 10.1016/0167-6393(95)00015-G
Theory of Statistical Estimation, Proc. of the Cambridge Philosophical Society, pp.700-725, 1925. ,
DOI : 10.1017/S0305004100009580
AMIRAL: A Block-Segmental Multirecognizer Architecture for Automatic Speaker Recognition, Digital Signal Processing, vol.10, issue.1-3, pp.1-3172, 2000. ,
DOI : 10.1006/dspr.1999.0367
Cepstral analysis technique for automatic speaker verification, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.29, issue.2, pp.254-272, 1981. ,
DOI : 10.1109/TASSP.1981.1163530
Locally recurrent probabilistic neural networks with application to speaker verification, GESTS International Transaction on Speech Science and Engineering, vol.1, issue.2, pp.1-13, 2004. ,
Analysis of i-vector Length Normalization in Speaker Recognition Systems, Proc. of INTER- SPEECH, pp.249-252, 2011. ,
Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech, Proc. of International Conference on Multimedia and Expo, pp.205-208, 2003. ,
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Transactions on Speech and Audio Processing, vol.2, issue.2, pp.291-298, 1994. ,
DOI : 10.1109/89.279278
Comparison of scoring methods used in speaker recognition with Joint Factor Analysis, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4057-4060, 2009. ,
DOI : 10.1109/ICASSP.2009.4960519
SPro : " Speech Signal Processing Toolkit " . Online : https, 2003. ,
Utilisation de la prédiction linéaire en reconnaissance et adaptation au locuteur, Proc. of XIèmes Journées d' ´ Etudes sur la Parole (JEP), pp.163-171, 1980. ,
Distance measures for textindependent speaker recognition based on MAR model, Proc. of ICASSP, pp.309-312, 1994. ,
Voice source cepstrum coefficients for speaker identification, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4821-4824, 2008. ,
DOI : 10.1109/ICASSP.2008.4518736
Speaker recognition using phoneme-specific gmms, Proc. of Odyssey -The Speaker and Language Recognition Workshop, pp.179-184, 2004. ,
Within-class covariance normalization for SVM-based speaker recognition, Proc. of INTERSPEECH, pp.1471-1474, 2006. ,
Generalized Linear Kernels for One-Versus-All Classification: Application to Speaker Recognition, 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, pp.585-588, 2006. ,
DOI : 10.1109/ICASSP.2006.1661343
Text-independent speaker recognition using neural networks, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.153-156, 1992. ,
DOI : 10.1109/ICASSP.1992.226097
Maximum a Posteriori Adaptation of the Centroid Model for Speaker Verification, IEEE Signal Processing Letters, vol.15, pp.162-165, 2008. ,
DOI : 10.1109/LSP.2007.914792
A discriminative training algorithm for VQ-based speaker identification, IEEE Transactions on Speech and Audio Processing, vol.7, issue.3, pp.353-356, 1999. ,
Phonetic class-based speaker verification, Proc. of EUROSPEECH, pp.1665-1668, 2003. ,
Robustness to telephone handset distortion in speaker recognition by discriminative feature design, Speech Communication, vol.31, issue.2-3, pp.181-192, 2000. ,
DOI : 10.1016/S0167-6393(99)00077-1
Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990. ,
DOI : 10.1121/1.399423
RASTA processing of speech, IEEE Transactions on Speech and Audio Processing, vol.2, issue.4, pp.578-589, 1994. ,
DOI : 10.1109/89.326616
SVM kernel adaptation in speaker classification and verification, Proc. of INTERSPEECH, pp.1413-1416, 2004. ,
A comparison of methods for multiclass support vector machines, IEEE Transactions on Neural Networks, vol.13, issue.2, pp.415-425, 2002. ,
Speaker verification with multiple classifier fusion using Bayes based confidence measure, Proc. of INTERSPEECH, pp.2041-2044, 2007. ,
Study of noise robust voice activity detection based on periodic component to aperiodic component ratio, Proc. of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition (SAPA), pp.65-70, 2006. ,
Interpolated estimation of markov source parameters from sparse data, Proc. of Workshop Pattern Recognition in Practice, pp.381-397, 1980. ,
Fast training of Large Margin diagonal Gaussian mixture models for speaker identification, 2011 6th Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp.1-4, 2011. ,
DOI : 10.1109/SPED.2011.5940738
URL : https://hal.archives-ouvertes.fr/hal-00647213
Speaker verification using large margin GMM discriminative training, 2011 International Conference on Multimedia Computing and Systems, pp.1-5, 2011. ,
DOI : 10.1109/ICMCS.2011.5945650
URL : https://hal.archives-ouvertes.fr/hal-00647232
Online First) Discriminative speaker recognition using Large Margin GMM, Journal of Neural Computing & Applications, pp.10-1007 ,
A new kernel for SVM MLLR based speaker recognition, Proc. of INTERSPEECH, pp.290-293, 2007. ,
Bayesian Speaker Verification with Heavy-Tailed Priors, Proc. of Odyssey -The Speaker and Language Recognition Workshop, 2010. ,
Eigenvoice modeling with sparse training data, IEEE transactions on speech and audio processing, pp.345-354, 2005. ,
DOI : 10.1109/TSA.2004.840940
Factor Analysis Simplified, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.637-640, 2005. ,
DOI : 10.1109/ICASSP.2005.1415194
Improvements in Factor Analysis Based Speaker Verification, 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, pp.113-116, 2006. ,
DOI : 10.1109/ICASSP.2006.1659970
Joint Factor Analysis Versus Eigenchannels in Speaker Recognition, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.4, pp.1435-1447, 2007. ,
DOI : 10.1109/TASL.2006.881693
Speaker and Session Variability in GMM-Based Speaker Verification, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.4, pp.1448-1460, 2007. ,
DOI : 10.1109/TASL.2007.894527
A Study of Interspeaker Variability in Speaker Verification, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.5, pp.980-988, 2008. ,
DOI : 10.1109/TASL.2008.925147
Automatic speech and speaker recognition : Large margin and kernel methods, 2009. ,
DOI : 10.1002/9780470742044
Combining GMM's with Suport Vector Machines for Text-independent Speaker Verification, Proc. of EUROSPEECH, pp.1761-1764, 2001. ,
On separating glottal source and vocal tract information in telephony speaker verification, Proc. of ICASSP, pp.4545-4548, 2009. ,
Fusion of spectral feature sets for accurate speaker identification, Proc. of SPECOM, pp.361-365, 2004. ,
Comparative evaluation of maximum a Posteriori vector quantization and gaussian mixture models in speaker verification, Pattern Recognition Letters, vol.30, issue.4, pp.341-347, 2009. ,
DOI : 10.1016/j.patrec.2008.11.007
Conditional pronunciation modeling in speaker detection, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., pp.804-807, 2003. ,
DOI : 10.1109/ICASSP.2003.1202765
SVM-based Speaker Classification in the GMM Models Space, 2006 IEEE Odyssey, The Speaker and Language Recognition Workshop, 2006. ,
DOI : 10.1109/ODYSSEY.2006.248138
Rapid speaker adaptation in eigenvoice space, IEEE Transactions on Speech and Audio Processing, vol.8, issue.6, pp.695-707, 2000. ,
DOI : 10.1109/89.876308
An improved endpoint detector for isolated word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.29, issue.4, pp.777-785, 1981. ,
DOI : 10.1109/TASSP.1981.1163642
Unsupervised speaker recognition based on competition between self-organizing maps, IEEE Transactions on Neural Networks, vol.13, issue.4, pp.877-887, 2002. ,
DOI : 10.1109/TNN.2002.1021888
Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4541-4544, 2009. ,
DOI : 10.1109/ICASSP.2009.4960640
Client Dependent GMM-SVM Models for Speaker Verification, Artificial Neural Networks and Neural Information Processing - ICANN/ICONIP, pp.443-451, 2003. ,
DOI : 10.1007/3-540-44989-2_53
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer Speech & Language, vol.9, issue.2, pp.171-185, 1995. ,
DOI : 10.1006/csla.1995.0010
Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification, Speech Communication, vol.48, issue.1, pp.71-84, 2006. ,
DOI : 10.1016/j.specom.2005.05.013
Robust endpoint detection and energy normalization for real-time speech and speaker recognition, IEEE Transactions on Speech and Audio Processing, vol.10, issue.3, pp.146-157, 2002. ,
DOI : 10.1109/TSA.2002.1001979
An Algorithm for Vector Quantizer Design, IEEE Transactions on Communications, vol.28, issue.1, pp.84-95, 1980. ,
DOI : 10.1109/TCOM.1980.1094577
On the limited memory BFGS method for large scale optimization, Mathematical Programming, pp.503-528, 1989. ,
DOI : 10.1007/BF01589116
Improved GMM-UBM/SVM for speaker verification, Proc. of ICASSP, pp.925-928, 2006. ,
Noyaux de séquences pour la vérification du locuteur par MachinesàMachinesà Vecteurs de Support, 2007. ,
Feature Space Mahalanobis Sequence Kernels: Application to SVM Speaker Verification, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.8, pp.152465-2475, 2007. ,
DOI : 10.1109/TASL.2007.905147
URL : https://hal.archives-ouvertes.fr/inria-00438823
Speaker cluster based GMM tokenization for speaker recognition, Proc. of INTERSPEECH, pp.505-508, 2006. ,
Some methods for classification and analysis of multivariate observations, Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability, pp.281-297, 1967. ,
A further investigation on AR-vector models for text-independent speaker identification, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp.101-104, 1996. ,
DOI : 10.1109/ICASSP.1996.540300
Extraction of speaker-specific excitation information from linear prediction residual of speech, Speech Communication, issue.10, pp.481243-1261, 2006. ,
A comparison of various adaptation methods for speaker verification with limited enrollment data, Proc. of ICASSP, pp.929-932, 2006. ,
Speaker recognition by location in the space of reference speakers, Speech Communication, vol.48, issue.2, pp.127-141, 2006. ,
DOI : 10.1016/j.specom.2005.06.014
A comparative study of adaptation methods for speaker verification, Proc. of ICSLP, pp.581-584, 2002. ,
Linear prediction of speech, 1976. ,
DOI : 10.1007/978-3-642-66286-7
Long-term feature averaging for speaker recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.25, issue.4, pp.330-337, 1977. ,
DOI : 10.1109/TASSP.1977.1162961
Combining classifier decisions for robust speaker identification, Pattern Recognition, vol.39, issue.1, pp.147-155, 2006. ,
DOI : 10.1016/j.patcog.2005.08.004
A straightforward and efficient implementation of the factor analysis model for speaker verification, Proc. of INTERSPEECH, pp.1242-1245, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-01318480
Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4828-4831, 2011. ,
DOI : 10.1109/ICASSP.2011.5947436
A logical calculus of the ideas immanent in nervous activity, The Bulletin of Mathematical Biophysics, vol.5, issue.4, pp.115-133, 1943. ,
DOI : 10.1007/BF02478259
Systèmes de référence pour l'´ evaluation d'applications et la caractérisation de bases de données en reconnaissance automatique de la parole, Proc. of Journées d' ´ Etude sur la Parole (JEP), pp.323-326, 1987. ,
Discriminant AR- Vector Models for Free-Text Speaker Verification, Proc. of EUROSPEECH, pp.161-164, 1993. ,
A new SVM approach to speaker identification and verification using probabilistic distance kernels, Proc. of EUROSPEECH, pp.2965-2968, 2003. ,
Combining evidence from residual phase and MFCC features for speaker recognition, IEEE Signal Processing Letters, vol.13, issue.1, pp.52-55, 2006. ,
DOI : 10.1109/LSP.2005.860538
A decision theorectic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.31, issue.4, pp.31814-817, 1983. ,
DOI : 10.1109/TASSP.1983.1164173
A real-time trained system for robust speaker verification using relative space of anchor models, Computer Speech & Language, vol.24, issue.4, pp.545-561, 2010. ,
DOI : 10.1016/j.csl.2009.07.002
Phonetic speaker recognition using maximum-likelihood binary-decision tree models, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., pp.796-799, 2003. ,
DOI : 10.1109/ICASSP.2003.1202763
Robust voice activity detection using higher-order statistics in the LPC residual domain, IEEE Transactions on Speech and Audio Processing, vol.9, issue.3, pp.217-231, 2001. ,
DOI : 10.1109/89.905996
Numerical optimization, 1999. ,
DOI : 10.1007/b98874
Optimisation of neural models for speaker identification, International Conference on Acoustics, Speech, and Signal Processing, pp.261-264, 1990. ,
DOI : 10.1109/ICASSP.1990.115617
Radial basis function networks for speaker recognition, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing, pp.393-396, 1991. ,
DOI : 10.1109/ICASSP.1991.150359
Training Universal Background Models for Speaker Recognition, Proc. of Odyssey -The Speaker and Language Recognition Workshop, pp.52-57, 2010. ,
Predictive neural networks in text independent speaker verification: an evaluation on the SIVA database, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.2423-2426, 1996. ,
DOI : 10.1109/ICSLP.1996.607298
ASR dependent techniques for speaker identification, Proc. of ICSLP, pp.1337-1340, 2002. ,
Feature warping for robust speaker verification, Proc. of Odyssey -The Speaker and Language Recognition Workshop, pp.213-218, 2001. ,
Modeling of the glottal flow derivative waveform with application to speaker identification, IEEE Transactions on Speech and Audio Processing, vol.7, issue.5, pp.569-586, 1999. ,
DOI : 10.1109/89.784109
Probabilistic Linear Discriminant Analysis for Inferences About Identity, 2007 IEEE 11th International Conference on Computer Vision, pp.1-8, 2007. ,
DOI : 10.1109/ICCV.2007.4409052
NIST Speaker Recognition Evaluation Chronicles - Part 2, 2006 IEEE Odyssey, The Speaker and Language Recognition Workshop, pp.15-22, 2004. ,
DOI : 10.1109/ODYSSEY.2006.248120
A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989. ,
Fundamentals of speech recognition, 1993. ,
An Algorithm for Determining the Endpoints of Isolated Utterances, Bell System Technical Journal, vol.54, issue.2, pp.297-315, 1975. ,
DOI : 10.1002/j.1538-7305.1975.tb02840.x
Digital processing of speech signals, 1978. ,
Speaker recognition???general classifier approaches and data fusion methods, Pattern Recognition, vol.35, issue.12, pp.352801-2821, 2002. ,
DOI : 10.1016/S0031-3203(01)00235-7
Efficient voice activity detection algorithms using long-term speech information, Speech Communication, vol.42, issue.3-4, pp.3-4271, 2004. ,
DOI : 10.1016/j.specom.2003.10.002
Comparison of background normalization methods for text-independent speaker verification, Proc. of EUROSPEECH, pp.963-966, 1997. ,
Channel robust speaker verification via feature mapping, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., 2003. ,
DOI : 10.1109/ICASSP.2003.1202292
The SuperSID project : Exploiting high-level information for high-accuracy speaker recognition, Proc. of ICASSP, pp.784-787, 2003. ,
Robust text-independent speaker identification using Gaussian mixture speaker models, IEEE Transactions on Speech and Audio Processing, vol.3, issue.1, pp.72-83, 1995. ,
DOI : 10.1109/89.365379
Evaluation of Spoken Language Recognition Technology Using Broadcast Speech : Performance and Challenges, Proc. of Odyssey -The Speaker and Language Recognition Workshop, 2012. ,
Indexation de documents audio : Cas des grands volumes de données, 2008. ,
Fusing generative and discriminative UBM-based systems for speaker verification, Proc. of International workshop on MMUA, 2006. ,
Speaker identification via support vector classifiers, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp.105-108, 1996. ,
DOI : 10.1109/ICASSP.1996.540301
Large margin training of acoustic models for speech recognition, 2007. ,
Large margin Gaussian mixture modeling for phonetic classification and recognition, Proc. of ICASSP, pp.265-268, 2006. ,
Large Margin Hidden Markov Models for Automatic Speech Recognition, Advances in Neural Information Processing Systems 19, pp.1249-1256, 2007. ,
Robust entropy-based endpoint detection for speech recognition in noisy environments, Proc. of ICSLP, 1998. ,
Support vector machine with dynamic time-alignment kernel for speech recognition, Proc. of EUROSPEECH, pp.1841-1844, 2001. ,
Modeling prosodic feature sequences for speaker recognition, Speech Communication, vol.46, issue.3-4, pp.3-4455, 2005. ,
DOI : 10.1016/j.specom.2005.02.018
Using Post-Classifiers to Enhance Fusion of Low- and High-Level Speaker Recognition, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.7, pp.152063-2071, 2007. ,
DOI : 10.1109/TASL.2007.903054
Advances In Channel Compensation For SVM Speaker Recognition, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.629-632, 2005. ,
DOI : 10.1109/ICASSP.2005.1415192
A lognormal tied mixture model of pitch for prosody based speaker recognition, Proc. of EUROSPEECH, pp.1391-1394, 1997. ,
Modeling dynamic prosodic variation for speaker verification, Proc. of ICSLP, pp.3189-3192, 1998. ,
On the use of instantaneous and transitional spectral information in speaker recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.36, issue.6, pp.871-879, 1988. ,
DOI : 10.1109/29.1598
A vector quantization approach to speaker recognition, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.387-390, 1985. ,
DOI : 10.1109/ICASSP.1985.1168412
SVM based textdependent speaker identification for large set of voices, Proc. of EUSIPCO, pp.333-336, 2004. ,
MLLR transforms as features in speaker recognition, Proc. of INTERSPEECH, pp.2425-2428, 2005. ,
Speaker Recognition With Session Variability Normalization Based on MLLR Adaptation Transforms, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.7, pp.151987-1998, 2007. ,
DOI : 10.1109/TASL.2007.902859
Speaker Adaptive Cohort Selection for Tnorm in Text-Independent Speaker Verification, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.741-744, 2005. ,
DOI : 10.1109/ICASSP.2005.1415220
Robust Bayesian mixture modelling, Neurocomputing, vol.64, pp.235-252, 2005. ,
DOI : 10.1016/j.neucom.2004.11.018
Usefulness of the LPC-residue in text-independent speaker verification, Speech Communication, vol.17, issue.1-2, pp.145-157, 1995. ,
DOI : 10.1016/0167-6393(95)00010-L
Voice activity detection using a periodicity measure, IEE Proceedings I Communications, Speech and Vision, pp.377-380, 1992. ,
DOI : 10.1049/ip-i-2.1992.0052
MMIE training of large vocabulary recognition systems, Speech Communication, vol.22, issue.4, pp.303-314, 1997. ,
DOI : 10.1016/S0167-6393(97)00029-0
Semidefinite Programming, SIAM Review, vol.38, issue.1, pp.49-95, 1996. ,
DOI : 10.1137/1038003
Statistical Learning Theory, 1998. ,
Cepstral domain segmental feature vector normalization for noise robust speech recognition, Speech Communication, vol.25, issue.1-3, pp.1-3133, 1998. ,
DOI : 10.1016/S0167-6393(98)00033-8
Discriminant NAP for SVM speaker recognition, Proc. of Odyssey -The Speaker and Language Recognition Workshop, 2008. ,
Support vector machines for speaker verification and identification, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501), pp.775-784, 2000. ,
DOI : 10.1109/NNSP.2000.890157
Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data, Proc. of INTERSPEECH, pp.3321-3324, 2005. ,
Evaluation of kernel methods for speaker verification and identification, IEEE International Conference on Acoustics Speech and Signal Processing, pp.669-672, 2002. ,
DOI : 10.1109/ICASSP.2002.5743806
Speaker verification using sequence discriminant support vector machines, IEEE Transactions on Speech and Audio Processing, vol.13, issue.2, pp.203-210, 2005. ,
DOI : 10.1109/TSA.2004.841042
Text-independent speaker verification with dynamic trajectory model, IEEE Signal Processing Letters, vol.10, issue.5, pp.141-143, 2003. ,
DOI : 10.1109/LSP.2003.810913
Short-time Gaussianization for robust speaker verification, IEEE International Conference on Acoustics Speech and Signal Processing, pp.681-684, 2002. ,
DOI : 10.1109/ICASSP.2002.5743809
AANN: an alternative to GMM for pattern recognition, Neural Networks, vol.15, issue.3, pp.459-469, 2002. ,
DOI : 10.1016/S0893-6080(02)00019-9
Integration of Complementary Acoustic Features for Speaker Recognition, IEEE Signal Processing Letters, vol.14, issue.3, pp.181-184, 2007. ,
DOI : 10.1109/LSP.2006.884031
Using MAP estimation of feature transformation for speaker recognition, Proc. of INTERSPEECH, pp.849-852, 2008. ,
Joint map adaptation of feature transformation and Gaussian Mixture Model for speaker recognition, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4045-4048, 2009. ,
DOI : 10.1109/ICASSP.2009.4960516
Apprentissage discriminant, Modèles de Mélange de lois Gaussiennes, maximisation de la marge, reconnaissance du locuteur, compensation de la variabilité inter-sessions ,
Online First) Discriminative speaker recognition using Large Margin GMM, Journal of Neural Computing & Applications, pp.10-1007 ,
Fast training of Large Margin diagonal Gaussian mixture models for speaker identification, 2011 6th Conference on Speech Technology and Human-Computer Dialogue (SpeD) ,
DOI : 10.1109/SPED.2011.5940738
URL : https://hal.archives-ouvertes.fr/hal-00647213
Speaker Identification Using Discriminative Learning of Large Margin GMM, Neural Information Processing, pp.300-307, 2011. ,
DOI : 10.1007/b98874
URL : https://hal.archives-ouvertes.fr/hal-00647201
Apprentissage discriminant des GMMàGMMà grande marge pour la vérification automatique du locuteur, 2011. ,
Speaker verification using large margin GMM discriminative training, 2011 International Conference on Multimedia Computing and Systems, pp.1-5, 2011. ,
DOI : 10.1109/ICMCS.2011.5945650
URL : https://hal.archives-ouvertes.fr/hal-00647232
Fast training of Large Margin diagonal Gaussian mixture models for speaker identification, 2011 6th Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp.1-4, 2011. ,
DOI : 10.1109/SPED.2011.5940738
URL : https://hal.archives-ouvertes.fr/hal-00647213
Large Margin Gaussian mixture models for speaker identification, Proc. of INTERSPEECH, pp.1441-1444, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00532781
Cleaning Statistical Language Models, Proc. of SIIE, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00579376
Building Arabic textual corpus from the Web, Proc. of SIIE, 2008. ,