du système SPEERAL et du système IRÈNE sur les trois heures issues du corpus de développement, p.70 ,
BONG-SPEERAL) et de l'IRISA (BONG-SPEERAL) avec (P2) et sans (P1) adaptation acoustique, p.74 ,
BONG-SPEERAL) et de l'IRISA (BONG-SPEERAL) avec (P2) et sans (P1) adaptation acoustique, p.74 ,
un ROVER sans (Rover-4-2-0-1) et avec (Rover-(4-BONG-2-0-1)) la sortie de la combinaison BONG, p.80 ,
Avancées dans le domaine de la transcription automatique par décodage guidé, pp.4-09, 2012. ,
Yannick Estève : LIUM's systems for the IWSLT 2011 Speech Translation Tasks, IWSLT, pp.8-9, 2011. ,
Some recent research work at LIUM based on the use of CMU Sphinx, Mars 13, 2010. ,
Unsupervised model adaptation on targeted speech segments for LVCSR system combination, Interspeech, vol.2010, pp.26-30, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01433900
Hervé Blanchon : LIG approach for IWSLT09 : Using Multiple Morphological Segmenters for Spoken Language Translation of Arabic Amélioration de la combinaison de systèmes de reconnaissance de la parole par décodage guidé, IWSLT, pp.25-27, 2009. ,
A compact model for speaker-adaptive training, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.1137-1140, 1996. ,
DOI : 10.1109/ICSLP.1996.607807
An overview of decoding techniques for large vocabulary continuous speech recognition, Computer Speech & Language, vol.16, issue.1, pp.89-114, 2002. ,
Statistical inference for probabilistic functions of finite state Markov chains, Annals of Mathematical Statistics, vol.37, pp.1554-1563, 1966. ,
Bag of n-gram driven decoding for LVCSR system harnessing, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, pp.11-15, 2011. ,
DOI : 10.1109/ASRU.2011.6163944
URL : https://hal.archives-ouvertes.fr/hal-01434931
Low latency combination of parallelized single-pass LVCSR systems, In Interspeech, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-01313238
Generating complementary systems for speech recognition, Interspeech, 2006. ,
Directed decision trees for generating complementary systems, Speech Communication, vol.51, issue.3, pp.284-295, 2009. ,
DOI : 10.1016/j.specom.2008.09.004
URL : https://hal.archives-ouvertes.fr/hal-00499235
Class-Based n-gram Models of Natural Language, Computational Linguistics, vol.18, issue.23, pp.467-479, 1992. ,
Measurement of Complementarity of Recognition Systems, Proceedings of the 5th International Conference on Text, Speech and Dialogue, pp.283-290, 2004. ,
DOI : 10.1007/978-3-540-30120-2_36
An empirical study of smoothing techniques for language modeling, Computer Speech and Language, pp.359-394, 1999. ,
A new framework for system combination based on integrated hypothesis space, Interspeech, 2006. ,
Opportunities and challenges of parallelizing speech recognition, Proceedings of the 2nd USENIX conference on Hot topics in parallelism, pp.2010-85, 2010. ,
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences Readings in speech recognition, pp.65-74, 1990. ,
The LIUM Speech Transcription System : a CMU Sphinx IIIbased System for French Broadcast News, Interspeech 2005, pp.1653-1656, 2005. ,
Improvements to the LIUM French ASR system based on CMU sphinx : what helps to significantly reduce the word error rate ? In Interspeech, pp.2123-2126, 2009. ,
Maximum Likelihood from Incomplete Data via the EM Algorithm, Journal of the Royal Statistical Society, vol.39, pp.1-38, 1977. ,
Stream combination before and/or after the acoustic model, ICASSP, pp.1635-1638, 2000. ,
Systèmes de transcription automatique de la parole et logiciels libres, Traitement Automatique des Langues, vol.37, 2004. ,
Frédéric Béchet et Jérôme Farinas. The EPAC corpus : manual and automatic annotations of conversational speech in French broadcast news, LREC 2010, Malta, pp.17-23, 2010. ,
Posterior Probability Decoding , Confidence Estimation And System Combination, Proceedings NIST Speech Transcription Workshop, 2000. ,
Overview of the IWSLT 2011 Evaluation Campaign, Proceedings of the International Workshop on Spoken Language Translation (IWSLT), 2011. ,
A post-prcessing system to yield reduced word error rates : Recogniser Output Voting Error Reduction (ROVER), ASRU, pp.347-354, 1997. ,
A decision-theoretic generalization of on-linelearning and an application to boosting, European Conference on Computational Learning Theory (EUROCOLT), Mars 1995. (Cité en, p.47, 1995. ,
The Generation And Use Of Regression Class Trees For Mllr Adaptation, 1996. ,
Mean and variance adaptation within the MLLR framework, Computer Speech & Language, vol.10, issue.4, pp.249-264, 1996. ,
DOI : 10.1006/csla.1996.0013
Maximum likelihood linear transformations for HMM-based speech recognition, Computer Speech & Language, vol.12, issue.2, pp.75-98, 1998. ,
DOI : 10.1006/csla.1998.0043
Progress in the CU-HTK broadcast news transcription system, IEEE transactions speech and audio processing, 2006. ,
DOI : 10.1109/TASL.2006.878264
Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News, Proceedings of the 5th Intl. Conf. on Language Resources and Evaluations, 2006. ,
Highperformance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation, Interspeech, pp.2098-2101, 2008. ,
Linear discriminant analysis for improved large vocabulary continuous speech recognition, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.13-16, 1992. ,
DOI : 10.1109/ICASSP.1992.225984
A Formal Basis for the Heuristic Determination of Minimum Cost Paths, IEEE Transactions on Systems Science and Cybernetics, vol.4, issue.2, pp.100-107, 1968. ,
DOI : 10.1109/TSSC.1968.300136
ROVER, Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers on XX, NAACL '07, pp.65-68, 2007. ,
DOI : 10.3115/1614108.1614125
Frame based system combination and a comparison with weighted ROVER and CNC, Interspeech, 2006. ,
Morphosyntactic Processing of N-Best Lists for Improved Recognition and Confidence Measure Computation, Eurospeech'07, pp.1741-1744, 2007. ,
Modeling long distance dependence in language: topic mixtures vs. dynamic cache models, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.236-239, 1996. ,
DOI : 10.1109/ICSLP.1996.607085
Perplexity???a measure of the difficulty of speech recognition tasks, The Journal of the Acoustical Society of America, vol.62, issue.S1, p.63, 1977. ,
DOI : 10.1121/1.2016299
Continuous speech recognition, ACM SIGART Bulletin, issue.61, pp.33-34, 1977. ,
DOI : 10.1145/1045283.1045302
Interpolated estimation of Markov source parameters from sparse data, Proceedings, Workshop on Pattern Recognition in Practice. Amsterdam, 1980. ,
A comparison of two LVR search optimization techniques, Interspeech, 2002. ,
Improved clustering techniques for class-based statistical language modeling, Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), 1993. ,
Improved backing-off for M-gram language modeling, 1995 International Conference on Acoustics, Speech, and Signal Processing, pp.22-41, 1995. ,
DOI : 10.1109/ICASSP.1995.479394
Use of Gaussian selection in large vocabulary continuous speech recognition using HMMS, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996. ,
DOI : 10.1109/ICSLP.1996.607156
A cache-based natural language model for speech recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.12, issue.6, p.53, 1990. ,
DOI : 10.1109/34.56193
The LIMSI 2006 TC-STAR EPPS Transcription Systems, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.123-128, 2006. ,
DOI : 10.1109/ICASSP.2007.367240
Trigger-based language models: a maximum entropy approach, IEEE International Conference on Acoustics Speech and Signal Processing, pp.45-48, 1993. ,
DOI : 10.1109/ICASSP.1993.319225
Imperfect transcript driven speech recognition, ICSLP /Interspeech, pp.62-66, 2006. ,
System Combination by Driven Decoding, ICASSP, 2007. (Cité en pages 8, pp.66-67, 2007. ,
Reconnaissance automatique de la parole guidée par des transcriptions a priori, 2008. ,
Generalized driven decoding for speech recognition system combination, ICASSP, 2008. ,
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer Speech & Language, vol.9, issue.2, pp.171-185, 1995. ,
DOI : 10.1006/csla.1995.0010
Combining search spaces of heterogeneous recognizers for improved speech recognition, Proc. ICSLP, 2002. ,
Use of contexts in language model interpolation and adaptation, Interspeech, pp.360-363, 2009. ,
DOI : 10.1016/j.csl.2012.06.004
Language model cross adaptation for LVCSR system combination, Interspeech, pp.342-345, 2010. ,
DOI : 10.1016/j.csl.2012.07.010
Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation, Interspeech, pp.2857-2860, 2011. ,
The RWTH 2007 TC-STAR Evaluation System for European English and Spanish, Interspeech, 2007. ,
Finding Consensus Among Words : Lattice-Based Word Error Minimization, Proc. Eurospeech, pp.495-498, 1999. ,
DOI : 10.1006/csla.2000.0152
URL : http://arxiv.org/abs/cs/0010012
Linear prediction of speech, 1982. ,
Weighted finite-state transducers in speech recognition, Computer Speech & Language, vol.16, issue.1, pp.69-88, 2002. ,
DOI : 10.1006/csla.2001.0184
The LIA's French broadcast news transcription system, SWIM : Lectures by Masters in Speech Processing, 2004. ,
Look-Ahead Techniques for Fast Beam Search, ICASSP, 1997. ,
Language-model look-ahead for large vocabulary speech recognition, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.2095-2098, 1998. ,
DOI : 10.1109/ICSLP.1996.607215
Integration of diverse recognition methodologies through reevaluation of N-best sentence hypotheses, Proceedings of the workshop on Speech and Natural Language , HLT '91, pp.83-87, 1991. ,
DOI : 10.3115/112405.112416
Tools for the analysis of benchmark speech recognition tests, International Conference on Acoustics, Speech, and Signal Processing, pp.97-100, 1990. ,
DOI : 10.1109/ICASSP.1990.115546
The Kaldi Speech Recognition Toolkit, IEEE 2011 ASRU, Décembre 2011. (Cité en, p.27, 2011. ,
Comparison of MFCC and PLP parameterizations in the speaker independent continuous speech recognition task, Interspeech, pp.1813-1816, 2001. ,
Readings in speech recognition In A tutorial on hidden Markov models and selected applications in speech recognition, pp.267-296, 1990. ,
Holger Schwenk et Yannick Estève Overview of the IWSLT 2011 Evaluation Campaign. In LIUM's systems for the IWSLT 2011 Speech Translation Tasks, 2011. ,
Towards automatic closed captioning : low latency real time broadcast news transcription, Interspeech. ISCA, 2002. ,
The boosting approach to machine learning : An overview. Nonlinear Estimation and Classification, 2003. ,
Combining multiple speech recognizers using voting and language model information, Interspeech, pp.915-918, 2000. ,
Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.273-276, 2001. ,
DOI : 10.1109/ICASSP.2001.940820
Constructing ensembles of ASR systems using randomized decision trees, IEEE ICASSP, pp.197-200, 2005. ,
Comparison and Combination of Confidence Measures, Proceedings of the 5th International Conference on Text, Speech and Dialogue, pp.181-188, 2002. ,
DOI : 10.1007/3-540-46154-X_25
Cross-System Adaptation and Combination for Continuous Speech Recognition : The Influence of Phoneme Set and Acoustic Front-End ? In Interspeech, p.53, 2006. ,
Compensating Acoustic Mismatch Using Class-Based Histogram Equalization for Robust Speech Recognition, EURASIP Journal on Advances in Signal Processing, vol.2007, issue.1, 2007. ,
DOI : 10.1016/S0167-6393(00)00048-0
Combining outputs of multiple LVCSR models by machine learning, Septembre 2005. (Cité en, pp.9-15, 2005. ,
DOI : 10.1002/scj.20340
MMIE training of large vocabulary recognition systems, Speech Communication, vol.22, issue.4, pp.303-314, 1997. ,
DOI : 10.1016/S0167-6393(97)00029-0
Lattice Segmentation and Support Vector Machines for Large Vocabulary Continuous Speech Recognition, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.817-820, 2005. ,
DOI : 10.1109/ICASSP.2005.1415239
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Transactions on Information Theory, vol.13, issue.2, 1967. ,
Confidence measures for large vocabulary continuous speech recognition, IEEE Transactions on Speech and Audio Processing, vol.9, issue.3, pp.288-298, 2001. ,
DOI : 10.1109/89.906002
Incremental on-line feature space MLLR adaptation for telephony speech recognition, Interspeech, 2002. ,
Vers le temps réel en transcription automatique de la parole grand vocabulaire. These, Télécom ParisTech, 2007. ,