Skip to Main content Skip to Navigation

Transcription automatique de la parole spontanée

Abstract : Automatic speech recognition (ASR) systems currently reach enough performance to be integrated in various applications (human-machine dialogue, information retrieval, automatic indexing ...). However, in the context of large vocabulary speech recognition, which is used eg for transcribing radio broadcast, the quality of transcripts varies depending on the type of speech contained in the documents. Indeed, the ASR system performance are much better when transcribing prepared speech, close to a read text . Transcribing pontaneous speech is much more dificult, as it is characterized by many features (disfluencies, ungrammaticality, decreased the fluidity of speech...). This thesis work is the treatment of spontaneous speech and is part of the EPAC project. The main objective is to propose solutions to improve the ASR performance on this type of speech. We chose to address in our work, spontaneous speech as a particular object of study requiring specific treatments. Thus, in a first step, we propose a tool for automatic detection of spontaneous speech, based on its specificities. This tool is very important because it allows us, in a second time, to propose an approach for acoustic and language model adaptation of the ASR system on spontaneous speech without adding data, by automatically selecting the segments containing this type of speech. The transcript resulting from this adaptation offers recognition hypotheses different from those provided by the base system. The combination of these two proposals transcription show a significant reduction of the word error rate. This need for specific solutions finally facing some of our work toward correcting a specific problem, especially present in French: homophony. We then seek to correct the transcripts provided by an ASR system, using a method offering specific solutions to specific problems of homophony. The approach focuses on correcting errors, to which a particular solution is proposed. This post-processing method of ASR systems corrects some classes of words and homophones, regardless of the ASR system used.
Document type :
Complete list of metadata

Cited literature [119 references]  Display  Hide  Download
Contributor : Richard Dufour Connect in order to contact the contributor
Submitted on : Tuesday, May 24, 2011 - 6:18:59 PM
Last modification on : Friday, November 12, 2021 - 11:18:09 AM
Long-term archiving on: : Friday, November 9, 2012 - 12:06:35 PM


  • HAL Id : tel-00595465, version 1


Richard Dufour. Transcription automatique de la parole spontanée. Informatique [cs]. Université du Maine, 2010. Français. ⟨tel-00595465⟩



Record views


Files downloads