Skip to Main content Skip to Navigation

Acquisition automatique de traductions d'unités lexicales complexes à partir du Web

Stéphanie Léon 1
1 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Recently, machine translation has made significant progress by using lists of compound expressions such as " armed robbery " (" vol à main armée " in French). However, current systems are not yet capable of translating correctly many unknown expressions, such as "caisse centrale", mistranslated by Systran as "central case" instead of "central fund". This expression could have been automatically translated using the Web as a source of examples. The aim of this study is to acquire automatically a database of complex lexical units translations from the Web. We especially tackle difficult translation problems such as polysemy and idioms, for which better translation mechanisms are proposed. Beyond the practical aspects of machine translation improvement, we discuss the possible uses of the Web for general studies in linguistics.
Document type :
Complete list of metadatas

Cited literature [305 references]  Display  Hide  Download
Contributor : Stephanie Leon <>
Submitted on : Friday, February 6, 2009 - 1:28:35 PM
Last modification on : Monday, October 19, 2020 - 11:12:48 AM
Long-term archiving on: : Tuesday, June 8, 2010 - 8:18:48 PM


  • HAL Id : tel-00359235, version 1



Stéphanie Léon. Acquisition automatique de traductions d'unités lexicales complexes à partir du Web. Informatique [cs]. Université de Provence - Aix-Marseille I, 2008. Français. ⟨tel-00359235⟩



Record views


Files downloads