Skip to Main content Skip to Navigation
Theses

Énumération exhaustive et détection spécifique des analogies : étude pour les modèles de langue et la traduction automatique

Julien Gosme 1
1 Equipe Hultech - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen
Abstract : The research presented in this PhD thesis is in the machine translation field. By studying the foundations of example-based machine translation, especially in the Aleph system, we bring to light the problem of example selection. The Aleph system uses exclusively the operation of analogy to produce new sentences and new translations. The problem is to select the adequate sentences from a large corpus of examples to allow for the production of new sentences by analogy. Our first contribution consists in the design of a method for the complete enumeration of all analogies contained in a text. This method allows us to complete a statistical study of the most frequent analogies between word trigrams and to bring to light the most frequent patterns of analogy. These results allow us to design a new smoothing technique for trigram language models based on a small amount of patterns of analogy. We report experiments which show that this new smoothing technique outperforms classical methods.
Document type :
Theses
Complete list of metadatas

Cited literature [43 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00700559
Contributor : Julien Gosme <>
Submitted on : Wednesday, May 23, 2012 - 2:28:10 PM
Last modification on : Tuesday, February 5, 2019 - 12:12:40 PM
Long-term archiving on: : Friday, November 30, 2012 - 12:05:48 PM

Identifiers

  • HAL Id : tel-00700559, version 1

Citation

Julien Gosme. Énumération exhaustive et détection spécifique des analogies : étude pour les modèles de langue et la traduction automatique. Informatique et langage [cs.CL]. Université de Caen, 2012. Français. ⟨tel-00700559⟩

Share

Metrics

Record views

503

Files downloads

382