Un environnement générique et ouvert pour le traitement des expressions polylexicales : de l'acquisition aux applications

Abstract : This thesis presents an open and flexible methodological framework for the automatic acquisition of multiword expressions (MWEs) from monolingual textual corpora. This research is motivated by the importance of MWEs for NLP applications. After briefly presenting the modules of the framework, the work reports extrinsic evaluation results considering two applications: computer-aided lexicography and statistical machine translation. Both applications can benefit from automatic MWE acquisition and the expressions acquired automatically from corpora can both speed up and improve their quality. The promising results of our experiments encourage further investigation about the optimal way to integrate MWE treatment into these and many other applications.
Document type :
Theses
Liste complète des métadonnées

Cited literature [300 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00741147
Contributor : Abes Star <>
Submitted on : Monday, September 9, 2013 - 4:27:15 PM
Last modification on : Thursday, October 11, 2018 - 8:48:02 AM
Document(s) archivé(s) le : Tuesday, December 10, 2013 - 4:54:14 AM

File

29835_RAMISCH_2012_archivage1....
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00741147, version 2

Citation

Carlos Eduardo Ramisch. Un environnement générique et ouvert pour le traitement des expressions polylexicales : de l'acquisition aux applications. Autre [cs.OH]. Université de Grenoble, 2012. Français. ⟨NNT : 2012GRENM059⟩. ⟨tel-00741147v2⟩

Share

Metrics

Record views

676

Files downloads

1266