
Environnements centralisés et distribués pour lexicographes et lexicologues en contexte multilingue

Abstract : The growing need for lexical resources and the success of cooperative development projects such as LINUX lead to the idea of accumulating large amounts of very rich multilingual lexical data through cooperative construction on the Web and shared ("mutualized") use. Contributions that improve the data would be standardized and made available through an adapted environment. While studying the current state of the dictionary domain, we identified difficult problems such as the structuring and manipulation of heterogeneous data, as well as large amounts of multilingual lexical data and their visualization or cooperative construction by people with different skills. Prototypes and experiments on the consultation of heterogeneous resources, the enrichment and personalization of results, online resource building, and entry writing with a standard editor enabled us to solve these problems separately. This work allowed us to design a complete lexical database environment that addresses all of these problems as a specific layer directly above the DBMS tools and integrates a server for cooperative building. Its kernel includes a generic formalism for defining lexical structures, derived from G. Sérasset's SUBLIM but extended and translated into XML. This environment is currently applied to the Papillon project, which aims to build a five-language lexical database with voluntary contributors on the Internet. The database architecture consists of a monolingual dictionary for each language and a pivot dictionary of interlingual acceptions (axies) linking the monolingual entries (lexies), whose structure comes from explanatory and combinatorial lexicology. More languages are to be added soon. The server architecture is quite generic and could be reused quickly in other contexts (translation memories and tools for translators, communication and multilingual IR, multimedia annotations).

Cited literature : 100 references
Contributor : Mathieu Mangeot
Submitted on : Tuesday, June 22, 2004 - 6:37:42 PM
Last modification on : Friday, November 6, 2020 - 4:05:32 AM
Long-term archiving on : Wednesday, September 12, 2012 - 4:20:10 PM


  • HAL Id : tel-00006311, version 1




Mathieu Mangeot. Environnements centralisés et distribués pour lexicographes et lexicologues en contexte multilingue. Informatique et langage [cs.CL]. Université Joseph-Fourier - Grenoble I, 2001. Français. ⟨tel-00006311⟩


