Multilinguisation d'ontologies dans le cadre de la recherche d'information translingue dans des collections d'images accompagnées de textes spontanés

Abstract : The World Wide Web is a rapidly growing source of multimedia objects described in various natural languages. In order to use Semantic Web techniques to retrieve such objects (images, videos, etc.), we propose a method for extracting content from multilingual text collections, which takes one or several ontologies as parameters. The content extraction process is used, on the one hand, to index multimedia objects using their textual content and, on the other, to build formal queries from spontaneous user requests. The process is based on an interlingual annotation of texts that keeps ambiguities (polysemy and segmentation) in graphs. This first step makes it possible to apply common disambiguation processes at the level of a pivot language (interlingual lexemes). An ontology is passed as a parameter of the system by automatically aligning its elements with the interlingual lexemes of the pivot language. It is thus possible to use ontologies that were not built specifically for use in a multilingual context, and to extend the set of languages and their lexical coverage without modifying the ontologies. A demonstration system for multilingual image retrieval has been built in the framework of the OMNIA ANR project, providing an implementation of the proposed approach. It has thus been possible to evaluate the scalability of the approach and the quality of the annotations produced during the retrieval process.
Document type :
Theses
https://tel.archives-ouvertes.fr/tel-00743652
Contributor : Abes Star
Submitted on : Friday, October 19, 2012 - 3:52:19 PM
Last modification on : Tuesday, February 12, 2019 - 1:31:00 AM
Long-term archiving on : Sunday, January 20, 2013 - 3:43:04 AM

File

21440_ROUQUET_2012_archivage1....
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00743652, version 1

Citation

David Rouquet. Multilinguisation d'ontologies dans le cadre de la recherche d'information translingue dans des collections d'images accompagnées de textes spontanés. Autre [cs.OH]. Université de Grenoble, 2012. Français. ⟨NNT : 2012GRENM031⟩. ⟨tel-00743652⟩

Metrics

Record views : 955
File downloads : 2149