
Constitution d'une ressource sémantique arabe à partir d'un corpus multilingue aligné [Building an Arabic semantic resource from an aligned multilingual corpus]

Abstract : This study implements and evaluates techniques for extracting semantic relations from an aligned multilingual corpus. First, we compare the translational equivalents found in the aligned corpus. From these equivalences, we try to extract "cliques", which are maximal complete connected subgraphs in which all units are interrelated because of a probable semantic intersection. These cliques have the advantage of giving information on both the synonymy and the polysemy of units, and of providing a form of semantic disambiguation. Second, we attempt to link these cliques to a semantic lexicon (such as WordNet) in order to assess the possibility of recovering, for the Arabic units, semantic relations already defined for English, French or Spanish units. These relations would make it possible to build automatically a semantic resource useful for various NLP applications, such as question answering systems, machine translation, alignment systems and information retrieval.
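The clique extraction step mentioned in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the method used in the thesis: it treats each translational equivalence as an undirected edge between language-tagged units and enumerates maximal cliques with the basic Bron-Kerbosch algorithm. The function names `build_graph` and `maximal_cliques` and the toy word pairs are hypothetical and do not come from the thesis corpus.

```python
from collections import defaultdict


def build_graph(pairs):
    """Build an undirected graph from translation-equivalent pairs.

    Each unit is a language-tagged string such as "en:bank" (an assumed
    convention for this sketch).
    """
    graph = defaultdict(set)
    for a, b in pairs:
        if a != b:
            graph[a].add(b)
            graph[b].add(a)
    return dict(graph)


def maximal_cliques(graph):
    """Enumerate maximal cliques with basic Bron-Kerbosch (no pivoting)."""
    cliques = []

    def expand(r, p, x):
        # r: current clique, p: candidates, x: already-explored vertices
        if not p and not x:
            cliques.append(frozenset(r))
            return
        for v in list(p):
            expand(r | {v}, p & graph[v], x & graph[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(graph), set())
    return cliques


# Toy equivalences, invented for illustration (Arabic units transliterated).
pairs = [
    ("en:bank", "fr:banque"), ("en:bank", "ar:masrif"),
    ("fr:banque", "ar:masrif"),
    ("en:bank", "fr:rive"), ("en:bank", "ar:diffa"),
    ("fr:rive", "ar:diffa"),
    ("en:shore", "fr:rive"), ("en:shore", "ar:diffa"),
]

cliques = maximal_cliques(build_graph(pairs))
for clique in cliques:
    print(sorted(clique))
```

In this toy graph, "en:bank" belongs to two distinct cliques, one with the financial equivalents and one with the riverside equivalents, which is the kind of polysemy signal the abstract describes; units that share a clique are candidates for a common sense.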

Contributor : ABES STAR
Submitted on : Friday, June 21, 2013 - 2:44:10 PM
Last modification on : Friday, March 25, 2022 - 9:44:39 AM
Long-term archiving on : Sunday, September 22, 2013 - 4:09:03 AM


Version validated by the jury (STAR)


  • HAL Id : tel-00836764, version 1



Authoul Abdulhay. Constitution d'une ressource sémantique arabe à partir d'un corpus multilingue aligné. Linguistique. Université de Grenoble, 2012. Français. ⟨NNT : 2012GRENL003⟩. ⟨tel-00836764⟩


