Skip to Main content Skip to Navigation
Theses

Constitution d'une ressource sémantique arabe à partir d'un corpus multilingue aligné

Abstract : This study aims at the implementation and evaluation of techniques for extracting semantic relations from a multilingual aligned corpus. Firstly, our observations will focus on the semantic comparison of translational equivalents in multilingual aligned corpus. From these equivalences, we will try to extract "cliques", which ara maximum complete related sub-graphs, where all units are interrelated because of a probable semantic intersection. These cliques have the advantage of giving information on both the synonymy and polysemy of units, and providing a form of semantic disambiguation. Secondly, we attempt to link these cliques with a semantic lexicon (like WordNet) in order to assess the possibility of recovering, for the Arabic units, a semantic relationships already defined for English, French or Spanish units. These relations would automatically build a semantic resource which would be useful for different applications of NLP, such as Question Answering systems, machine translation, alignment systems, Information Retrieval…etc.
Document type :
Theses
Complete list of metadatas

Cited literature [151 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00836764
Contributor : Abes Star :  Contact
Submitted on : Friday, June 21, 2013 - 2:44:10 PM
Last modification on : Thursday, November 19, 2020 - 3:40:11 PM
Long-term archiving on: : Sunday, September 22, 2013 - 4:09:03 AM

File

18576_ABDULHAY_2012_archivage....
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00836764, version 1

Collections

Citation

Authoul Abdulhay. Constitution d'une ressource sémantique arabe à partir d'un corpus multilingue aligné. Linguistique. Université de Grenoble, 2012. Français. ⟨NNT : 2012GRENL003⟩. ⟨tel-00836764⟩

Share

Metrics

Record views

1206

Files downloads

6187