Skip to Main content Skip to Navigation

Méthodes d'acquisition terminologique en arabe : Application au domaine médical

Abstract : The goal of this thesis is to reduce the lack of available resources and NLP tools for Arabic language in specialised domains by proposing methods allowing the extraction of terms from texts in Modern Standard Arabic. In this context, we first constructed an English-Arabic parallel corous in a specific domain.It is a set of medical texts produced by the US National Library of Medicine (NLM). Thereafter, we have proposed terminological acquisition methods, toextract terms or acquire relations between these terms, for Arabic based on: i) the adaptation of an existing terminology extractor for French or English, ii) the transliteration of English terms in Arabic characters and iii) cross-lingual transfer. Applied at the terminological level, transfer aims to implement a process of term extraction or relationship acquisition between terms in the texts of a source language (here, French or English) and then to transfer the extracted information to target language texts (in this case, Modern Standard Arabic), thereby identifying the same type of terminologicalinformation. We have evaluated the monolingual and bilingual term lists that we have obtained by the experiments we carried out, according to a transparent, direct and semi-automatic method: the extracted term candidates are confronted with a reference terminology before being validated manually. This evaluation follows a protocol that we proposed.
Complete list of metadatas

Cited literature [114 references]  Display  Hide  Download
Contributor : Abes Star :  Contact
Submitted on : Tuesday, October 22, 2019 - 3:37:07 PM
Last modification on : Wednesday, October 14, 2020 - 3:41:35 AM
Long-term archiving on: : Thursday, January 23, 2020 - 9:16:50 PM


Version validated by the jury (STAR)


  • HAL Id : tel-02326714, version 1


Wafa Neifar. Méthodes d'acquisition terminologique en arabe : Application au domaine médical. Informatique et langage [cs.CL]. Université Paris-Saclay; Université de Sfax (Tunisie). Faculté des Sciences économiques et de gestion, 2019. Français. ⟨NNT : 2019SACLS085⟩. ⟨tel-02326714⟩



Record views


Files downloads