
Une approche neuronale pour l’analyse d’opinions en arabe (A Neural Approach to Opinion Analysis in Arabic)

Abstract : This thesis addresses Arabic sentiment analysis. Its aim is to determine the overall polarity of a given text written in Modern Standard Arabic (MSA) or dialectal Arabic. This research area has been the subject of numerous studies on Indo-European languages, English in particular. One of the difficulties facing this thesis is the processing of Arabic itself. Arabic is a morphologically rich language, which implies greater data sparsity; we aim to overcome this problem by producing, in a fully automatic way, new Arabic-specific embeddings. Our study focuses on the use of a neural approach, based on embeddings, to improve polarity detection. Embeddings have proved fundamental in various natural language processing (NLP) tasks. Our contribution spans several axes. First, we begin with a preliminary study of the existing pre-trained word embedding resources for Arabic. These embeddings treat words as space-separated units in order to capture semantic and syntactic similarities in the embedding space. Second, we focus on the specificity of the Arabic language. We propose Arabic-specific embeddings that take into account the agglutination and morphological richness of Arabic. These specific embeddings were used, alone and in combination, as input to neural networks, yielding an improvement in classification performance. Finally, we evaluate the embeddings with intrinsic and extrinsic methods specific to the sentiment analysis task. For intrinsic evaluation, we propose a new protocol that introduces the notion of sentiment stability in the embedding space. We also propose a qualitative extrinsic analysis of our embeddings using visualisation methods.
Contributor : ABES STAR
Submitted on : Monday, December 21, 2020 - 10:53:06 AM
Last modification on : Wednesday, January 26, 2022 - 5:33:16 PM


Version validated by the jury (STAR)


  • HAL Id : tel-03084468, version 1


Amira Barhoumi. Une approche neuronale pour l’analyse d’opinions en arabe. Informatique et langage [cs.CL]. Le Mans Université; Université de Sfax (Tunisie), 2020. Français. ⟨NNT : 2020LEMA1022⟩. ⟨tel-03084468⟩


