Automatic, adaptive, and applicative sentiment analysis

Abstract : Sentiment analysis is a challenging task in computational linguistics today. With the rise of the social Web, both research and industry are interested in the automatic processing of opinions in text. In this work, we assume a multilingual and multidomain environment and aim at automatic and adaptive polarity classification.

We propose a method for the automatic construction of multilingual affective lexicons from microblogging to compensate for the lack of lexical resources. To test our method, we collected over 2 million messages from Twitter, the largest microblogging platform, and constructed affective resources in English, French, Spanish, and Chinese.

We propose a text representation model based on dependency parse trees to replace the traditional n-gram model. In our model, we use dependency triples to form n-gram-like features. We believe this representation recovers the information that is lost when the bag-of-words approach assumes independence of words.

Finally, we investigate the impact of entity-specific features on the classification of minor opinions and propose normalization schemes for improving polarity classification. The proposed normalization schemes give more weight to terms expressing sentiment and lower the importance of noisy features.

The effectiveness of our approach has been demonstrated in experimental evaluations that we performed across multiple domains (movies, product reviews, news, blog posts) and multiple languages (English, French, Russian, Spanish, Chinese), including official participation in several international evaluation campaigns (SemEval'10, ROMIP'11, I2B2'11).
Document type :
Theses

https://tel.archives-ouvertes.fr/tel-00717329
Contributor : Abes Star
Submitted on : Thursday, July 12, 2012 - 3:22:23 PM
Last modification on : Monday, December 14, 2020 - 9:55:24 AM
Long-term archiving on : Saturday, October 13, 2012 - 2:35:17 AM

Identifiers

  • HAL Id : tel-00717329, version 1

Citation

Alexander Pak. Automatic, adaptive, and applicative sentiment analysis. Other [cs.OH]. Université Paris Sud - Paris XI, 2012. English. ⟨NNT : 2012PA112101⟩. ⟨tel-00717329⟩
