Skip to Main content Skip to Navigation

Representation learning of user-generated data

Mickael Poussevin 1 
1 MLIA - Machine Learning and Information Access
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : In this thesis, we study how representation learning methods can be applied to user-generated data. Our contributions cover three different applications but share a common denominator: the extraction of relevant user representations. Our first application is the item recommendation task, where recommender systems build user and item profiles out of past ratings reflecting user preferences and item characteristics. Nowadays, textual information is often together with ratings available and we propose to use it to enrich the profiles extracted from the ratings. Our hope is to extract from the textual content shared opinions and preferences. The models we propose provide another opportunity: predicting the text a user would write on an item. Our second application is sentiment analysis and, in particular, polarity classification. Our idea is that recommender systems can be used for such a task. Recommender systems and traditional polarity classifiers operate on different time scales. We propose two hybridizations of these models: the former has better classification performance, the latter highlights a vocabulary of surprise in the texts of the reviews. The third and final application we consider is urban mobility. It takes place beyond the frontiers of the Internet, in the physical world. Using authentication logs of the subway users, logging the time and station at which users take the subway, we show that it is possible to extract robust temporal profiles.
Document type :
Complete list of metadata

Cited literature [156 references]  Display  Hide  Download
Contributor : ABES STAR :  Contact
Submitted on : Friday, January 22, 2016 - 1:01:29 AM
Last modification on : Saturday, July 9, 2022 - 3:28:45 AM
Long-term archiving on: : Saturday, April 23, 2016 - 10:11:30 AM


Version validated by the jury (STAR)


  • HAL Id : tel-01260338, version 1


Mickael Poussevin. Representation learning of user-generated data. Other [cs.OH]. Université Pierre et Marie Curie - Paris VI, 2015. English. ⟨NNT : 2015PA066040⟩. ⟨tel-01260338⟩



Record views


Files downloads