Skip to Main content Skip to Navigation

Prominent microblog users prediction during crisis events : using phase-aware and temporal modeling of users behavior

Abstract : During crisis events such as disasters, the need of real-time information retrieval (IR) from microblogs remains inevitable. However, the huge amount and the variety of the shared information in real time during such events over-complicate this task. Unlike existing IR approaches based on content analysis, we propose to tackle this problem by using user-centricIR approaches with solving the wide spectrum of methodological and technological barriers inherent to : 1) the collection of the evaluated users data, 2) the modeling of user behavior, 3) the analysis of user behavior, and 4) the prediction and tracking of prominent users in real time. In this context, we detail the different proposed approaches in this dissertation leading to the prediction of prominent users who are susceptible to share the targeted relevant and exclusive information on one hand and enabling emergency responders to have a real-time access to the required information in all formats (i.e. text, image, video, links) on the other hand. These approaches focus on three key aspects of prominent users identification. Firstly, we have studied the efficiency of state-of-the-art and new proposed raw features for characterizing user behavior during crisis events. Based on the selected features, we have designed several engineered features qualifying user activities by considering both their on-topic and off-topic shared information. Secondly, we have proposed a phase-aware user modeling approach taking into account the user behavior change according to the event evolution over time. This user modeling approach comprises the following new novel aspects (1) Modeling microblog users behavior evolution by considering the different event phases (2) Characterizing users activity over time through a temporal sequence representation (3) Time-series-based selection of the most discriminative features characterizing users at each event phase. Thirdly, based on this proposed user modeling approach, we train various prediction models to learn to differentiate between prominent and non-prominent users behavior during crisis event. The learning task has been performed using SVM and MoG-HMMs supervised machine learning algorithms. The efficiency and efficacy of these prediction models have been validated thanks to the data collections extracted by our multi-agents system MASIR during two flooding events who have occured in France and the different ground-truths related to these collections.
Document type :
Complete list of metadatas

Cited literature [96 references]  Display  Hide  Download
Contributor : Abes Star :  Contact
Submitted on : Wednesday, December 13, 2017 - 4:44:38 PM
Last modification on : Tuesday, October 20, 2020 - 10:59:19 AM


Version validated by the jury (STAR)


  • HAL Id : tel-01663067, version 1



Imen Bizid. Prominent microblog users prediction during crisis events : using phase-aware and temporal modeling of users behavior. Information Retrieval [cs.IR]. Université de La Rochelle, 2016. English. ⟨NNT : 2016LAROS026⟩. ⟨tel-01663067⟩



Record views


Files downloads