Construction semi-automatique de ressources pour la fouille d'opinion

Abstract: Identifying satisfaction triggers among customers is a crucial need in today's business world, where a strong customer relationship is a vital asset. Opinion mining, the field to which this thesis belongs, offers several methods to meet this need. These methods, however, require continuous updating of the specialized resources that are the cornerstone of many opinion mining tools. The objective of this work is to develop strategies for acquiring and structuring these resources, which can be lexicons, morphosyntactic rules, or annotated data. Each type of resource presents its own extraction difficulties, on top of the general problem of keeping it up to date in a language- or domain-specific setting. Language constraints are fundamental in opinion mining, so the proposed methods must take them into account. First, we study the core elements from which opinion expressions are built in customer feedback. This study leads us to propose a new formulation of opinion mining as a sequence labeling task. We then compare the benefits of each type of resource through a benchmark of several opinion mining methods, and conclude that the best-performing strategy is a hybrid approach. Finally, we present results for resource acquisition methods that meet not only the needs of opinion mining but also the constraints of the industrial setting in which this work was conducted.
Document type:
Thesis
Document and Text Processing. Université de Nantes, Faculté des sciences et des techniques, 2017. French

Cited literature [161 references]

https://tel.archives-ouvertes.fr/tel-01630619
Contributor: Joseph Lark
Submitted on: Tuesday, 7 November 2017 - 19:08:00
Last modified on: Thursday, 19 April 2018 - 11:46:05
Document(s) archived on: Thursday, 8 February 2018 - 15:11:50

File

these-jlark-tel-version.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: tel-01630619, version 1

Citation

Joseph Lark. Construction semi-automatique de ressources pour la fouille d'opinion. Document and Text Processing. Université de Nantes, Faculté des sciences et des techniques, 2017. French. 〈tel-01630619〉

Metrics

Record views

231

File downloads

253