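Discours de presse et veille stratégique d'événements : approche textométrique et extraction d'informations pour la fouille de textes [Press discourse and strategic monitoring of events: a textometric approach and information extraction for text mining]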

Abstract : This research demonstrates two text mining methods for strategic monitoring: information extraction and Textometry. In strategic monitoring, text mining is used to automatically obtain information on the activities of corporations. To this end, information extraction identifies and labels units of information, namely named entities (companies, places, people), which then serve as entry points for analyzing economic activities or events such as mergers, bankruptcies, and partnerships involving the corresponding corporations. Textometry, by contrast, uses several statistical models to study the distribution of words in large corpora, with the goal of bringing out significant characteristics of the textual data. In this research, Textometry, an approach traditionally considered incompatible with information extraction methods, is applied to the same corpus as an information extraction procedure in order to obtain information on economic events. Several textometric analyses (characteristic elements, co-occurrences) are carried out on a corpus of online news feeds. The results are then compared with those produced by the information extraction procedure. The two approaches contribute differently to processing textual data and produce complementary analyses of the corpus. Following the comparison, this research presents the advantages of these two text mining methods for the strategic monitoring of current events.
Cited literature : 151 references

https://tel.archives-ouvertes.fr/tel-00740601
Contributor : Erin Macmurray
Submitted on : Wednesday, October 10, 2012 - 3:10:22 PM
Last modification on : Friday, October 23, 2020 - 4:34:14 PM
Long-term archiving on : Friday, December 16, 2016 - 11:11:28 PM

Identifiers

  • HAL Id : tel-00740601, version 1

Citation

Macmurray Erin. Discours de presse et veille stratégique d'événements Approche textométrique et extraction d'informations pour la fouille de textes. Linguistique. Université de la Sorbonne nouvelle - Paris III, 2012. Français. ⟨tel-00740601⟩

Metrics

Record views

1182

File downloads

24006