Extraction de Séquences Fréquentes : Des Bases de Données Statiques aux Flots de Données

Chedy Raïssi 1
1 TATOO - Fouille de données environnementales
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : With the constant progress in software and hardware technologies, real-world databases keep growing larger, and the hypothesis that a database can be efficiently analyzed by a human being is no longer valid. To cope with this challenge, a set of methods and technologies (i.e., Knowledge Discovery and Data Mining, or KDD) was developed to enable data exploration, data analysis and data visualisation of very large databases at a high level of abstraction. Furthermore, the data stream model, which treats data as a continuous and potentially infinite flow, calls for new approaches to pre-processing and data mining operations.
In this thesis, we focus on one specific data mining method: sequential pattern mining. This method aims at discovering correlations among events through time. We present theoretical results on the possible concise representations for sequences. We also introduce a new type of pattern, the conjunctive sequential pattern. Sampling methods and approaches for sequential pattern mining over both data streams and static databases are introduced and discussed. Moreover, we define an algorithm for discovering multidimensional sequential patterns. These approaches are validated with experiments on real data.
Document type :
Theses
https://tel.archives-ouvertes.fr/tel-00351626
Contributor : Martine Peridier
Submitted on : Friday, January 9, 2009 - 4:32:59 PM
Last modification on : Thursday, May 24, 2018 - 3:59:22 PM
Long-term archiving on : Tuesday, June 8, 2010 - 7:15:20 PM

Identifiers

  • HAL Id : tel-00351626, version 1

Citation

Chedy Raïssi. Extraction de Séquences Fréquentes : Des Bases de Données Statiques aux Flots de Données. Informatique [cs]. Université Montpellier II - Sciences et Techniques du Languedoc, 2008. Français. ⟨tel-00351626⟩
