
Utilisation des modèles de co-clustering pour l'analyse exploratoire des données (Using co-clustering models for exploratory data analysis)

Abstract : Co-clustering is a clustering technique that simultaneously partitions the rows and the columns of a data matrix. Among the existing approaches, MODL is suited to processing very large data sets with several continuous or categorical variables, and we use it as the baseline approach in this thesis. We discuss how reliably such an approach can be applied to data mining problems such as graph partitioning, temporal graph segmentation, and curve clustering. MODL tracks very fine patterns in very large data sets, which makes the results difficult to study. To help the user interpret them, we define exploratory analysis tools that simplify the results so that an overall interpretation becomes possible, track the most interesting patterns, determine the most representative values of the clusters, and visualize the results. We investigate the asymptotic behavior of these tools in order to connect them with existing approaches. Finally, we demonstrate the value of MODL and the exploratory analysis tools through an application to call detail records from the telecom operator Orange, collected in Ivory Coast.
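To make the notion of simultaneously partitioning rows and columns concrete, here is a minimal sketch in Python. It is not MODL and does not attempt to choose the partitions. It assumes that a row partition and a column partition are already given and only counts how many observations fall into each resulting block (co-cluster). The function name `block_counts` and the toy caller/callee data are hypothetical and are not taken from the thesis.

```python
from collections import Counter


def block_counts(pairs, row_cluster, col_cluster):
    """Count the (row, column) observations that fall into each co-cluster.

    pairs       -- iterable of (row_value, column_value) observations
    row_cluster -- dict mapping each row value to its row-cluster label
    col_cluster -- dict mapping each column value to its column-cluster label
    """
    counts = Counter()
    for row, col in pairs:
        counts[(row_cluster[row], col_cluster[col])] += 1
    return counts


# Hypothetical toy data: each pair is one observed (caller, callee) event.
pairs = [("a", "x"), ("a", "y"), ("b", "x"), ("c", "z"), ("d", "z"), ("d", "z")]

# Hand-picked partitions, assumed for illustration only.
row_cluster = {"a": 0, "b": 0, "c": 1, "d": 1}
col_cluster = {"x": 0, "y": 0, "z": 1}

counts = block_counts(pairs, row_cluster, col_cluster)
for block in sorted(counts):
    print(block, counts[block])
```

With these hand-picked partitions, all observations concentrate in the two diagonal blocks, which is the kind of block structure a co-clustering method looks for when it searches over partitions.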


https://tel.archives-ouvertes.fr/tel-00935278
Contributor : Romain Guigourès
Submitted on : Thursday, January 23, 2014 - 12:20:23 PM
Last modification on : Sunday, January 19, 2020 - 6:38:32 PM
Long-term archiving on : Thursday, April 24, 2014 - 2:20:15 AM

Identifiers

  • HAL Id : tel-00935278, version 1

Citation

Romain Guigourès. Utilisation des modèles de co-clustering pour l'analyse exploratoire des données. Applications [stat.AP]. Université Panthéon-Sorbonne - Paris I, 2013. Français. ⟨tel-00935278⟩

