Skip to Main content Skip to Navigation
Theses

Représentation et gestion des connaissances dans un processus d'Extraction de Connaissances à partir de Données multi-points de vue

El Moukhtar Zemmouri 1
1 Modélisation et Extaction de Connaissances - ModEC
LM2I - Laboratoire de Modélisation Mathématique et Informatique
Abstract : In recent decades, enterprises' information systems become more and more flooded by all kind of data: structured (databases, data warehouse), semi-structured (XML, server log files), and unstructured data (raw text, multimedia data). This has created new challenges for companies and for the scientific community. Including, how to understand and analyze such a mass of data to extract knowledge. Moreover, in an organization, a data mining project is usually conducted by several experts (domain experts, KDD experts, data experts...) who consequently manipulate several types of knowledge and know-how. They will have different objectives and preferences, different competences, and different visions of analyzed data and of KDD methods. Our objective in this thesis is to facilitate the KDD analyst task, and to improve coordination and comprehensibility between the different actors in a multi-view analysis as well as the reuse of KDD process in terms of viewpoints. Therefore, we propose a definition that makes explicit the notion of viewpoint in KDD and includes domain knowledge (analyzed domain and analyst domain) and context of analysis. Based on this definition, we propose the development of a set of semantic models that are structured in a Conceptual Model and allowing knowledge representation and management during a multi-view analysis. Our approach is based on a multi-criteria characterization of viewpoint in KDD. A characterization that is primarily designed to capture the objectives and context of analysis of the expert, guide the construction and execution of the KDD process, and then keep the trace, in the form of annotations, of reasoning made during a collaborative work. These annotations can be shared, compared and reused based on a set of semantic relations between viewpoints.
Complete list of metadata

Cited literature [152 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00940780
Contributor : El Moukhtar Zemmouri Connect in order to contact the contributor
Submitted on : Sunday, February 2, 2014 - 9:08:57 PM
Last modification on : Friday, October 23, 2020 - 4:45:56 PM
Long-term archiving on: : Sunday, April 9, 2017 - 5:43:23 AM

Identifiers

  • HAL Id : tel-00940780, version 1

Citation

El Moukhtar Zemmouri. Représentation et gestion des connaissances dans un processus d'Extraction de Connaissances à partir de Données multi-points de vue. Apprentissage [cs.LG]. Ecole Nationale Supérieure d'Arts et Métiers - Meknès, 2013. Français. ⟨tel-00940780⟩

Share

Metrics

Record views

649

Files downloads

3174