Construction et utilisation d'une base de connaissances pharmacogénomique pour l'intégration de données et la découverte de connaissances

Adrien Coulet 1
1 ORPAILLEUR - Knowledge representation, reasonning
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This thesis studies the use of ontology and knowledge base for guiding various steps of the Knowledge Discovery in Databases (KDD) process in the domain of pharmacogenomics.
Data related to this domain are heterogeneous, complex, and disseminated through several data sources. Consequently, the preliminary step that consists in the preparation and the integration of data is crucial. For guiding this step, an original approach is proposed, based on a knowledge representation of the domain within two ontologies in description logics: SNP-Ontology and SO-Pharm. This approach has been implemented using semantic Web technologies and leads finally to populating a pharmacogenomic knowledge base. As a result, data to analyze are represented in the knowledge base, which is a benefit for guiding following steps of the knowledge discovery process. Firstly, I study this benefit for feature selection by illustrating how the knowledge base can be used for this purpose. Secondly, I describe and apply to pharmacogenomics a new method named Role Assertion Analysis (or RAA) that enables knowledge discovery directly from knowledge bases. This method uses data mining algorithms over assertions of our pharmacogenomic knowledge base and results in the discovery of new and relevant knowledge.
Document type :
Theses
Interface homme-machine [cs.HC]. Université Henri Poincaré - Nancy I, 2008. Français
Liste complète des métadonnées

Cited literature [169 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00332407
Contributor : Adrien Coulet <>
Submitted on : Monday, October 20, 2008 - 8:43:11 PM
Last modification on : Tuesday, October 25, 2016 - 5:02:33 PM
Document(s) archivé(s) le : Tuesday, October 9, 2012 - 2:05:10 PM

Identifiers

  • HAL Id : tel-00332407, version 1

Collections

Citation

Adrien Coulet. Construction et utilisation d'une base de connaissances pharmacogénomique pour l'intégration de données et la découverte de connaissances. Interface homme-machine [cs.HC]. Université Henri Poincaré - Nancy I, 2008. Français. 〈tel-00332407〉

Share

Metrics

Record views

967

Files downloads

3174