Probabilistic relational models: learning and evaluation

Abstract : Statistical relational learning (SRL) appeared in the early 2000s as a new field of machine learning that enables effective and robust reasoning about relational data structures. Several conventional data mining methods have been adapted for direct application to relational data representation. Relational Bayesian Networks (RBNs) extend Bayesian networks (BNs) to a relational data mining context. To use this model, it is first necessary to build it: the structure and parameters of a RBN must be set manually or learned from a relational observational dataset. Learning the structure remains the most complicated issue as it is a NP-hard problem. Existing approaches for RBNs structure learning are inspired from classical methods of learning the structure of BNs. The evaluation of learning approaches requires testing datasets and evaluation measurements. For BNs, datasets are usually sampled from real known networks. Otherwise, processes to randomly generate the model and the data are already established. Both practices are almost absent for RBR. Moreover, metrics to evaluate a RBN structure learning algorithm are not yet proposed. This thesis provides two major contributions. I) A synthetic approach allowing to generate random RBNs from scratch. The proposed method allows to generate RBNs as well as synthetic relational data from a randomly generated relational schema and a random set of probabilistic dependencies. Also, we discuss the adaptation of the evaluation metrics of BNs structure learning algorithms to the relational context and we propose new relational evaluation measurements. II) A hybrid approach for RBNs structure learning. This approach presents an extension of the MMHC algorithm in the relational context. We present an experimental study to compare this new learning algorithm with the state-of-the-art approaches.
Document type :
Theses
Complete list of metadatas

Cited literature [144 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01179501
Contributor : Mouna Ben Ishak <>
Submitted on : Wednesday, July 22, 2015 - 4:12:49 PM
Last modification on : Thursday, April 5, 2018 - 10:36:49 AM
Long-term archiving on : Friday, October 23, 2015 - 11:20:55 AM

Identifiers

  • HAL Id : tel-01179501, version 1

Collections

Citation

Mouna Ben Ishak. Probabilistic relational models: learning and evaluation. Computer Science [cs]. Université de Nantes, Ecole Polytechnique; Université de Tunis, Institut Supérieur de Gestion de Tunis, 2015. English. ⟨tel-01179501⟩

Share

Metrics

Record views

573

Files downloads

1696