46 3.5.2 SQL implementation for computing contingency table ,
,
,
,
,
,
, A.3.1 Existing libraries
, Its implementation follows a generic programming principle and its source can be found as part of the Boost distribution 4. BGL is characterized by its easy of use and integration in any program: no need to be built to be used, wealth of documentation and multiple code examples. It consists of a set of core algorithm patterns, namely, Breadth First Search, Depth First Search and Uniform Cost Search, and a set set of graph algorithms, The Boost Graph Library (BGL): is a C++ open source library that provides a generic open interface for traversing graphs
, It also allows to perform more other requests such as creating schemas, tables, constraints, etc. dtl is well documented and a variety of examples are given and commented. Moreover, instructions for using the library are provided and precision on how to use it with each DBMS is given 5. Googletest: Released under the BSD 3-clause license 6 , Google Test presents a library for writing C++ unit tests. It works on a variety of platforms and can be easily integrated to any c++ program. The library allows several test types and several options for running the tests 7, Database Template Library (dtl): dtl is a C++ open source library. The specificity of this library is that it can run on multiple platforms and C++ compilers
, A.3.2 Additional libraries
This Microsoft open source project is evolving in CodePlex 8 and takes advantage of the new set of capabilities introduced in C++. Microsoft developed the C++ REST SDK on top of the Parallel Patterns Library (PPL), and leverages PPL's task-based programming model. It enables you to stay in C++ when consuming REST services or developing other code closely related to the cloud. Such as, making calls to a synchronous API to make an HTTP GET call ,
, a text file) is used. For relational probabilistic models, relational data representation is needed. For the second case, the PostgreSQL Relational database management system has been used. Accordingly, to our contribution; learning RBN from graph database so we added Neo4j as graph database management system to deal with. PostgreSQL: is an open-source object-relational database management system. Initially created at the University of California at Berkeley, PostgreSQL is now considered among the most advanced open-source database. It supports a large part of the SQL standard and provides the possibility to be used, modified, and distributed by anyone free of charge for any purpose, structure learning) involve datasets as input. For standard probabilistic models, flat data representation
Searching for Bayesian network structures in the space of restricted acyclic partially directed graphs, J. Artif. Intell. Res. (JAIR), vol.18, issue.23, pp.445-490, 2003. ,
Lifted belief propagation: Pairwise marginals and beyond, Proceedings of the 5th European Workshop on Probabilistic Graphical Models (PGM-10), p.46, 2010. ,
Survey of graph database models, ACM Comput. Surv, vol.40, issue.1, 2008. ,
Integer linear programming for the bayesian network structure learning problem, Artif. Intell, vol.244, pp.258-271, 2017. ,
Probabilistic relational models: learning and evaluation, vol.40, p.117, 2015. ,
URL : https://hal.archives-ouvertes.fr/tel-01179501
Probabilistic relational model benchmark generation, vol.20, p.97, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01273307
Statistical analysis of non-lattice data, Journal of the Royal Statistical Society. Series D (The Statistician), vol.24, issue.3, pp.179-195, 1975. ,
Discriminative structure learning of Markov Logic Networks, ILP, ILP '08, vol.46, pp.59-76, 2008. ,
Structure learning of Markov Logic Networks through iterated local search, 18th European Conference on Artificial Intelligence ECAI2008, vol.44, pp.361-365, 2008. ,
Pattern Recognition and Machine Learning, vol.4, 2006. ,
Graphical models-methods for data analysis and mining, p.22, 2002. ,
Towards robust distributed systems (abstract), Proceedings of the Nineteenth Annual ACM Symposium on Principles of Distributed Computing, PODC '00, vol.54, p.55, 2000. ,
Theory refinement in Bayesian networks, Proceedings of the 7th Conference on Uncertainty in Artificial Intelligence, p.22, 1991. ,
Scalable SQL and NoSQL data stores, SIGMOD Rec, vol.39, pp.12-27, 2011. ,
Optimal structure identification with greedy search, J. Mach. Learn. Res, vol.3, p.22, 2003. ,
Learning Bayesian networks: Search methods and experimental results, Preliminary papers of the 5th International Workshop on Artificial Intelligence and Statistics, vol.22, pp.112-128, 1995. ,
Query optimization for dynamic graphs, vol.3745, pp.1-13, 2014. ,
Approximating discrete probability distributions with dependence trees, IEEE Trans. on Information Theory, vol.14, issue.3, p.26, 1968. ,
A new horizon for the recommendation: Integration of spatial dimensions to aid decision making, vol.110, p.117, 2016. ,
URL : https://hal.archives-ouvertes.fr/tel-01422348
Using probabilistic relational models to generate synthetic spatial or non-spatial databases, Proceedings of IEEE 12th International Conference on Research Challenges in Information Science (IEEE RCIS'2018), vol.78, p.80, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01761901
A relational model of data for large shared data banks, Commun. ACM, vol.13, issue.6, pp.377-387, 1970. ,
Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms, Proceedings of the ACL-02 conference on Empirical methods in natural language processing, vol.10, p.26, 2002. ,
The computational complexity of probabilistic inference using Bayesian belief networks, Artificial Intelligence, vol.42, issue.2-3, pp.393-405, 1992. ,
A Bayesian method for the induction of probabilistic networks from data, Machine Learning, vol.9, pp.309-347, 1992. ,
Probabilistic Relational Models and Reference Uncertainty, p.117, 2015. ,
URL : https://hal.archives-ouvertes.fr/tel-01254524
A graphical query language supporting recursion, SIGMOD Conference, vol.56, p.57, 1987. ,
BASE: An ACID alternative. Queue, vol.6, p.54, 2008. ,
A differential approach to inference in Bayesian networks, 2013. ,
Graph-based approximate counting for relational probabilistic models, Working Notes of the 5th International Workshop on Statistical Relational AI (StarAI@UAI), vol.47, p.62, 2015. ,
Learning polytrees, UAI, p.26, 1999. ,
A hybrid anytime algorithm for the construction of causal models from sparse data, UAI, p.23, 1999. ,
An Introduction to Database Systems, p.32, 2003. ,
The Relational Database Dictionary, p.32, 2008. ,
Lifted First Order Probabilistic Inference, IJCAI, vol.48, pp.1319-1325, 2005. ,
MPE and partial inversion in lifted probabilistic variable elimination, Proceedings of the 21sth National Conference on Artificial Intelligence (AAAI), pp.1-8, 2006. ,
Anytime lifted belief propagation, vol.46, pp.10-13, 2009. ,
Model-Driven Design of Graph Databases, vol.63, p.70, 2014. ,
Graph Theory, p.12, 2005. ,
Statistical relational learning : Structure learning for Markov logic networks. (Apprentissage statistique relationnel : apprentissage de structures de réseaux de Markov logiques), vol.32, p.94, 2011. ,
URL : https://hal.archives-ouvertes.fr/tel-00659738
Markov Logic: An Interface Layer for AI, p.46, 2009. ,
Markov logic: A unifying framework for statistical relational learning, Proceedings of the ICML-2004 Workshop on Statistical Relational Learning and its Connections to Other Fields, vol.92, pp.49-54, 2004. ,
A tractable first-order probabilistic logic, Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, AAAI'12, vol.32, p.35, 2012. ,
Who killed the directed model?, Computer Vision and Pattern Recognition, p.28, 2008. ,
Probability: Theory and Examples, 2009. ,
NoSQL: Einstieg in die Welt nichtrelationaler Web 2.0 Datenbanken, 2011. ,
Daper learning from (partially structured) graph database, 14th ACS/IEEE International Conferenceon Computer Systems and Applications AICCSA, p.68, 2017. ,
Daper joint learning from partially structured graph databases, 3rd International Conference on Digital Economy ICDEc, p.92, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01804057
, How graph databases solve problems in network & data center management: a close look at two deployments, EMA, p.61, 2013.
Incremental graph pattern matching, ACM Trans. Database Syst, vol.38, issue.3, 2013. ,
Learning probabilistic relational models, IJCAI, vol.5, p.73, 1999. ,
Learning Bayesian network structure from massive datasets: The "sparse candidate" algorithm, UAI, p.23, 1999. ,
Learning Statistical Models from Relational Data, vol.47, p.93, 2001. ,
Probabilistic relational models, chapter Introduction to Statistical Relational Learning, p.39, 2007. ,
Graph database systems, Engineering in Medicine and Biology Magazine, vol.14, pp.737-745, 2002. ,
Survey on NoSQL database, Pervasive Computing and Applications (ICPCA), vol.54, pp.363-366, 2011. ,
Learning Bayesian networks: The combination of knowledge and statistical data, Machine Learning, vol.21, pp.197-243, 1995. ,
Probabilistic entity-relationship models, prms, and plate models. ICML, vol.5, p.68, 2004. ,
Nosql database: A scalable, availability, high performance storage for big data, Pervasive Computing and the Networked World, vol.8351, p.53, 2014. ,
Neo4j: The world's leading graph database, vol.63, 2012. ,
The Defnitive Guide to Graph Databases for the RDBMS Developer, vol.127, p.129, 2016. ,
Discriminative structure and parameter learning for Markov Logic Networks, Proceedings of the 25th International Conference on Machine Learning (ICML), pp.1-8, 2008. ,
Bayesian Networks and Decision Graphs, p.28, 2007. ,
An empirical comparison of graph databases, SocialCom, p.63, 2013. ,
A study of approximate inference in probabilistic relational models, JMLR.org, vol.13, pp.315-330, 2010. ,
Counting Belief Propagation, Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, UAI '09, p.46, 2009. ,
Relaxation of acid properties in autra, the adaptive user-defined transaction relaxing approach, Future Generation Comp. Syst, vol.27, issue.1, pp.58-66, 2011. ,
Structure learning for Markov Logic Networks with many descriptive attributes, Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, p.44, 2010. ,
A computational model for combined causal and diagnostic reasoning in inference systems, Proceedings of the Eighth International Joint Conferences on Artificial Intelligence (IJCAI-1983), p.27, 1983. ,
Markov random fields and their applications. American mathematical society, p.26, 1980. ,
Learning structure of Markov Logic Networks, Proceedings of the 22Nd International Conference on Machine Learning, ICML '05, vol.40, p.98, 2005. ,
DOI : 10.1145/1102351.1102407
Learning Markov Logic Networks structure via hypergraph lifting, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, p.44, 2009. ,
DOI : 10.1145/1553374.1553440
Learning Markov Logic Networks using structural motifs, Proceedings of the 27th International Conference on Machine Learning (ICML-10), vol.44, pp.551-558, 2010. ,
Probabilistic Graphical Models: Principles and TechniquesAdaptive Computation and Machine Learning, vol.4, p.28, 2009. ,
Probabilistic frame-based systems, AAAI/IAAI, vol.5, p.32, 1998. ,
Semantics and inference for recursive probability models, Proc. 17th National Conference on Artificial Intelligence (AAAI-00), vol.48, pp.538-544, 2000. ,
Local Computations with Probabilities on Graphical Structures and Their Application to Expert Systems, Journal of the Royal Statistical Society, Series B, vol.50, issue.2, pp.157-224, 1988. ,
Finding multi-criteria optimal paths in multi-modal public transportation networks using the transit algorithm, p.61, 2012. ,
ACID transaction, Encyclopedia of Database Systems, p.35, 2009. ,
An empirical comparison of the summarization power of graph clustering methods, 2015. ,
Improving Markov network structure learning using decision trees, Journal of Machine Learning Research, vol.15, issue.1, pp.501-532, 2014. ,
DOI : 10.1109/icdm.2010.128
URL : http://ix.cs.uoregon.edu/%7Elowd/icdm10lowd.pdf
Efficient weight learning for Markov Logic Networks, Proceedings of the 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2007, vol.44, pp.200-211, 2007. ,
DOI : 10.1007/978-3-540-74976-9_21
URL : https://link.springer.com/content/pdf/10.1007%2F978-3-540-74976-9_21.pdf
A sound and complete algorithm for learning causal models from relational data, vol.80, p.97, 2013. ,
Reasoning about independence in probabilistic models of relational data, 2013. ,
Learning causal models of relational domains, Proceedings of the Twenty-fourth National Conference on Artificial Intelligence, p.39, 2010. ,
Learning Bayesian Network Model Structure From Data, vol.19, p.20, 2003. ,
Efficiently inducing features of conditional random fields, Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI03), p.27, 2003. ,
Bottom-up learning of Markov logic network structure, ICML, vol.27, p.44, 2007. ,
Lifted Probabilistic Inference with Counting Formulas, Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, AAAI, vol.46, pp.1062-1068, 2008. ,
Cached Sufficient Statistics for Efficient Machine Learning with Large Datasets, Journal of Artificial Intelligence Research (JAIR), vol.8, pp.67-91, 1998. ,
Inductive logic programming: Theory and methods, The Journal of Logic Programming, vol.19, issue.0, pp.629-679, 1994. ,
, , p.18, 2011.
Relational dependency networks, Introduction to Statistical Relational Learning, vol.5, p.36, 2005. ,
Factorbase: Sql for learning a multi-relational graphical model, vol.48, p.62, 2015. ,
Learning the network structure of heterogeneous data via pairwise exponential markov random fields, Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, vol.27, pp.1302-1310, 2017. ,
Neo4j in Action, vol.6, p.63, 2014. ,
Reverend Bayes on inference engines: A distributed hierarchical approach, Proceedings AAAI Natinal Conference on AI, p.27, 1982. ,
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, vol.4, p.23, 1988. ,
Semantics and inference for recursive probability models, AAAI/IAAI, p.38, 2000. ,
Inducing features of random fields, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.4, pp.380-393, 1997. ,
First-Order Probabilistic Inference, IJCAI, vol.46, pp.985-991, 2003. ,
Sound and efficient inference with probabilistic and deterministic dependencies, Proceedings of the 21st National Conference on Artificial Intelligence, AAAI'06, p.48, 2006. ,
Probabilistic inductive logic programming, Probabilistic Inductive Logic Programming, vol.32, p.35, 2008. ,
Problog: A probabilistic prolog and its application in link discovery, vol.110, pp.2462-2467, 2007. ,
Probability theory, 2007. ,
Graph patterns : structure, query answering and applications in schema mappings and formal language theory, EThOS, p.58, 2013. ,
Markov Logic Networks, Machine Learning, vol.62, p.44, 2006. ,
From network mining to large scale business networks, Proceedings of the 21st International Conference on World Wide Web, WWW '12 Companion, p.61, 2012. ,
Unraveling protein networks with power graph analysis, PLoS Computational Biology, vol.4, issue.7, p.61, 2008. ,
On directly mapping relational databases to rdf and owl, Proceedings of the 21st International Conference on World Wide Web, WWW '12, vol.63, p.70, 2012. ,
Performance evaluations of graph database using cuda and openmp compatible libraries, SIGARCH Computer Architecture News, vol.42, issue.4, pp.75-80, 2014. ,
An algorithm for the construction of Bayesian network structures from data, UAI, p.23, 1993. ,
Construction of bayesian network structures from data: A brief survey and an efficient algorithm, International Journal of Approximate Reasoning, vol.12, issue.2, pp.111-131, 1995. ,
Lifted first-order belief propagation, AAAI, vol.46, p.48, 2008. ,
First-Order Logic, p.32, 1968. ,
Causality from probability, Evolving Knowledge in the Natural and Behavioral Sciences, 1990. ,
Causation, Prediction, and Search, vol.20, p.39, 2000. ,
Maximum likelihood bounded tree-width Markov networks, Artif. Intell, vol.143, issue.1, pp.123-138, 2003. ,
Big data analytics for healthcare, vol.5, p.52, 2013. ,
Relational markov networks, Introduction to Statistical Relational Learning, vol.5, p.38, 2007. ,
The max-min hill-climbing bayesian network structure learning algorithm, Mach. Learn, vol.65, issue.1, p.40, 2006. ,
Lifted probabilistic inference by first-order knowledge compilation, Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI), vol.46, pp.2178-2185, 2011. ,
Equivalence and synthesis of causal models, UAI, vol.18, p.20, 1990. ,
A comparison of a graph database and a relational database: A data provenance perspective, The 48th Annual Southeast Regional Conference, vol.47, p.63, 2010. ,
High-dimensional graphical model selection using l1-regularized logistic regression, NIPS, p.27, 2006. ,
The top 5 use cases of graph databases. Technical report, Neo4j, the 1 Platform for Connected Data, p.61, 2012. ,
Towards efficient sampling: Exploiting random walk strategies, AAAI, p.48, 2004. ,
Query languages for graph databases, SIGMOD Rec, vol.41, issue.1, 2012. ,
Modeling and reasoning with Bayesian networks, Artif. Intell, vol.174, issue.2, pp.147-151, 2010. ,
Towards effective partition management for large graphs, SIGMOD Conference, p.63, 2012. ,
Incremental Bayesian network structure learning from data streams, p.110, 2013. ,
URL : https://hal.archives-ouvertes.fr/tel-01284332
Generalized belief propagation, Advances in Neural Information Processing Systems (NIPS), vol.13, p.28, 2000. ,
Exploiting causal independence in Bayesian network inference, JAIR, vol.5, p.48, 1996. ,