Actual Web mining and Web data management applications: ? Web graph mining ? wrapper induction ? Web crawling Research on the foundations of Web data management Plan: bridge the two Use formal models for data exchange [Senellart and Gottlob Gottlob and Senellart, 2010] to perform wrapper induction or ontology matching Use static analysis techniques for JavaScript to perform deep Web data extraction Use results of formal studies of the containment of recursive query languages, Pierre Senellart DBWeb Formal Models for Web Content Acquisition Current mismatch between to optimize query answering over the deep Web Questions, p.2012, 2008. ,
A probabilistic XML merging tool Demonstration, Proc. EDBT, pp.538-541, 2011. ,
DOI : 10.1145/1951365.1951435
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.186.8080
Querying and Updating Probabilistic Information in XML, Proc. EDBT, pp.1059-1068, 2006. ,
DOI : 10.1007/11687238_62
URL : https://hal.archives-ouvertes.fr/inria-00106788
On the expressiveness of probabilistic XML models, The VLDB Journal, vol.31, issue.4, pp.1041-1064, 2009. ,
DOI : 10.1007/s00778-009-0146-1
URL : https://hal.archives-ouvertes.fr/inria-00429498
Aggregate queries for discrete and continuous probabilistic XML, Proceedings of the 13th International Conference on Database Theory, ICDT '10, pp.50-61, 2010. ,
DOI : 10.1145/1804669.1804679
URL : https://hal.archives-ouvertes.fr/inria-00537632
Capturing continuous data and answering aggregate queries in probabilistic XML, ACM Transactions on Database Systems, vol.36, issue.4, 2011. ,
DOI : 10.1145/2043652.2043658
URL : https://hal.archives-ouvertes.fr/hal-00677722
Finding optimal probabilistic generators for XML collections, Proceedings of the 15th International Conference on Database Theory, ICDT '12, 2012. ,
DOI : 10.1145/2274576.2274591
URL : https://hal.archives-ouvertes.fr/hal-00765545
Auto-completion learning for XML Demonstration, Proc. SIGMOD, pp.669-672, 2012. ,
Towards a version control model with uncertain data, Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management, PIKM '11, 2011. ,
DOI : 10.1145/2065003.2065013
URL : https://hal.archives-ouvertes.fr/hal-00745195
Probabilistic XML via Markov Chains, Proceedings of the VLDB Endowment, pp.770-781, 2010. ,
DOI : 10.14778/1920841.1920939
URL : https://hal.archives-ouvertes.fr/hal-00537778
Determining relevance of accesses at runtime, Proceedings of the 30th symposium on Principles of database systems of data, PODS '11, pp.211-222, 2011. ,
DOI : 10.1145/1989284.1989309
URL : https://hal.archives-ouvertes.fr/hal-00603647
Monadic Datalog Containment, Proc. ICALP, pp.79-91, 2012. ,
DOI : 10.1007/978-3-642-31585-5_11
URL : https://hal.archives-ouvertes.fr/hal-00809306
ProFoUnd, Proceedings of the 21st international conference companion on World Wide Web, WWW '12 Companion ,
DOI : 10.1145/2187980.2188037
URL : https://hal.archives-ouvertes.fr/hal-00690624
Running tree automata on probabilistic XML, Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, PODS '09, 2009. ,
DOI : 10.1145/1559795.1559831
Probabilistic databases, Communications of the ACM, vol.52, issue.7, 2009. ,
DOI : 10.1145/1538788.1538810
SPROUT 2 : a squared query engine for uncertain web data, SIGMOD, 2011. ,
Fuzzy Databases: Modeling, Design And Implementation, 2005. ,
DOI : 10.4018/978-1-59140-324-1
Schema mapping discovery from data instances, Journal of the ACM, vol.57, issue.2, 2010. ,
DOI : 10.1145/1667053.1667055
URL : https://hal.archives-ouvertes.fr/inria-00537238
Models for incomplete and probabilistic information, Proc. EDBT Workshops, IIDB, 2006. ,
Incomplete Information in Relational Databases, Journal of the ACM, vol.31, issue.4, pp.761-791, 1984. ,
DOI : 10.1145/1634.1886
Updating probabilistic XML, Proceedings of the 1st International Workshop on Data Semantics, DataSem '10, 2010. ,
DOI : 10.1145/1754239.1754264
URL : https://hal.archives-ouvertes.fr/hal-00537793
Value joins are expensive over (probabilistic) XML, Proceedings of the 4th International Workshop on Logic in Databases, LID '11, pp.41-48, 2011. ,
DOI : 10.1145/1966357.1966366
URL : https://hal.archives-ouvertes.fr/inria-00591905
Query evaluation over probabilistic XML, The VLDB Journal, vol.453, issue.1???2, 2009. ,
DOI : 10.1007/s00778-009-0150-5
MayBMS: A system for managing large uncertain and probabilistic databases, Managing and Mining Uncertain Data, 2009. ,
Comprendre le Web caché Understanding the Hidden Web, 2007. ,
On the complexity of managing probabilistic XML data, Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '07, pp.283-292, 2007. ,
DOI : 10.1145/1265530.1265570
URL : https://hal.archives-ouvertes.fr/inria-00137138
On the complexity of deriving schema mappings from database instances, Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '08, pp.23-32, 2008. ,
DOI : 10.1145/1376916.1376921
ProApproX, Proceedings of the 2011 international conference on Management of data, SIGMOD '11, pp.1295-1298, 2011. ,
DOI : 10.1145/1989323.1989480
URL : https://hal.archives-ouvertes.fr/hal-00745187
Automatic wrapper induction from hidden-web sources with domain knowledge, Proceeding of the 10th ACM workshop on Web information and data management, WIDM '08, pp.9-16, 2008. ,
DOI : 10.1145/1458502.1458505
URL : https://hal.archives-ouvertes.fr/inria-00337098
Efficient query evaluation over probabilistic XML with long-distance dependencies, Proceedings of the 2011 Joint EDBT/ICDT Ph.D. Workshop on, PhD '11, 2011. ,
DOI : 10.1145/1966874.1966880
Optimizing approximations of DNF query lineage in probabilistic XML, 2013 IEEE 29th International Conference on Data Engineering (ICDE), 2012. ,
DOI : 10.1109/ICDE.2013.6544869
URL : https://hal.archives-ouvertes.fr/hal-00874442
YAGO: A core of semantic knowledge. Unifying WordNet and Wikipedia, WWW, pp.697-706, 2007. ,
Trio: A system for integrated management of data, accuracy, and lineage, CIDR, 2005. ,
A simple view of the Dempster-Shafer theory of evidence and its implication for the rule of combination ,