Skip to Main content Skip to Navigation
Theses

Predicting query performance and explaining results to assist Linked Data consumption

Rakebul Hasan 1
1 WIMMICS - Web-Instrumented Man-Machine Interactions, Communities and Semantics
CRISAM - Inria Sophia Antipolis - Méditerranée , Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : Our goal is to assist users in understanding SPARQL query performance, query results, and derivations on Linked Data. To help users in understanding query performance, we provide query performance predictions based on the query execution history. We present a machine learning approach to predict query performances. We do not use statistics about the underlying data for our predictions. This makes our approach suitable for the Linked Data scenario where statistics about the underlying data is often missing such as when the data is controlled by external parties. To help users in understanding query results, we provide provenance-based query result explanations. We present a non-annotation-based approach to generate why-provenance for SPARQL query results. Our approach does not require any re-engineering of the query processor, the data model, or the query language. We use the existing SPARQL 1.1 constructs to generate provenance by querying the data. This makes our approach suitable for Linked Data. We also present a user study to examine the impact of query result explanations. Finally to help users in understanding derivations on Linked Data, we introduce the concept of Linked Explanations. We publish explanation metadata as Linked Data. This allows explaining derived data in Linked Data by following the links of the data used in the derivation and the links of their explanation metadata. We present an extension of the W3C PROV ontology to describe explanation metadata. We also present an approach to summarize these explanations to help users filter information in the explanation, and have an understanding of what important information was used in the derivation.
Complete list of metadatas

Cited literature [68 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01127124
Contributor : Abes Star :  Contact
Submitted on : Saturday, March 7, 2015 - 12:14:46 AM
Last modification on : Thursday, March 5, 2020 - 4:52:01 PM
Document(s) archivé(s) le : Monday, June 8, 2015 - 10:31:17 AM

File

2014NICE4082.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01127124, version 1

Collections

Citation

Rakebul Hasan. Predicting query performance and explaining results to assist Linked Data consumption. Other [cs.OH]. Université Nice Sophia Antipolis, 2014. English. ⟨NNT : 2014NICE4082⟩. ⟨tel-01127124⟩

Share

Metrics

Record views

535

Files downloads

791