Skip to Main content Skip to Navigation
Theses

Provenance et Qualité dans les Workflows Orientés Données : application à la plateforme WebLab

Clément Caron 1
1 BD - Bases de Données
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : The WebLab platform is an application used to define and execute media-mining workflows. It is an open source platform, developed by the IPCC1 section of Airbus Defence and Space, for the integration of external components. A designer can create complex media-mining workflows using components, whose operation is not always known (black-boxes services). These complex workflows can lead to a problem of data quality, however, and before this work, no tool existed to analyse and improve the quality of WebLab workflows. To deal with black-box services, we choose to tackle this quality problem with a non-intrusive approach: we enhance the definition of the WebLab workflow with provenance and quality propagation rules. Provenance rules generate fine-grained data dependency links between data and services after the execution of a WebLab workflow. Then the quality propagation rules use these links to reason on the influence that the quality of the data used by a component has on the quality of the output data…
Document type :
Theses
Complete list of metadata

Cited literature [53 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01331050
Contributor : Abes Star :  Contact Connect in order to contact the contributor
Submitted on : Monday, June 13, 2016 - 1:43:07 PM
Last modification on : Friday, January 8, 2021 - 5:32:09 PM

File

2015PA066568.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01331050, version 1

Citation

Clément Caron. Provenance et Qualité dans les Workflows Orientés Données : application à la plateforme WebLab. Base de données [cs.DB]. Université Pierre et Marie Curie - Paris VI, 2015. Français. ⟨NNT : 2015PA066568⟩. ⟨tel-01331050⟩

Share

Metrics

Record views

481

Files downloads

222