Skip to Main content Skip to Navigation
Theses

Analysis and integration of heterogeneous large-scale genomics data : application to B cell differentiation and follicular lymphoma non coding mutations

Marine Louarn 1, 2
2 Dyliss - Dynamics, Logics and Inference for biological Systems and Sequences
Inria Rennes – Bretagne Atlantique , IRISA-D7 - GESTION DES DONNÉES ET DE LA CONNAISSANCE
Abstract : Regulatory networks inference from heterogeneous data is a computational step aiming at identifying key regulators involved in differentiation processes leading to cancer. In this thesis I focus on B cell differentiation, from which follicular lymphoma emerges. The first contribution outlines the reproducibility and reusability limitations of a state-of-the-art method for network inference from genomic data. To overcome these limitations, I demonstrated that Semantic Web technologies can structure and integrate large-scale heterogeneous datasets in a systematic way (second contribution). The original analysis workflow outputs could be reproduced as queries on a graph of data, which could itself be layered and enriched with public databases (third contribution). This demonstrates the technical relevance of this approach and underlines its benefits in improving reusability and reproducibility. As a fourth contribution, a new method for network inference was designed to take expert knowledge into account - both to extend the previous framework to the analysis of smaller, closely-related datasets and to enrich the inferred networks with signs, therefore including inhibitory regulatory processes. Finally, the method was applied to B cell differentiation, leading to the discovery of 146 TF with potential large impact on the network (fifth contribution).
Document type :
Theses
Complete list of metadata

https://tel.archives-ouvertes.fr/tel-03244465
Contributor : Abes Star :  Contact
Submitted on : Tuesday, June 1, 2021 - 11:32:13 AM
Last modification on : Wednesday, November 3, 2021 - 8:09:37 AM
Long-term archiving on: : Thursday, September 2, 2021 - 6:37:07 PM

File

LOUARN_Marine.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-03244465, version 1

Citation

Marine Louarn. Analysis and integration of heterogeneous large-scale genomics data : application to B cell differentiation and follicular lymphoma non coding mutations. Bioinformatics [q-bio.QM]. Université Rennes 1, 2020. English. ⟨NNT : 2020REN1S088⟩. ⟨tel-03244465⟩

Share

Metrics

Record views

87

Files downloads

119