Reconstruction and Clustering with Graph optimization and Priors on Gene networks and Images

Abstract : The discovery of novel gene regulatory processes improves the understanding of cell phenotypic responses to external stimuli for many biological applications, such as medicine, environment or biotechnologies. To this purpose, transcriptomic data are generated and analyzed from DNA microarrays or more recently RNAseq experiments. They consist in genetic expression level sequences obtained for all genes of a studied organism placed in different living conditions. From these data, gene regulation mechanisms can be recovered by revealing topological links encoded in graphs. In regulatory graphs, nodes correspond to genes. A link between two nodes is identified if a regulation relationship exists between the two corresponding genes. Such networks are called Gene Regulatory Networks (GRNs). Their construction as well as their analysis remain challenging despite the large number of available inference methods. In this thesis, we propose to address this network inference problem with recently developed techniques pertaining to graph optimization. Given all the pairwise gene regulation information available, we propose to determine the presence of edges in the final GRN by adopting an energy optimization formulation integrating additional constraints. Either biological (information about gene interactions) or structural (information about node connectivity) a priori have been considered to restrict the space of possible solutions. Different priors lead to different properties of the global cost function, for which various optimization strategies, either discrete and continuous, can be applied. The post-processing network refinements we designed led to computational approaches named BRANE for "Biologically-Related A priori for Network Enhancement". For each of the proposed methods — BRANE Cut, BRANE Relax and BRANE Clust — our contributions are threefold: a priori-based formulation, design of the optimization strategy and validation (numerical and/or biological) on benchmark datasets from DREAM4 and DREAM5 challenges showing numerical improvement reaching 20%. In a ramification of this thesis, we slide from graph inference to more generic data processing such as inverse problems. We notably invest in HOGMep, a Bayesian-based approach using a Variational Bayesian Approximation framework for its resolution. This approach allows to jointly perform reconstruction and clustering/segmentation tasks on multi-component data (for instance signals or images). Its performance in a color image deconvolution context demonstrates both quality of reconstruction and segmentation. A preliminary study in a medical data classification context linking genotype and phenotype yields promising results for forthcoming bioinformatics adaptations.
Complete list of metadatas

Cited literature [403 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01879732
Contributor : Aurélie Pirayre <>
Submitted on : Thursday, October 18, 2018 - 1:26:36 PM
Last modification on : Tuesday, January 22, 2019 - 3:51:44 PM
Long-term archiving on : Saturday, January 19, 2019 - 12:28:54 PM

Identifiers

  • HAL Id : tel-01879732, version 1

Collections

Citation

Aurélie Pirayre. Reconstruction and Clustering with Graph optimization and Priors on Gene networks and Images. Methods and statistics. Université Paris Est - Marne-la-Vallée, 2017. English. ⟨tel-01879732⟩

Share

Metrics

Record views

272

Files downloads

120