Skip to Main content Skip to Navigation
Theses

Nouvelles approches pour l'exploitation des données de séquences génomique haut débit

Antoine Limasset 1, 2
2 GenScale - Scalable, Optimized and Parallel Algorithms for Genomics
Inria Rennes – Bretagne Atlantique , IRISA-D7 - GESTION DES DONNÉES ET DE LA CONNAISSANCE
Abstract : Novel approaches for the exploitation of high throughput sequencing data In this thesis we discuss computational methods to deal with DNA sequences provided by high throughput sequencers. We will mostly focus on the reconstruction of genomes from DNA fragments (genome assembly) and closely related problems. These tasks combine huge amounts of data with combinatorial problems. Various graph structures are used to handle this problem, presenting trade-off between scalability and assembly quality. This thesis introduces several contributions in order to cope with these tasks. First, novel representations of assembly graphs are proposed to allow a better scaling. We also present novel uses of those graphs apart from assembly and we propose tools to use such graphs as references when a fully assembled genome is not available. Finally we show how to use those methods to produce less fragmented assembly while remaining tractable.
Complete list of metadatas

Cited literature [159 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01686367
Contributor : Abes Star :  Contact
Submitted on : Wednesday, January 17, 2018 - 12:09:08 PM
Last modification on : Thursday, February 27, 2020 - 1:19:31 AM
Document(s) archivé(s) le : Tuesday, May 8, 2018 - 12:39:07 AM

File

LIMASSET_Antoine.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01686367, version 1

Citation

Antoine Limasset. Nouvelles approches pour l'exploitation des données de séquences génomique haut débit. Bio-informatique [q-bio.QM]. Université Rennes 1, 2017. Français. ⟨NNT : 2017REN1S049⟩. ⟨tel-01686367⟩

Share

Metrics

Record views

434

Files downloads

189