
Algorithms for the reconstruction of conserved marker sequences in metagenomic data

Pierre Pericard 1, 2 
2 BONSAI - Bioinformatics and Sequence Analysis
Université de Lille, Sciences et Technologies, Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189, CNRS - Centre National de la Recherche Scientifique
Abstract : Recent advances in DNA sequencing now make it possible to study the genetic material of microbial communities extracted from natural environmental samples. This new research field, called metagenomics, is driving innovation in many areas such as human health, agriculture, and ecology. To analyse such samples, new bioinformatics methods are still needed to ascertain the taxonomic composition of the studied community, because accurate identification of organisms is a necessary step towards understanding even the simplest ecosystems. However, current sequencing technologies generate short and noisy DNA fragments that only partially cover complete gene sequences, which poses a major challenge for high-resolution taxonomic analysis. We developed MATAM, a new bioinformatics method dedicated to the fast reconstruction of low-error complete sequences of conserved phylogenetic markers, starting from raw sequencing data. This method is a multi-step process that builds and analyses a read overlap graph. We applied MATAM to the reconstruction of the small subunit ribosomal RNA in simulated, synthetic, and genuine metagenomes. We obtained high-quality results, improving on the state of the art.

Cited literature [90 references]
Contributor : Pierre Pericard
Submitted on : Tuesday, March 20, 2018 - 6:01:14 PM
Last modification on : Wednesday, September 7, 2022 - 8:14:05 AM
Long-term archiving on : Tuesday, September 11, 2018 - 10:26:07 PM




  • HAL Id : tel-01738687, version 2


Pierre Pericard. Algorithmes pour la reconstruction de séquences de marqueurs conservés dans des données de métagénomique. Bio-informatique [q-bio.QM]. Université de Lille, 2017. Français. ⟨NNT : 2017LIL10084⟩. ⟨tel-01738687v2⟩


