Estimation de l’histoire démographique des populations à partir de génomes entièrement séquencés

Abstract : The rapid development of DNA sequencing technologies is expanding the horizons of population genetic studies. It is expected that genomic data will increase our ability to reconstruct the history of populations.While this increase in genetic information will likely help biologists and anthropologists to reconstruct the demographic history of populations, it also poses big challenges. In some cases, simplicity of the model maylead to erroneous conclusions about the population under study. Recent works have shown that DNA patterns expected in individuals coming from structured populations correspond with those of unstructured populations with changes in size through time. As a consequence it is often difficult to determine whether demographic events such as expansions or contractions (bottlenecks) inferred from genetic data are real or due to the fact that populations are structured in nature. Moreover, almost no inferential method allowing to reconstruct pastdemographic size changes takes into account structure effects. In this thesis, some recent results in population genetics are presented: (i) a model choice procedure is proposed to distinguish one simple scenario of population size change from one of structured population, based on the coalescence times of two genes, showing that for these simple cases, it is possible to distinguish both models using genetic information form one single individual; (ii) by using the notion of instantaneous coalescent rate, it is demonstrated that for any scenario of structured population or any other one, regardless how complex it could be, there always exists a panmitic scenario with a precise function of population size changes havingexactly the same distribution for the coalescence times of two genes. This not only explains why spurious signals of bottlenecks can be found in structured populations but also predicts the demographic history that actual inference methods are likely to reconstruct when applied to non panmitic populations. Finally, (iii) a method based on a Markov process is developed for inferring past demographic events taking the structure into account. This is method uses the distribution of coalescence times of two genes to detect past demographic changes instructured populations from the DNA of one single individual. Some applications of the model to genomic data are discussed.
Document type :
Complete list of metadatas

Cited literature [204 references]  Display  Hide  Download
Contributor : Abes Star <>
Submitted on : Friday, March 29, 2019 - 10:23:14 AM
Last modification on : Monday, April 29, 2019 - 3:20:54 PM
Long-term archiving on : Sunday, June 30, 2019 - 1:18:15 PM


Version validated by the jury (STAR)


  • HAL Id : tel-02083602, version 1


Willy Rodriguez Valcarce. Estimation de l’histoire démographique des populations à partir de génomes entièrement séquencés. Probability [math.PR]. INSA de Toulouse, 2016. English. ⟨NNT : 2016ISAT0048⟩. ⟨tel-02083602⟩



Record views


Files downloads