B. Alberts, Molecular Biology of the Cell, 4th ed., Garland, 2002.

S. F. Altschul, Basic local alignment search tool, Journal of molecular biology, vol.215, pp.403-410, 1990.

S. F. Altschul, Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic acids research, vol.25, pp.3389-3402, 1997.

A. Apostolico, Parallel construction of a suffix tree with applications, Algorithmica, vol.3, pp.347-365, 1988.

E. Audemard, T. Schiex, and T. Faraut, Detecting long tandem duplications in genomic sequences, BMC bioinformatics, vol.13, p.83, 2012.

D. Bachtrog, Y-chromosome evolution: emerging insights into processes of Y-chromosome degeneration, Nature Reviews Genetics, vol.14, p.113, 2013.

A. Backurs and P. Indyk, Edit distance cannot be computed in strongly subquadratic time (unless SETH is false), Proceedings of the forty-seventh annual ACM symposium on Theory of computing, pp.51-58, 2015.

M. Bahlo, Recent advances in the detection of repeat expansions with short-read next-generation sequencing, F1000Research, vol.7, 2018.

J. A. Bailey and E. E. Eichler, Primate segmental duplications: crucibles of evolution, diversity and disease, Nature Reviews Genetics, vol.7, issue.7, p.552, 2006.

J. A. Bailey, G. Liu, and E. E. Eichler, An Alu transposition model for the origin and expansion of human segmental duplications, The American Journal of Human Genetics, vol.73, pp.823-834, 2003.

J. A. Bailey, Recent segmental duplications in the human genome, Science, vol.297, pp.1003-1007, 2002.

J. A. Bailey, Segmental duplications: organization and impact within the current human genome project assembly, Genome research, vol.11, pp.1005-1017, 2001.

P. Balaresque, Dynamic nature of the proximal AZFc region of the human Y chromosome: multiple independent deletion and duplication events revealed by microsatellite analysis, Human mutation, vol.29, pp.1171-1180, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00393071

Z. Bao and S. R. Eddy, Automated de novo identification of repeat sequence families in sequenced genomes, Genome research, vol.12, pp.1269-1276, 2002.

J. A. Bedell, I. Korf, and W. Gish, MaskerAid: a performance enhancement to RepeatMasker, Bioinformatics, vol.16, pp.1040-1041, 2000.

G. Benson, Tandem repeats finder: a program to analyze DNA sequences, Nucleic acids research, vol.27, pp.573-580, 1999.

G. Blanc and K. H. Wolfe, Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes, The Plant Cell, vol.16, pp.1667-1678, 2004.

G. Blanc, Extensive duplication and reshuffling in the Arabidopsis genome, The Plant Cell, vol.12, pp.1093-1101, 2000.


C. Camacho, BLAST+: architecture and applications, BMC bioinformatics, vol.10, p.421, 2009.

G. Chen, Y. Chang, and C. Hsueh, PRAP: an ab initio software package for automated genome-wide analysis of DNA repeats for prokaryotes, Bioinformatics, vol.29, pp.2683-2689, 2013.

J. Cheung, Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence, Genome biology, vol.4, p.25, 2003.

J. Cheung, Recent segmental and gene duplications in the mouse genome, Genome biology, vol.4, p.47, 2003.

CUDA homepage

E. Darai-Ramqvist, Segmental duplications and evolutionary plasticity at tumor chromosome break-prone regions, Genome research, 2008.

A. L. Delcher, S. L. Salzberg, and A. M. Phillippy, Using MUMmer to identify similar regions in large sequence sets, Current protocols in bioinformatics, vol.1, pp.10-13, 2003.

A. L. Delcher, Alignment of whole genomes, Nucleic acids research, vol.27, pp.2369-2376, 1999.

A. L. Delcher, Fast algorithms for large-scale genome alignment and comparison, Nucleic acids research, vol.30, pp.2478-2483, 2002.

F. Delehelle, ASGART: fast and parallel genome scale segmental duplications mapping, Bioinformatics, 2018.
URL : https://hal.archives-ouvertes.fr/hal-02413201

G. Djedatin, DuplicationDetector, a light weight tool for duplication detection using NGS data, Current Plant Biology, vol.9, pp.23-28, 2017.

K. S. Dulai, The evolution of trichromatic color vision by opsin gene duplication in New World and Old World primates, Genome research, vol.9, issue.7, pp.629-638, 1999.

L. Duret and N. Galtier, Biased gene conversion and the evolution of mammalian genomic landscapes, Annual review of genomics and human genetics, vol.10, pp.285-311, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00428399

R. C. Edgar and E. W. Myers, PILER: identification and classification of genomic repeats, Bioinformatics, vol.21, suppl.1, pp.152-158, 2005.

E. E. Eichler, Widening the spectrum of human genetic variation, Nature genetics, vol.38, p.9, 2006.

EMBOSS Stretcher online manual

M. Farach and S. Muthukrishnan, Optimal logarithmic time randomized suffix tree construction, International Colloquium on Automata, Languages, and Programming, pp.550-561, 1996.

E. Fernandez, W. Najjar, and S. Lonardi, String matching in hardware using the FM-Index, 2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), IEEE, pp.218-225, 2011.

P. Ferragina and G. Manzini, Opportunistic data structures with applications, Proceedings of the 41st Annual Symposium on Foundations of Computer Science, IEEE, pp.390-398, 2000.

I. Flores and G. Madpis, Average binary search length for dense ordered lists, Communications of the ACM, vol.14, pp.602-603, 1971.

D. Fredman, Complex SNP-related sequence variation in segmental genome duplications, Nature genetics, vol.36, p.861, 2004.

L. Gardiner, Analysis of the recombination landscape of hexaploid bread wheat reveals genes controlling recombination and gene conversion frequency, bioRxiv, p.539684, 2019.

GFF2 file format specification

GFF3 file format specification

G. Giannuzzi, Analysis of high-identity segmental duplications in the grapevine genome, BMC genomics, vol.12, p.436, 2011.

H. Z. Girgis, Red: an intelligent, rapid, accurate tool for detecting repeats de-novo on the genomic scale, BMC bioinformatics, vol.16, p.227, 2015.

S. M. K. Glasauer and S. C. F. Neuhauss, Whole-genome duplication in teleost fishes and its evolutionary consequences, Molecular genetics and genomics, vol.289, pp.1045-1060, 2014.

S. Goodwin, J. D. McPherson, and W. R. McCombie, Coming of age: ten years of next-generation sequencing technologies, Nature Reviews Genetics, vol.17, p.333, 2016.

B. J. Haas, DAGchainer: a tool for mining segmental genome duplications and synteny, Bioinformatics, vol.20, pp.3643-3646, 2004.

T. Hachiya, Accurate identification of orthologous segments among multiple genomes, Bioinformatics, vol.25, pp.853-860, 2009.

M. W. Hahn, J. P. Demuth, and S.-G. Han, Accelerated rate of gene gain and loss in primates, Genetics, 2007.

P. Hallast, Recombination dynamics of a human Y-chromosomal palindrome: rapid GC-biased gene conversion, multi-kilobase conversion tracts, and rare inversions, PLoS genetics, vol.9, issue.7, p.1003666, 2013.

R. W. Hamming, Error detecting and error correcting codes, Bell System technical journal, vol.29, pp.147-160, 1950.

D. S. Hirschberg, A linear space algorithm for computing maximal common subsequences, Communications of the ACM, vol.18, pp.341-343, 1975.

E. B. Hook and S. G. Albright, Mutation rates for unbalanced Robertsonian translocations associated with Down syndrome. Evidence for a temporal change in New York State live births 1968-1977, American journal of human genetics, vol.33, p.443, 1981.

J. F. Hughes, Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content, Nature, vol.463, p.536, 2010.

J. F. Hughes, H. Skaletsky, and D. C. Page, Sequencing of rhesus macaque Y chromosome clarifies origins and evolution of the DAZ (Deleted in AZoospermia) genes, Bioessays, vol.34, pp.1035-1044, 2012.

J. F. Hughes, Sex chromosome-to-autosome transposition events counter Y-chromosome gene loss in mammals, Genome biology, vol.16, p.104, 2015.

P. Ibáñez, α-Synuclein gene rearrangements in dominantly inherited parkinsonism: frequency, phenotype, and mechanisms, Archives of neurology, vol.66, pp.102-108, 2009.

A. J. Jeffreys and C. A. May, Intense and highly localized gene conversion activity in human meiotic crossover hot spots, Nature genetics, vol.36, issue.2, p.151, 2004.

Z. Jiang, Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution, Nature genetics, vol.39, p.1361, 2007.

M. Jobling, M. Hurles, and C. Tyler-Smith, Human evolutionary genetics: origins, peoples & disease, Garland Science, 2013.

M. E. Johnson, Recurrent duplication-driven transposition of DNA during hominoid evolution, Proceedings of the National Academy of Sciences, vol.103, pp.17626-17631, 2006.

J. J. Kasianowicz, Characterization of individual polynucleotide molecules using a membrane channel, Proceedings of the National Academy of Sciences, vol.93, issue.24, pp.13770-13773, 1996.

S. M. Kielbasa, Adaptive seeds tame genomic sequence comparison, Genome research, p.113985, 2011.

D. K. Kim, Linear-time construction of suffix arrays, Annual Symposium on Combinatorial Pattern Matching, pp.186-199, 2003.

P. Ko and S. Aluru, Space efficient linear time construction of suffix arrays, Journal of Discrete Algorithms, vol.3, pp.143-156, 2005.

R. Kolpakov, G. Bana, and G. Kucherov, mreps: efficient and flexible detection of tandem repeats in DNA, Nucleic acids research, vol.31, pp.3672-3678, 2003.
URL : https://hal.archives-ouvertes.fr/inria-00099597

R. Koszul and G. Fischer, A prominent role for segmental duplications in modeling eukaryotic genomes, Comptes Rendus Biologies, vol.332, pp.254-266, 2009.

N. V. Kovaleva, Results of estimation of mutation rates for translocation trisomy 21, Tsitologiia, vol.44, issue.11, pp.1115-1119, 2002.

R. A. Kumar, Recurrent 16p11.2 microdeletions in autism, Human molecular genetics, vol.17, pp.628-638, 2007.

S. Kurtz and C. Schleiermacher, REPuter: fast computation of maximal repeats in complete genomes, Bioinformatics, vol.15, pp.426-427, 1999.

S. Kurtz, Versatile and open software for comparing large genomes, Genome biology, vol.5, p.12, 2004.

B. T. Lahn and D. C. Page, Four evolutionary strata on the human X chromosome, Science, vol.286, pp.964-967, 1999.

M. J. Levene, Zero-mode waveguides for single-molecule analysis at high concentrations, Science, vol.299, issue.5607, pp.682-686, 2003.

V. G. Levitsky, RECON: a program for prediction of nucleosome formation potential, Nucleic acids research, vol.32, pp.346-349, 2004.

H. Li, Fast construction of FM-index for long sequence reads, Bioinformatics, vol.30, pp.3274-3275, 2014.

Z. Li, J. Li, and H. Huo, Optimal in-place suffix sorting, International Symposium on String Processing and Information Retrieval, pp.268-284, 2018.

C. Ling and K. Benkrid, Design and implementation of a CUDA-compatible GPU-based core for gapped BLAST algorithm, Procedia Computer Science, vol.1, issue.1, pp.495-504, 2010.

D. J. Lipman and W. R. Pearson, Rapid and sensitive protein similarity searches, Science, vol.227, pp.1435-1441, 1985.

H. Liu, Tetrad analysis in plants and fungi finds large differences in gene conversion rates but no GC bias, Nature ecology & evolution, vol.2, issue.1, p.164, 2018.

U. Manber and G. Myers, Suffix arrays: a new method for on-line string searches, SIAM Journal on Computing, vol.22, pp.935-948, 1993.

T. Marques-Bonet, S. Girirajan, and E. E. Eichler, The origins and impact of primate segmental duplications, Trends in Genetics, vol.25, pp.443-454, 2009.

E. M. McCreight, A space-economical suffix tree construction algorithm, Journal of the ACM (JACM), vol.23, pp.262-272, 1976.

D. T. Miller, Microdeletion/duplication at 15q13.2q13.3 among individuals with features of autism and other neuropsychiatric disorders, Journal of medical genetics, 2008.

R. C. Moore and M. D. Purugganan, The evolutionary dynamics of plant duplicate genes, Current opinion in plant biology, vol.8, pp.122-128, 2005.

A. Morgulis, A fast and symmetric DUST implementation to mask low-complexity DNA sequences, Journal of Computational Biology, vol.13, issue.5, pp.1028-1040, 2006.

Y. Mori, The divsufsort library

MUMmer online manual

C. Münch, Evolutionary analysis of the highly dynamic CHEK2 duplicon in anthropoids, BMC evolutionary biology, vol.8, p.269, 2008.

E. W. Myers and W. Miller, Optimal alignments in linear space, Bioinformatics, vol.4, pp.11-17, 1988.

S. B. Needleman and C. D. Wunsch, A general method applicable to the search for similarities in the amino acid sequence of two proteins, Journal of molecular biology, vol.48, pp.443-453, 1970.

L. Noé and G. Kucherov, YASS: enhancing the sensitivity of DNA similarity search, Nucleic acids research, vol.33, pp.540-543, 2005.

P. Nyrén, B. Pettersson, and M. Uhlén, Solid phase DNA minisequencing by an enzymatic luminometric inorganic pyrophosphate detection assay, Analytical biochemistry, vol.208, pp.171-175, 1993.

C. Oehmen and J. Nieplocha, ScalaBLAST: A scalable implementation of BLAST for high-performance data-intensive bioinformatics analysis, IEEE Transactions on Parallel and Distributed Systems, vol.17, issue.8, pp.740-749, 2006.

P. Raeymaekers, Duplication in chromosome 17p11.2 in Charcot-Marie-Tooth neuropathy type 1a (CMT 1a), Neuromuscular disorders, vol.1, issue.2, pp.93-97, 1991.

V. Régnier, Emergence and scattering of multiple neurofibromatosis (NF1)-related sequences during hominoid evolution suggest a process of pericentromeric interchromosomal transposition, Human molecular genetics, vol.6, pp.9-16, 1997.

S. Repping, Recombination between palindromes P5 and P1 on the human Y chromosome causes massive deletions and spermatogenic failure, The American Journal of Human Genetics, vol.71, pp.906-922, 2002.

M. T. Ross, The DNA sequence of the human X chromosome, Nature, vol.434, p.325, 2005.

O. A. Ross, Genomic investigation of α-synuclein multiplication and parkinsonism, Annals of Neurology: Official Journal of the American Neurological Association and the Child Neurology Society, vol.63, pp.743-750, 2008.

A. Rovelet-Lecrux, APP locus duplication causes autosomal dominant early-onset Alzheimer disease with cerebral amyloid angiopathy, Nature genetics, vol.38, p.24, 2006.

S. Rozen, Abundant gene conversion between arms of palindromes in human and ape Y chromosomes, Nature, vol.423, p.873, 2003.

V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals (in Russian), Doklady Akademii Nauk SSSR, vol.163, pp.845-848, 1965.

K. Rutherford, Artemis: sequence visualization and annotation, Bioinformatics, vol.16, pp.944-945, 2000.

S. Besenbacher, Direct estimation of mutations in great apes reconciles phylogenetic dating, Nature Ecology & Evolution, 2019.

S. J. Sanders, Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism, Neuron, vol.70, pp.863-885, 2011.

F. Sanger, S. Nicklen, and A. R. Coulson, DNA sequencing with chain-terminating inhibitors, Proceedings of the National Academy of Sciences, vol.74, pp.5463-5467, 1977.

F. Sanger, Nucleotide sequence of bacteriophage φX174 DNA, Nature, vol.265, issue.5596, p.687, 1977.

A. Scally, The mutation rate in human evolution and demographic inference, Current opinion in genetics & development, vol.41, pp.36-43, 2016.

K. U. Schulz and S. Mihov, Fast string correction with Levenshtein automata, International Journal on Document Analysis and Recognition, vol.5, pp.67-85, 2002.

D. Sharma, Spectral Repeat Finder (SRF): identification of repetitive sequences using Fourier transformation, Bioinformatics, vol.20, pp.1405-1412, 2004.

A. J. Sharp, Segmental duplications and copy-number variation in the human genome, The American Journal of Human Genetics, vol.77, pp.78-88, 2005.

X. She, Mouse segmental duplication and copy number variation, Nature genetics, vol.40, issue.7, p.909, 2008.

J. Shun and G. E. Blelloch, A simple parallel cartesian tree algorithm and its application to parallel suffix tree construction, ACM Transactions on Parallel Computing, vol.1, issue.1, p.8, 2014.

A. U. Sinha and J. Meller, Cinteny: flexible analysis and visualization of synteny and genome rearrangements in multiple organisms, BMC bioinformatics, vol.8, p.82, 2007.

H. Skaletsky, The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes, Nature, vol.423, p.825, 2003.

A. F. A. Smit, R. Hubley, and P. Green, RepeatMasker, 1996.

M. Spitzer, VisCoSe: visualization and comparison of consensus sequences, Bioinformatics, vol.20, pp.433-435, 2004.

P. H. Sudmant, Diversity of human copy number variation and multicopy genes, Science, vol.330, pp.641-646, 2010.

W. Sung, Evolution of the insertion-deletion mutation rate across the tree of life, G3: Genes, Genomes, Genetics, 2016.

P. B. Talbert and S. Henikoff, Centromeres convert but don't cross, PLoS biology, vol.8, p.1000326, 2010.

R. M. Tankard, Detecting known repeat expansions with standard protocol next generation sequencing, towards developing a single screening test for neurological repeat expansion disorders, bioRxiv, p.157792, 2017.

J. S. Taylor, Comparative genomics provides evidence for an ancient genome duplication event in fish, Philosophical Transactions of the Royal Society of London B: Biological Sciences, vol.356, pp.1661-1679, 2001.

J. S. Taylor, Genome duplication, a trait shared by 22,000 species of ray-finned fish, Genome research, vol.13, pp.382-390, 2003.

L. S. Teitz, Selection Has Countered High Mutability to Preserve the Ancestral Copy Number of Y Chromosome Amplicons in Diverse Human Lineages, The American Journal of Human Genetics, vol.103, pp.261-275, 2018.

The Ensembl genome browser

The jq tool

The NCBI genome browser

The OpenCL specification

The SQL language specification

The UCSC genome browser

The Vmatch large scale sequence analysis software

E. Ukkonen, On-line construction of suffix trees, Algorithmica, vol.14, issue.3, pp.249-260, 1995.

Unipro UGene Software Suite

J. C. Venter, The sequence of the human genome, Science, vol.291, issue.5507, pp.1304-1351, 2001.
URL : https://hal.archives-ouvertes.fr/hal-00465088

N. Volfovsky, B. J. Haas, and S. L. Salzberg, A clustering method for repeat analysis in DNA sequences, Genome Biology, vol.2, pp.27-28, 2001.

T. F. Smith and M. S. Waterman, Identification of common molecular subsequences, Journal of Molecular Biology, vol.147, pp.195-197, 1981.

P. Weiner, Linear pattern matching algorithms, IEEE Conference Record of 14th Annual Symposium on Switching and Automata Theory (SWAT), IEEE, pp.1-11, 1973.

M. Yoshimoto, PTEN genomic deletions that characterize aggressive prostate cancer originate close to segmental duplications, Genes, Chromosomes and Cancer, vol.51, pp.149-160, 2012.

Z. Zhang, A greedy algorithm for aligning DNA sequences, Journal of Computational biology, vol.7, issue.1-2, pp.203-214, 2000.

Q. Zhao, Segmental duplications in the silkworm genome, BMC genomics, vol.14, p.521, 2013.