, Molecular Biology of the Cell, 4th ed., Garland, 2002.
Basic local alignment search tool, Journal of molecular biology, vol.215, pp.403-410, 1990. ,
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic acids research, vol.25, pp.3389-3402, 1997. ,
Parallel construction of a suffix tree with applications, Algorithmica, vol.3, pp.347-365, 1988. ,
Detecting long tandem duplications in genomic sequences, BMC bioinformatics, vol.13, p.83, 2012. ,
Y-chromosome evolution: emerging insights into processes of Y-chromosome degeneration, Nature Reviews Genetics, vol.14, p.113, 2013. ,
Edit distance cannot be computed in strongly subquadratic time (unless SETH is false), Proceedings of the forty-seventh annual ACM symposium on Theory of computing, pp.51-58, 2015. ,
Recent advances in the detection of repeat expansions with short-read next-generation sequencing, pp.1000-1007, 2018. ,
Primate segmental duplications: crucibles of evolution, diversity and disease, Nature Reviews Genetics, vol.7, issue.7, p.552, 2006. ,
An Alu transposition model for the origin and expansion of human segmental duplications, The American Journal of Human Genetics, vol.73, pp.823-834, 2003. ,
Recent segmental duplications in the human genome, Science, vol.297, pp.1003-1007, 2002. ,
Segmental duplications: organization and impact within the current human genome project assembly, Genome research, vol.11, pp.1005-1017, 2001. ,
Dynamic nature of the proximal AZFc region of the human Y chromosome: multiple independent deletion and duplication events revealed by microsatellite analysis, Human mutation, vol.29, pp.1171-1180, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-00393071
Automated de novo identification of repeat sequence families in sequenced genomes, Genome research, vol.12, pp.1269-1276, 2002. ,
MaskerAid: a performance enhancement to RepeatMasker, Bioinformatics, vol.16, pp.1040-1041, 2000. ,
Tandem repeats finder: a program to analyze DNA sequences, Nucleic acids research, vol.27, pp.573-580, 1999. ,
Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes, The plant cell, vol.16, pp.1667-1678, 2004. ,
Extensive duplication and reshuffling in the Arabidopsis genome, The Plant Cell, vol.12, pp.1093-1101, 2000. ,
, CALMIP
BLAST+: architecture and applications, BMC bioinformatics, vol.10, p.421, 2009. ,
PRAP: an ab initio software package for automated genome-wide analysis of DNA repeats for prokaryotes, Bioinformatics, vol.29, pp.2683-2689, 2013. ,
Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence, Genome biology, vol.4, p.25, 2003. ,
Recent segmental and gene duplications in the mouse genome, Genome biology, vol.4, p.47, 2003. ,
Segmental duplications and evolutionary plasticity at tumor chromosome break-prone regions, Genome research, 2008. ,
Using MUMmer to identify similar regions in large sequence sets, Current protocols in bioinformatics, vol.1, pp.10-13, 2003. ,
Alignment of whole genomes, Nucleic acids research, vol.27, pp.2369-2376, 1999. ,
Fast algorithms for large-scale genome alignment and comparison, Nucleic acids research, vol.30, pp.2478-2483, 2002. ,
ASGART: fast and parallel genome scale segmental duplications mapping, Bioinformatics, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02413201
DuplicationDetector, a light weight tool for duplication detection using NGS data, Current Plant Biology, vol.9, pp.23-28, 2017. ,
The evolution of trichromatic color vision by opsin gene duplication in New World and Old World primates, Genome research, vol.9, issue.7, pp.629-638, 1999. ,
Biased gene conversion and the evolution of mammalian genomic landscapes, Annual review of genomics and human genetics, vol.10, pp.285-311, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00428399
PILER: identification and classification of genomic repeats, Bioinformatics, vol.21, issue.suppl_1, pp.152-158, 2005. ,
Widening the spectrum of human genetic variation, Nature genetics, vol.38, p.9, 2006. ,
, EMBOSS Stretcher online manual
Optimal logarithmic time randomized suffix tree construction, International Colloquium on Automata, Languages, and Programming, pp.550-561, 1996. ,
String matching in hardware using the FM-Index, Field-Programmable Custom Computing Machines (FCCM), 2011 IEEE 19th Annual International Symposium on. IEEE, pp.218-225, 2011. ,
Opportunistic data structures with applications, Foundations of Computer Science, 2000. Proceedings. 41st Annual Symposium on. IEEE, pp.390-398, 2000. ,
Average binary search length for dense ordered lists, Communications of the ACM, vol.14, pp.602-603, 1971. ,
Complex SNP-related sequence variation in segmental genome duplications, Nature genetics, vol.36, p.861, 2004. ,
Analysis of the recombination landscape of hexaploid bread wheat reveals genes controlling recombination and gene conversion frequency, bioRxiv, p.539684, 2019. ,
, GFF2 file format specification
, GFF3 file format specification
Analysis of high-identity segmental duplications in the grapevine genome, BMC genomics, vol.12, p.436, 2011. ,
Red: an intelligent, rapid, accurate tool for detecting repeats de-novo on the genomic scale, BMC bioinformatics, vol.16, p.227, 2015. ,
Whole-genome duplication in teleost fishes and its evolutionary consequences, Molecular genetics and genomics, vol.289, pp.1045-1060, 2014. ,
Coming of age: ten years of next-generation sequencing technologies, Nature Reviews Genetics, vol.17, p.333, 2016. ,
DAGchainer: a tool for mining segmental genome duplications and synteny, Bioinformatics, vol.20, pp.3643-3646, 2004. ,
Accurate identification of orthologous segments among multiple genomes, Bioinformatics, vol.25, pp.853-860, 2009. ,
Accelerated rate of gene gain and loss in primates, Genetics, 2007. ,
Recombination dynamics of a human Y-chromosomal palindrome: rapid GC-biased gene conversion, multi-kilobase conversion tracts, and rare inversions, PLoS genetics, vol.9, issue.7, p.1003666, 2013. ,
Error detecting and error correcting codes, Bell System technical journal, vol.29, pp.147-160, 1950. ,
A linear space algorithm for computing maximal common subsequences, Communications of the ACM, vol.18, pp.341-343, 1975. ,
Mutation rates for unbalanced Robertsonian translocations associated with Down syndrome. Evidence for a temporal change in New York State live births 1968-1977, American journal of human genetics, vol.33, p.443, 1981. ,
Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content, Nature, vol.463, p.536, 2010. ,
Sequencing of rhesus macaque Y chromosome clarifies origins and evolution of the DAZ (Deleted in AZoospermia) genes, Bioessays, vol.34, pp.1035-1044, 2012. ,
Sex chromosome-to-autosome transposition events counter Y-chromosome gene loss in mammals, Genome biology, vol.16, p.104, 2015. ,
α-Synuclein gene rearrangements in dominantly inherited parkinsonism: frequency, phenotype, and mechanisms, Archives of neurology, vol.66, pp.102-108, 2009. ,
Intense and highly localized gene conversion activity in human meiotic crossover hot spots, Nature genetics, vol.36, issue.2, p.151, 2004. ,
Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution, Nature genetics, vol.39, p.1361, 2007. ,
Human evolutionary genetics: origins, peoples & disease. Garland Science, 2013. ,
Recurrent duplication-driven transposition of DNA during hominoid evolution, Proceedings of the National Academy of Sciences, vol.103, pp.17626-17631, 2006. ,
Characterization of individual polynucleotide molecules using a membrane channel, Proceedings of the National Academy of Sciences, vol.93, issue.24, pp.13770-13773, 1996. ,
Adaptive seeds tame genomic sequence comparison, Genome research, p.113985, 2011. ,
Linear-time construction of suffix arrays, Annual Symposium on Combinatorial Pattern Matching, pp.186-199, 2003. ,
Space efficient linear time construction of suffix arrays, Journal of Discrete Algorithms, vol.3, pp.143-156, 2005. ,
mreps: efficient and flexible detection of tandem repeats in DNA, Nucleic acids research, vol.31, pp.3672-3678, 2003. ,
URL : https://hal.archives-ouvertes.fr/inria-00099597
A prominent role for segmental duplications in modeling eukaryotic genomes, Comptes Rendus Biologies, vol.332, pp.254-266, 2009. ,
Results of estimation of mutation rates for translocation trisomy 21, Tsitologiia, vol.44, issue.11, pp.1115-1119, 2002. ,
Recurrent 16p11.2 microdeletions in autism, Human molecular genetics, vol.17, pp.628-638, 2007. ,
REPuter: fast computation of maximal repeats in complete genomes, Bioinformatics, vol.15, pp.426-427, 1999. ,
Versatile and open software for comparing large genomes, Genome biology, vol.5, p.12, 2004. ,
Four evolutionary strata on the human X chromosome, Science, vol.286, pp.964-967, 1999. ,
Zero-mode waveguides for single-molecule analysis at high concentrations, Science, vol.299, issue.5607, pp.682-686, 2003. ,
RECON: a program for prediction of nucleosome formation potential, Nucleic acids research, vol.32, pp.346-349, 2004. ,
Fast construction of FM-index for long sequence reads, Bioinformatics, vol.30, pp.3274-3275, 2014. ,
Optimal in-place suffix sorting, International Symposium on String Processing and Information Retrieval, pp.268-284, 2018. ,
Design and implementation of a CUDA-compatible GPU-based core for gapped BLAST algorithm, Procedia Computer Science, vol.1, issue.1, pp.495-504, 2010. ,
Rapid and sensitive protein similarity searches, Science, vol.227, pp.1435-1441, 1985. ,
Tetrad analysis in plants and fungi finds large differences in gene conversion rates but no GC bias, Nature ecology & evolution, vol.2, issue.1, p.164, 2018. ,
Suffix arrays: a new method for on-line string searches, SIAM Journal on Computing, vol.22, pp.935-948, 1993. ,
The origins and impact of primate segmental duplications, Trends in Genetics, vol.25, pp.443-454, 2009. ,
A space-economical suffix tree construction algorithm, Journal of the ACM (JACM), vol.23, pp.262-272, 1976. ,
Microdeletion/duplication at 15q13.2q13.3 among individuals with features of autism and other neuropsychiatric disorders, Journal of medical genetics, 2008. ,
The evolutionary dynamics of plant duplicate genes, Current opinion in plant biology, vol.8, pp.122-128, 2005. ,
A fast and symmetric DUST implementation to mask low-complexity DNA sequences, Journal of Computational Biology, vol.13, issue.5, pp.1028-1040, 2006. ,
The divsufsort library ,
, MUMmer online manual
Evolutionary analysis of the highly dynamic CHEK2 duplicon in anthropoids, BMC evolutionary biology, vol.8, p.269, 2008. ,
Optimal alignments in linear space, Bioinformatics, vol.4, pp.11-17, 1988. ,
A general method applicable to the search for similarities in the amino acid sequence of two proteins, Journal of molecular biology, vol.48, pp.443-453, 1970. ,
YASS: enhancing the sensitivity of DNA similarity search, Nucleic acids research, vol.33, pp.540-543, 2005. ,
Solid phase DNA minisequencing by an enzymatic luminometric inorganic pyrophosphate detection assay, Analytical biochemistry, vol.208, pp.171-175, 1993. ,
ScalaBLAST: A scalable implementation of BLAST for high-performance data-intensive bioinformatics analysis, IEEE Transactions on Parallel and Distributed Systems, vol.17, issue.8, pp.740-749, 2006. ,
Duplication in chromosome 17p11.2 in Charcot-Marie-Tooth neuropathy type 1a (CMT 1a), Neuromuscular disorders, vol.1, issue.2, pp.93-97, 1991. ,
Emergence and scattering of multiple neurofibromatosis (NF1)-related sequences during hominoid evolution suggest a process of pericentromeric interchromosomal transposition, Human molecular genetics, vol.6, pp.9-16, 1997. ,
Recombination between palindromes P5 and P1 on the human Y chromosome causes massive deletions and spermatogenic failure, The American Journal of Human Genetics, vol.71, pp.906-922, 2002. ,
The DNA sequence of the human X chromosome, Nature, vol.434, p.325, 2005. ,
Genomic investigation of α-synuclein multiplication and parkinsonism, Annals of Neurology: Official Journal of the American Neurological Association and the Child Neurology Society, vol.63, pp.743-750, 2008. ,
APP locus duplication causes autosomal dominant early-onset Alzheimer disease with cerebral amyloid angiopathy, Nature genetics, vol.38, p.24, 2006. ,
Abundant gene conversion between arms of palindromes in human and ape Y chromosomes, Nature, vol.423, p.873, 2003. ,
Binary codes capable of correcting deletions, insertions, and reversals (in Russian), Doklady Akademii Nauk, vol.163, pp.845-848, 1965. ,
Artemis: sequence visualization and annotation, Bioinformatics, vol.16, pp.944-945, 2000. ,
Direct estimation of mutations in great apes reconciles phylogenetic dating, Nature Ecology & Evolution, 2019. ,
Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism, Neuron, vol.70, pp.863-885, 2011. ,
DNA sequencing with chain-terminating inhibitors, Proceedings of the national academy of sciences, vol.74, pp.5463-5467, 1977. ,
Nucleotide sequence of bacteriophage φX174 DNA, Nature, vol.265, issue.5596, p.687, 1977. ,
The mutation rate in human evolution and demographic inference, Current opinion in genetics & development, vol.41, pp.36-43, 2016. ,
Fast string correction with Levenshtein automata, International Journal on Document Analysis and Recognition, vol.5, pp.67-85, 2002. ,
Spectral Repeat Finder (SRF): identification of repetitive sequences using Fourier transformation, Bioinformatics, vol.20, pp.1405-1412, 2004. ,
Segmental duplications and copy-number variation in the human genome, The American Journal of Human Genetics, vol.77, pp.78-88, 2005. ,
Mouse segmental duplication and copy number variation, Nature genetics, vol.40, issue.7, p.909, 2008. ,
A simple parallel cartesian tree algorithm and its application to parallel suffix tree construction, ACM Transactions on Parallel Computing, vol.1, issue.1, p.8, 2014. ,
Cinteny: flexible analysis and visualization of synteny and genome rearrangements in multiple organisms, BMC bioinformatics, vol.8, p.82, 2007. ,
The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes, Nature, vol.423, p.825, 2003. ,
, , 1996.
VisCoSe: visualization and comparison of consensus sequences, Bioinformatics, vol.20, pp.433-435, 2004. ,
Diversity of human copy number variation and multicopy genes, Science, vol.330, pp.641-646, 2010. ,
Evolution of the insertion-deletion mutation rate across the tree of life, G3: Genes, Genomes, Genetics, 2016. ,
Centromeres convert but don't cross, PLoS biology, vol.8, p.1000326, 2010. ,
Detecting known repeat expansions with standard protocol next generation sequencing, towards developing a single screening test for neurological repeat expansion disorders, p.157792, 2017. ,
Comparative genomics provides evidence for an ancient genome duplication event in fish, Philosophical Transactions of the Royal Society of London B: Biological Sciences, vol.356, pp.1661-1679, 2001. ,
Genome duplication, a trait shared by 22,000 species of ray-finned fish, Genome research, vol.13, pp.382-390, 2003. ,
Selection Has Countered High Mutability to Preserve the Ancestral Copy Number of Y Chromosome Amplicons in Diverse Human Lineages, The American Journal of Human Genetics, vol.103, pp.261-275, 2018. ,
, , vol.1, pp.1-249250621
, The jq tool
, The OpenCL specification
, The SQL language specification
, The Vmatch large scale sequence analysis software
On-line construction of suffix trees, Algorithmica, vol.14, issue.3, pp.249-260, 1995. ,
, Unipro UGene Software Suite
The sequence of the human genome, Science, vol.291, issue.5507, pp.1304-1351, 2001. ,
URL : https://hal.archives-ouvertes.fr/hal-00465088
A clustering method for repeat analysis in DNA sequences, Genome Biology, vol.2, pp.27-28, 2001. ,
Identification of common molecular subsequences, Journal of molecular biology, vol.147, pp.195-197, 1981. ,
Linear pattern matching algorithms, IEEE Conference Record of 14th Annual Symposium on Switching and Automata Theory, pp.1-11, 1973. ,
PTEN genomic deletions that characterize aggressive prostate cancer originate close to segmental duplications, Genes, Chromosomes and Cancer, vol.51, pp.149-160, 2012. ,
A greedy algorithm for aligning DNA sequences, Journal of Computational biology, vol.7, issue.1-2, pp.203-214, 2000. ,
Segmental duplications in the silkworm genome, BMC genomics, vol.14, p.521, 2013. ,