N. Barbot, V. Barreaud, O. Boëffard, L. Charonnat et al., Towards a versatile multi-layered description of speech corpora using algebraic relations, Conference of the International Speech Communication Association (Interspeech), pp.1501-1504, 2011.

HMM-based European Portuguese TTS system, Proceedings of the European Conference on Speech Communication and Technology (Eurospeech).

Predictive model of segmental duration in French, 109th ASA Meeting, 1985.

E. Leonard, J. E. Baum, E. Leonard, T. Baum, G. Petrie et al., A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains ATLAS : A flexible and extensible architecture for linguistic annotation A formal framework for linguistic annotation Chatr : a generic speech synthesis system The Festival Speech Synthesis System -System documentation The blizzard challenge-2005 : Evaluating corpusbased speech synthesis on common datasets, Proceedings of the Language Resources and Evaluation Conference Proceedings of the conference on Computational linguistics. [Black2005] A.W. Black and K. Tokuda Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), 2005. [Boeffard2012] Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer, and Damien Lolive . Towards fully automatic annotation of audio books for tts Proceedings of the International Conference on Language Resources and Evaluation (LREC), pp.360-363, 1967.

A. Bonafonte, J. Adell, I. Esquerra, S. Gallego et al., Corpus and voices for Catalan speech synthesis, Proceedings of the International Conference on Language Resources and Evaluation (LREC), pp.3325-3329, 2008.

N. Braunschweiler, M. J. Gales, and S. Buchholz, Lightly supervised recognition for automatic alignment of large coherent speech recordings, 2010.

L. Calliope, . Parole, P. {. Masson, S. Cassidy, and J. Harrington, Emu : An enhanced hierarchical speech data management system [Cassidy2001] S Cassidy. Multi-level annotation in the Emu speech database management system [Charpentier1989] Francis Charpentier and Eric Moulines. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones A Perceptual Study of Acceleration Parameters in HMM-Based TTS Design of tree-based context clustering for an hmm-based thai speech synthesis system, Proc. of Interspeech Proc. of the 6th Australian Int. Speech Science and Technology Conf. Proceedings of the European Conference on Speech Communication and Technology (Eurospeech) Proceedings of the International Conference on Speech Communication and Technology (Interspeech), number September Proceedings of the Speech Synthesis Workshop (SSW), pp.2222-2225, 1989.

A. Cornuéjols, L. Di-cristodominguez1997, ]. A. Domínguez, M. Y. De-vegadonovan1995, ]. R. Donovan et al., The french review Interpréter la prosodie Lexical inhibition from syllabic units in visual word recognition Automatic speech synthesiser parameter estimation using hmms Trainable speech synthesis Phrase splicing and variable substitution using the ibm trainable speech synthesis system Remaking speech, XXIIèmes Journées d' ´ Etudes sur la Parole Proceedings of the international Conference on Acoustics, Speech and Signal Processing Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.1-14401, 1939.

T. Dutoit, V. Pagel, N. Pierret, F. Bataille, and O. van der Vrecken, The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes, Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP '96), pp.1393-1396, 1996.
DOI : 10.1109/ICSLP.1996.607874

D. Erro, I. Sainz, I. Luengo, I. Odriozola et al., HMM-based speech synthesis in Basque language using HTS, Proceedings of the FALA, 2010.

G. Fant, Acoustic Theory of Speech Production. Mouton, The Hague, 1960.

I. Fónagy, Des fonctions de l'intonation : essai de synthèse, Flambeau, vol.29, pp.1-20, 2003.

H. François, Synthèse de la parole par concaténation d'unités acoustiques : construction et exploitation d'une base de parole continue.

T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, An adaptive algorithm for mel-cepstral analysis of speech, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp.137-140, 1992.

Z. Hanzlíček, Czech HMM-based speech synthesis, Proceedings of the Text, Speech and Dialogue Conference (TSD), pp.291-298, 2010.

Using 5 ms segments in concatenative speech synthesis, Fifth ISCA Workshop on Speech Synthesis, 2004.

J. Andrew, A. W. Hunt, S. Black, K. Imai, C. Sumita et al., Unit selection in a concatenative speech synthesis system using a large speech database P800 : Methods for objective and subjective assessment of quality Mel log spectrum approximation (mlsa) filter for speech synthesis, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing Electronics and Communications in Japan (Part I : Communications)Imai1988] S. Imai. Unbiased estimator of log spectrum and its application to speech signal processing. Proceedings of EURASIP, pp.373-37610, 1983.

I. Ipsic and S. Martincic-Ipsic, Croatian HMM-based speech synthesis, Journal of Computing and Information Technology, vol.14, issue.4, pp.307-313, 2006.

[. Kamina, S. Paris, P. Karabetsos, A. Tsiakoulis, S. Chalamandaris et al., Hmmbased speech synthesis for the greek language [Kawahara1999] Hideki Kawahara, Ikuyo Masuda-katsuse, and Alain De Cheveign. Restructuring speech representations using a pitch-adaptive time frequency smoothing and an instantaneous-frequency-based F0 extraction : Possible role of a repetitive structure in sounds 1 Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT Unit size in unit selection speech synthesis, Proceedings of the Text, Speech and Dialogue Conference (TSD)Kawahara2001] Hideki Kawahara, Jo Estill, and Osamu Fujimura Proceedings of the Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA) Proceedings of EUROSPEECH, pp.349-356187, 1999.

B. Alan, W. Black, S. Krstulovic, A. Hunecke, M. Schröder-akira-kurematsu et al., An hmm-based speech synthesis system applied to german and its adaptation to a limited set of expressive football announcements {ATR} japanese speech database as a tool of speech recognition and synthesis Continuous f0 in the source-excitation generation for hmm-based tts : Do we need voiced/unvoiced classification An hmm-based text-to-speech system applied to swedish, Proceedings of the European Conference on Speech Communication and Technology (Eurospeech) Proceedings of the international Conference on Acoustics, Speech and Signal Processing (ICASSP)Leon1992] Léon Pierre. Phonétisme et prononciations du français. Nathan-Université, 1992. [Lundgren2005] A. LundgrenKTH), 2005. [Maia2003] R. Maia, H. Zen, K. Tokuda, T. Kitamura, and F. Resende Jr. Towards the development of a brazilian portuguese text-to-speech system based on hmm Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), pp.357-363, 1990.

F. Malfrère, T. Dutoit, P. Mertens [Masuko1996] Takashi Masuko, K. Tokuda et al., Fully automatic prosody generator for text-to-speech Speech synthesis using HMMs with dynamic features Accentuation, intonation et morphosyntaxe La synthèse de l'intonation à partir de structures syntaxiques riches, Proceedings of the International Conference on Spoken Language Processing Proceedings of the International Conference on Acoustics, Speech, and Signal Processing [Moreno2009] P.J. Moreno and C. Alberti. A factor automaton approach for the forced alignment of long speech recordings Proc. of IEEE ICASSP, pp.389-39295, 1992.

E. Moulines, F. Emerard, D. Larreur, J. L. Le Saint-Milon et al., A real-time French text-to-speech system generating high-quality synthetic speech, International Conference on Acoustics, Speech, and Signal Processing, pp.309-312, 1990.
DOI : 10.1109/ICASSP.1990.115650

J. Odell, The use of context in large vocabulary speech recognition, 1995.

]. J. Pierrehumbert1990, P. B. Pierrehumbert-julia-hirschbergalessandro, F. De-mareuil-qian, Y. Soong, M. Chen et al., Intentions in communication Prosody synthesis by unit selection and transplantation on diphones An hmm-based mandarin chinese text-to-speech system A tutorial on hidden Markov models and selected applications in speech recognition Speaker identification and verification using Gaussian mixture speaker models Efficient spectral envelope estimation and its application to pitch shifting and envelope preservation Time and space-efficient architecture for a corpus-based text-to-speech synthesis system, Speech Synthesis Proceedings of 2002 IEEE Workshop on Chinese Spoken Language Processing Proceedings of the IEEE Proceedings of the 8th International Conference on Digital Audio EffectsRojc2007] Matej Rojc and Zdravko Ka?i?, pp.271-119, 1989.

The source-filter model lives (if you are careful), Voice Foundation 37th Annual Symposium.

M. Russell and R. Moore, Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.5-8, 1985.

M. Shannon, H. Zen, W. Byrne, H. Silen, E. Helander et al., The Effect of Using Normalized Models in Statistical Speech Synthesis MDL-based context-dependent subword modeling for speech recognition Evaluation of finnish unit selection and hmm-based speech synthesis Analysis of Duration Prediction Accuracy in HMM-Based Speech Synthesis Prediction of Voice Aperiodicity Based on Spectral Representations in HMM Speech Synthesis, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing Proceedings of the International Conference on Speech Communication and Technology (Interspeech)Shinoda2000] Koichi Shinoda and Takao Wanabe Proceedings of the International Conference on Speech Communication and Technology (Interspeech), 2008. [Silen2010] Hanna Silén, Elina Helander, Jani Nurminen, and Moncef Gabbouj Proceedings of speech prosody, 2010. [Silen2011] Hanna Silen, Elina Helander, and Moncef Gabbouj Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech) : A standard for labeling english prosody Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp.679-682, 1988.

A. Simon, M. Avanzi, and J. P. Goldman, La détection des proéminences syllabiques. Un aller-retour entre l'annotation manuelle et le traitement automatique, Proceedings of Congrès Mondial de Linguistique Française, pp.1673-1686, 2008.

C. Sorin, M. Stella, and A. Aggoun, Règles prosodiques et synthèse de parole "multi-style", Symposium Franco-Soviétique sur le Dialogue Homme-Machine, 1984.

S. Stevens, J. Volkmann, and E. B. Newman, A Scale for the Measurement of the Psychological Magnitude Pitch, The Journal of the Acoustical Society of America, vol.8, issue.3, pp.185-190, 1937.
DOI : 10.1121/1.1915893

Y. Stylianou, O. Cappé, and E. Moulines, Continuous probabilistic transform for voice conversion, IEEE Transactions on Speech and Audio Processing, vol.6, issue.2, pp.131-142, 1998.

Y. Tao, L. Xueqing, W. Bian-alan, W. Black, and R. Caley, A dynamic alignment algorithm for imperfect speech and transcript, Proceedings of the ISCA Speech Synthesis Workshop (SSW), pp.75-84, 1998.
DOI : 10.2298/CSIS1001075T

]. P. Taylor2000, W. Taylor-america-alan, K. Black, T. Tokuda, and . Masuko, Text-to-speech synthesis Spectral Conversion Based on Maximum Likelehood Estimation Considering Global Variance of Converted Parameter Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis An algorithm for speech parameter generation from continuous mixture hmms with dynamic features Speech parameter generation from HMM using dynamic features Hidden Markov models based on multi-space probability distribution for pitch pattern modeling An hmm-based speech synthesis system applied to english, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ICASSP, 2005. [Toda2005a] Tomoki Toda and Keiichi Tokuda Proceedings of the International Conference on Speech Communication and Technology (Interspeech) Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), 1995. [Tokuda1995a] Keiichi Tokuda, Takao Kobayashi, and Satoshi Imai Proceedings of the International Conference on Acoustics and Speech Signal Processing (ICASSP) Proceedings of the International Conference on Acoustics, Speech, and Signal ProcessingTokuda2000] Keiichi Tokuda, Heiga Zen, and Alan W Black Proceedings of the Speech Synthesis Workshop (SSW)Tokuda2000a] Keiichi Tokuda, Takashi Masuko, Noboru Miyazaki, and Takao Kobayashi . Multi-Space Probability Distribution HMM. IEICE Transactions on Information and Systems, pp.1697-1706, 1995.

K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, T. Tokuda et al., Speech parameter generation algorithms for HMM-based speech synthesis An HMM-based speech synthesis system applied to English IrcamCorpusTools : an extensible platform for speech corpora exploitation, to appear in [Viterbi1967] A. Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. Information Theory The Role of Higher-Level Linguistic Features in HMM-Based Speech Synthesis, Proceedings of the International Conference on Acoustics and Speech Signal Processing (ICASSP) Proceedings of the Language Resources and Evaluation Conference Junichi Yamagishi, and Simon King Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech), pp.1315-1318260, 1967.

Y. Wu and R. Wang, Minimum generation error training for HMM-based speech synthesis, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.89-92, 2006.

Y. Wu, K. Yamagishi, K. Takao, S. Renals, S. King et al., Adaptative training for hidden semi-Markov model An introduction to hmm-based speech synthesis Tokyo Institute of Technology Improved Average-Voice-based Speech Synthesis Using Gender-Mixed Modeling and a Parameter Generation Algorithm Considering GV Speaker-Independent HMM-based Speech Synthesis System -HTS-2007 System for the Blizzard Challenge The HTS-2008 System : Yet Another Evaluation of the Speaker- Adaptive HMM-based Speech Synthesis System in The 2008 Blizzard Challenge Robustness of hmm-based speech synthesis Evaluation of Prosodic Contextual Factors for HMM-Based Speech Synthesis Duration Modeling For HMM-Based Speech Synthesis, Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing Proceedings of the ISCA Tutorial and Research Workshop on Speech Synthesis (SSW6) Proceedings of the ISCA Tutorial and Research Workshop on Speech Synthesis (SSW6) Blizzard Challenge Proceedings of the International Conference on Speech Communication and Technology (Interspeech) Takashi Nose, and Takao Kobayashi proceedings of Interspeech Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp.365-368, 1998.

T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), pp.2347-2350, 1999.

T. Yoshimura et al., Mixed excitation for HMM-based speech synthesis, Proceedings of the European Conference on Speech Communication and Technology (Eurospeech).

The HTK hidden Markov model toolkit: design and philosophy, 1993.

S. J. Young, J. J. Odell, and P. C. Woodland, Tree-based state tying for high accuracy acoustic modelling, Proceedings of the workshop on Human Language Technology (HLT), pp.307-312, 1994.

B. Zellner, Caractérisation et prédiction du débit de parole en français, 1998.

H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, A Hidden Semi-Markov Model-Based Speech Synthesis System, Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp.1397-1400, 2004.
DOI : 10.1093/ietisy/e90-d.5.825

H. Zen, T. Zen, T. Toda, and K. Tokuda, An overview of Nitech HMM-based speech synthesis system for blizzard challenge The Nitech-NAIST HMMbased speech synthesis system for the Blizzard challenge, Proceedings of the 9th European Conference on Speech Communicationand Technology (Eurospeech) Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP). Nitech, 2006. [Zen2007a] Heiga Zen, Keiichi Tokuda, and Tadashi Kitamura. Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences. Computer Speech and Language, pp.153-173, 2005.

H. Zen, K. Tokuda, A. W. Black-olivier-boëffard, L. Charonnat, S. Le-maguer et al., Review : Statistical parametric speech synthesis Vers une annotation automatique de corpus audio pour la synthèse de parole ATALA/AFCP On the use of windows for harmonic analysis with the discrete fourier transform Automatic Building of Synthetic Voices from Large Multi-Paragraph Speech Databases, Proceedings of the Joint Conference JEP-TALN- RECITAL Proceedings of the IEEE Proc. of Interspeechspeech systems in french. XIème Congrès International des Sciences Phonétiques, pp.511039-1064, 1978.

K. Yu and S. Young, Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.5, pp.1071-1079, 2011.
DOI : 10.1109/TASL.2010.2076805