J. Allen, M. S. Hunnicutt, and D. H. Klatt, From Text to Speech: The MITalk System, 1987.

M. Astrinaki, N. d'Alessandro, B. Picart, T. Drugman, and T. Dutoit, Reactive and continuous control of HMM-based speech synthesis, 2012 IEEE Spoken Language Technology Workshop (SLT), pp.252-257, 2012.
DOI : 10.1109/SLT.2012.6424231

URL : http://tcts.fpms.ac.be/~drugman/files/SLT12-Astrinaki.pdf

M. Astrinaki, Performative Statistical Parametric Speech Synthesis Applied to Interactive Designs, PhD thesis, pp.76-104, 2014.

G. Bailly and M. Alissali, Compost : un serveur de synthèse de parole multilingue, Traitement du Signal, vol.9, issue.4, pp.359-366, 1992.

G. Bailly and I. Gorisch, Generating German intonation with a trainable prosodic model, Proceedings of Interspeech, pp.2366-2369, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00366490

G. Bailly and C. Gouvernayre, Pauses and respiratory markers of the structure of book reading, Proceedings of Interspeech, pp.2218-2221, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00741667

S. Bangalore, Real-time incremental speech-to-speech translation of dialogs, Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.437-445, 2012.

T. Baumann, Decision tree usage for incremental parametric speech synthesis, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.3819-3823, 2014.
DOI : 10.1109/ICASSP.2014.6854316

T. Baumann, Incremental Spoken Dialogue Processing: Architecture and Lower-level Components, PhD thesis, 2013.

T. Baumann and D. Schlangen, Evaluating Prosodic Processing for Incremental Speech Synthesis, Proceedings of Interspeech, pp.438-441, 2012.

N. Beuck, A. Köhn, and W. Menzel, Decision Strategies in Incremental PoS Tagging, Proceedings of the 18th Nordic Conference of Computational Linguistics (NODALIDA), pp.26-33, 2011.

J. A. Bilmes, A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models, International Computer Science Institute, vol.4, issue.510, p.126, 1998.

P. Blache and S. Rauzy, Le module de reformulation iconique de la Plateforme de Communication Alternative, Actes de la 14ème conférence sur le Traitement Automatique des Langues Naturelles (TALN), pp.519-528, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00173546

J. Bloit and X. Rodet, Short-time Viterbi for online HMM decoding: Evaluation on a real-time phone recognition task, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2121-2124, 2008.
DOI : 10.1109/ICASSP.2008.4518061

URL : https://hal.archives-ouvertes.fr/hal-01161222

F. Bocquelet, Toward a Brain-Computer Interface for speech rehabilitation, 2017.

R. Boite, Traitement de la parole, Presses Polytechniques et Universitaires Romandes, 2000.

D. Braga, L. Coelho, and F. G. V. Resende, A rule-based grapheme-to-phone converter for TTS systems in European Portuguese, 2006 International Telecommunications Symposium, pp.328-333, 2006.
DOI : 10.1109/ITS.2006.4433293

T. Brants, TnT: a statistical part-of-speech tagger, Proceedings of the Sixth Conference on Applied Natural Language Processing, pp.224-231, 2000.
DOI : 10.3115/974147.974178

L. Breiman, Classification and regression trees, p.30, 1984.

D. Büring, Syntax, information structure, and prosody, pp.860-895, 2013.
DOI : 10.1017/CBO9780511804571.029

H. Buschmeier and S. Kopp, Towards Conversational Agents That Attend to and Adapt to Communicative User Feedback, Proceedings of the 11th International Conference on Intelligent Virtual Agents, pp.169-182, 2011.
DOI : 10.1007/978-3-642-23974-8_19

URL : https://www.techfak.uni-bielefeld.de/%7Ehbuschme/publications/BuschmeierKopp-2011-IVA-preprint.pdf

D. Cadic, Optimised voice creation for unit-selection synthesis. Theses, p.12, 2011.
URL : https://hal.archives-ouvertes.fr/tel-01085379

H. Che, J. Tao, and S. Pan, Letter-to-sound conversion using coupled Hidden Markov Models for lexicon compression, 2012 International Conference on Speech Database and Assessments, pp.141-144, 2012.
DOI : 10.1109/ICSDA.2012.6422464

S. F. Chen and J. Goodman, An empirical study of smoothing techniques for language modeling, Computer Speech & Language, vol.13, pp.359-394, 1999.

R. Collobert, Natural language processing (almost) from scratch, Journal of Machine Learning Research, vol.12, pp.2493-2537, 2011.

P. Combescure, listes de dix phrases phonétiquement équilibrées, Revue d'acoustique, vol.56, 1981.

M. Constant, Intégrer des connaissances linguistiques dans un CRF : application à l'apprentissage d'un segmenteur-étiqueteur du français, TALN, vol.1, 2011.

C. d'Alessandro, 33 ans de synthèse de la parole à partir du texte : une promenade sonore (1968-2001), pp.129-172, 2001.

N. d'Alessandro and T. Dutoit, HandSketch bi-manual controller: investigation on expressive control issues of an augmented tablet, Proceedings of the 7th International Conference on New Interfaces for Musical Expression, pp.78-81, 2007.

A. de Cheveigné and H. Kawahara, YIN, a fundamental frequency estimator for speech and music, The Journal of the Acoustical Society of America, vol.111, issue.4, pp.1917-1930, 2002.
DOI : 10.1121/1.1458024

A. Di Cristo, Interpréter la prosodie, pp.13-29, 2000.

T. Drugman, Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.3793-3796, 2009.
DOI : 10.1109/ICASSP.2009.4960453

URL : http://tcts.fpms.ac.be/~drugman/files/ICASSP09.pdf

T. Drugman, G. Wilfart, and T. Dutoit, A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis, Proceedings of Interspeech, pp.1779-1782, 2009.

T. Dutoit, pHTS for Max/MSP: A Streaming Architecture for Statistical Parametric Speech Synthesis, QPSR of the numediart research program, vol.4, issue.1, pp.7-11, 2011.

J. Edlund, Incremental speech synthesis, Proceedings of Swedish Language Technology Conference, pp.53-54, 2008.

Y. Fan, Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4475-4479, 2015.
DOI : 10.1109/ICASSP.2015.7178817

S. Ferrari and F. Cribari-Neto, Beta Regression for Modelling Rates and Proportions, Journal of Applied Statistics, vol.31, issue.7, pp.799-815, 2004.
DOI : 10.1080/0266476042000214501

URL : http://orfe.princeton.edu/~alaink/NJ_aTaxiOrf467F12/Papers/Machine Learning/beta.pdf

L. Feugère, Gestural control of singing voice synthesis by rules and musical applications. Theses, p.11, 2013.

G. D. Forney, The Viterbi algorithm, Proceedings of the IEEE, vol.61, issue.3, pp.268-278, 1973.
DOI : 10.1109/PROC.1973.9030

J. Giménez and L. Marquez, SVMTool : A general POS tagger generator based on Support Vector Machines, Proceedings of the 4th International Conference on Language Resources and Evaluation, p.18, 2004.

C. Gini, Variabilità e mutabilità, reprinted in Memorie di metodologica statistica (E. Pizetti and T. Salvemini, eds.), Rome: Libreria Eredi Virgilio Veschi, 1912.

B. B. Greene and G. M. Rubin, Automated grammatical tagging of English, p.18, 1971.

M. Guéguin, Evaluation objective de la qualité vocale en contexte de conversation, PhD thesis, 2006.

S. Hahn, Improving LVCSR with hidden conditional random fields for grapheme-to-phoneme conversion, Proceedings of Interspeech, pp.495-499, 2013.

S. Hochreiter and J. Schmidhuber, Long Short-Term Memory, Neural Computation, vol.9, issue.8, pp.1735-1780, 1997.
DOI : 10.1162/neco.1997.9.8.1735

T. Hothorn, F. Bretz, and P. Westfall, Simultaneous Inference in General Parametric Models, Biometrical Journal, vol.16, issue.3, pp.346-363, 2008.
DOI : 10.18637/jss.v016.i09

URL : http://epub.ub.uni-muenchen.de/2120/1/tr019.pdf

T. Hueber, Reconstitution de la parole par imagerie ultrasonore et vidéo de l'appareil vocal : vers une communication parlée silencieuse, PhD thesis, pp.50-51, 2009.

A. J. Hunt and A. W. Black, Unit selection in a concatenative speech synthesis system using a large speech database, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp.373-376, 1996.
DOI : 10.1109/ICASSP.1996.541110

S. Imai, Cepstral analysis synthesis on the mel frequency scale, ICASSP '83. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1983.
DOI : 10.1109/ICASSP.1983.1172250

D. D. Palmer, Text Preprocessing, in N. Indurkhya and F. J. Damerau (eds.), Handbook of Natural Language Processing, p.9, 2010.

F. Jelinek and C. Chelba, Putting language into language modeling, 1999.

S. Jiampojamarn and G. Kondrak, Letter-phoneme alignment : An exploration, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.780-788, 2010.

O. Karaali, G. Corrigan, and I. Gerson, Speech Synthesis with Neural Networks, World Congress on Neural Networks : International Neural Network Society 1996 Annual Meeting, pp.45-45, 1996.

C. Karat, Patterns of entry and correction in large vocabulary continuous speech recognition systems, Proceedings of the SIGCHI Conference on Human Factors in Computing Systems: The CHI is the Limit, CHI '99, pp.568-575, 1999.
DOI : 10.1145/302979.303160

D. H. Klatt, Review of text-to-speech conversion for English, The Journal of the Acoustical Society of America, vol.82, issue.3, pp.737-793, 1987.
DOI : 10.1121/1.395275

J. Kominek and A. W. Black, The CMU Arctic speech databases, Proceedings of Fifth ISCA ITRW on Speech Synthesis (SSW5), 2004.

R. F. Kubichek, Mel-cepstral distance measure for objective speech quality assessment, Proceedings of IEEE Pacific Rim Conference on Communications Computers and Signal Processing, pp.125-128, 1993.
DOI : 10.1109/PACRIM.1993.407206

P. Lanchantin, G. Degottex, and X. Rodet, A HMM-based speech synthesis system using a new glottal source and vocal-tract separation method, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4630-4633, 2010.
DOI : 10.1109/ICASSP.2010.5495550

URL : https://hal.archives-ouvertes.fr/hal-01161230

T. Lavergne, O. Cappé, and F. Yvon, Practical Very Large Scale CRFs, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.504-513, 2010.

S. Le Beux, Gestural control of prosody and voice quality. Theses, p.11, 2009.
URL : https://hal.archives-ouvertes.fr/tel-00618427

S. Le Maguer, Évaluation expérimentale d'un système statistique de synthèse de la parole, HTS, pour la langue française, PhD thesis, Université de Rennes 1, 2013.
URL : https://hal.archives-ouvertes.fr/tel-00913565

M. E. Lesk and E. Schmidt, Lex : A lexical analyzer generator, NJ, p.17, 1975.

S. E. Levinson, J. P. Olive, and J. S. Tschirgi, Speech synthesis in telecommunications, IEEE Communications Magazine, vol.31, issue.11, pp.46-53, 1993.
DOI : 10.1109/35.256873

D. D. Lewis, RCV1: A new benchmark collection for text categorization research, Journal of Machine Learning Research, vol.5, pp.361-397, 2004.

W. Ling, Two/Too Simple Adaptations of Word2Vec for Syntax Problems, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2015.
DOI : 10.3115/v1/N15-1142

R. Maia, An excitation model for HMM-based speech synthesis based on residual modeling, Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007.

S. Mallat, A wavelet tour of signal processing, Academic Press, 1999.

M. P. Marcus, M. A. Marcinkiewicz, and B. Santorini, Building a large annotated corpus of English: The Penn Treebank, Computational Linguistics, vol.19, pp.313-330, 1993.
DOI : 10.21236/ADA273556

T. Masuko, Voice characteristics conversion for HMM-based speech synthesis system, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1611-1614, 1997.
DOI : 10.1109/ICASSP.1997.598807

URL : ftp://robert.ip.titech.ac.jp/pub/pdf/icassp97-synHMM.pdf

M. F. Mctear, Spoken dialogue technology : toward the conversational user interface, p.20, 2004.
DOI : 10.1007/978-0-85729-414-2

T. Mikolov, Distributed representations of words and phrases and their compositionality, Advances in Neural Information Processing Systems, Lake Tahoe, pp.3111-3119, 2013.

R. Mitton, A computer-usable dictionary file based on the Oxford Advanced Learner's Dictionary of Current English, 1992.

D. Mori, S. Matsubara, and Y. Inagaki, Incremental parsing for interactive natural language interface, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236), pp.2880-2885, 2001.
DOI : 10.1109/ICSMC.2001.971946

G. Nicol, Flex: The Lexical Scanner Generator, Free Software Foundation, Cambridge, 1993.

G. Olaszy, G. Gordos, and G. Németh, The MULTIVOX multilingual text-to-speech converter, Talking Machines: Theories, Models and Applications, pp.385-411, 1992.

V. Pagel, K. Lenzo, and A. Black, Letter to sound rules for accented lexicon compression, 5th International Conference on Spoken Language Processing, 1998.

I. Peretz and K. L. Hyde, What is specific to music processing? Insights from congenital amusia, Trends in Cognitive Sciences, vol.7, issue.8, pp.362-367, 2003.
DOI : 10.1016/s1364-6613(03)00150-5

URL : http://www.brams.umontreal.ca/plab/downloads/PeretzHyde03.pdf

O. Perrotin, Singing with hands : chironomic interfaces for digital musical instruments. Theses, p.12, 2015.
URL : https://hal.archives-ouvertes.fr/tel-01231209

H. R. Pfitzinger, Local Speech Rate As A Combination Of Syllable And Phone Rate, Proceedings of International Conference on Spoken Language Processing (ICSLP), pp.1087-1090, 1998.

M. Pouget, T. Hueber, G. Bailly, and T. Baumann, HMM training strategy for incremental speech synthesis, Proceedings of Interspeech, pp.1201-1205, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01228889

M. Pouget, Adaptive Latency for Part-of-Speech Tagging in Incremental Text-to-Speech Synthesis, Interspeech 2016, pp.2846-2850, 2016.
DOI : 10.21437/Interspeech.2016-165

URL : https://hal.archives-ouvertes.fr/hal-01374782

T. Raitio, Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4564-4567, 2011.
DOI : 10.1109/ICASSP.2011.5947370

K. Rao, Grapheme-to-phoneme conversion using Long Short-Term Memory recurrent neural networks, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4225-4229, 2015.
DOI : 10.1109/ICASSP.2015.7178767

URL : http://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/43264.pdf

M. S. Ribeiro, J. Yamagishi, and R. A. Clark, A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis, Proceedings of Interspeech, pp.1586-1590, 2015.

D. L. Richards, Telecommunication by Speech: The Transmission Performance of Telephone Networks, Butterworths, London, 1973.

C. P. Rosé, A. Roque, and D. Bhembe, An efficient incremental architecture for robust interpretation, Proceedings of the Second International Conference on Human Language Technology Research, pp.307-312, 2002.

M. Rossi, L'intonation : de l'acoustique à la sémantique, Klincksieck, Paris, 1981.

Y. Sagisaka, Speech synthesis by rule using an optimal selection of nonuniform synthesis units, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.679-682, 1988.

C. N. dos Santos and B. Zadrozny, Learning Character-level Representations for Part-of-Speech Tagging, Proceedings of the 31st International Conference on Machine Learning, pp.1818-1826, 2014.

M. Schröder and J. Trouvain, The German text-to-speech synthesis system MARY: A tool for research, development and teaching, International Journal of Speech Technology, vol.6, issue.4, pp.365-377, 2003.

A. Seward, Low-latency incremental speech transcription in the Synface project, Proceedings of Interspeech, pp.1141-1144, 2003.

K. Shinoda and T. Watanabe, MDL-based context-dependent subword modeling for speech recognition, The Journal of the Acoustical Society of Japan (E), vol.21, issue.2, pp.79-86, 2000.
DOI : 10.1250/ast.21.79

N. Sokolovska, Efficient Learning of Sparse Conditional Random Fields for Supervised Sequence Labeling, IEEE Journal of Selected Topics in Signal Processing, vol.4, issue.6, pp.953-964, 2010.
DOI : 10.1109/jstsp.2010.2076150

Y. Stylianou, Harmonic plus Noise Models for Speech combined with Statistical Methods, for Speech and Speaker Modification, PhD thesis, Ecole Nationale Supérieure des Télécommunications, 1996.

X. Sun, Structure regularization for structured prediction, Advances in Neural Information Processing Systems, pp.2402-2410, 2014.

M. Sundermeyer, The RWTH 2010 Quaero ASR evaluation system for, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, pp.2212-2215, 2011.
DOI : 10.1109/icassp.2011.5946920

A. Suni, Wavelets for intonation modeling in HMM speech synthesis, 8th ISCA Speech Synthesis Workshop, pp.285-290, 2013.

M. Tamura, Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.805-808, 2001.
DOI : 10.1109/ICASSP.2001.941037

P. Taylor and A. W. Black, Concept-to-speech synthesis by phonological structure matching, Proceedings of Eurospeech, pp.623-626, 1999.
DOI : 10.1098/rsta.2000.0594

URL : http://crow.ee.washington.edu/people/bulyko/papers/Taylor_2000_a.ps

P. Taylor, A. W. Black, and R. Caley, The Architecture of the Festival Speech Synthesis System, Proceedings of the 3rd ESCA/COCOSDA Workshop on Speech Synthesis, Australia, pp.147-151, 1998.

K. Tokuda, Hidden Markov models based on multi-space probability distribution for pitch pattern modeling, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), p.229, 1999.
DOI : 10.1109/ICASSP.1999.758104

K. Tokuda, Speech parameter generation algorithms for HMM-based speech synthesis, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), pp.1315-1318, 2000.
DOI : 10.1109/ICASSP.2000.861820

K. Tokuda, Y. Nankaku, T. Toda, H. Zen, J. Yamagishi et al., Speech Synthesis Based on Hidden Markov Models, Proceedings of the IEEE, vol.101, issue.5, pp.1234-1252, 2013.
DOI : 10.1109/JPROC.2013.2251852

K. Tokuda, T. Masuko, and N. Miyazaki, Multi-Space Probability Distribution HMM, IEICE Transactions on Information and Systems, vol.E85-D, issue.3, pp.455-464, 2002.

S. Toma and D. Munteanu, Rule-based automatic phonetic transcription for the Romanian language, Computation World: Future Computing, Service Computation, pp.682-686, 2009.

S. Toma, On letter to sound conversion for Romanian: A comparison of five algorithms, 2013 7th Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp.16-20, 2013.
DOI : 10.1109/SpeD.2013.6682664

K. Toutanova, Feature-rich part-of-speech tagging with a cyclic dependency network, Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, NAACL '03, pp.173-180, 2003.
DOI : 10.3115/1073445.1073478

C. Tsai, Hierarchical prosody modeling of English speech and its application to TTS, 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), pp.1-6, 2014.

J. Vaissière, La phonétique, 2nd revised edition, Presses Universitaires de France, 2011.

D. Wang and S. King, Letter-to-sound pronunciation prediction using conditional random fields, IEEE Signal Processing Letters, vol.18, pp.122-125, 2011.
DOI : 10.1109/lsp.2010.2098440

URL : http://www.cstr.inf.ed.ac.uk/downloads/publications/2011/wang_ieeesigprocletters2011.pdf

T. Yoshimura, Duration modeling for HMM-based speech synthesis, 1998.

S. J. Young, J. J. Odell, and P. C. Woodland, Tree-based state tying for high accuracy acoustic modelling, Proceedings of the Workshop on Human Language Technology, HLT '94, pp.307-312, 1994.
DOI : 10.3115/1075812.1075885

URL : http://edward.comp.lancs.ac.uk/acl/H/H94/H94-1062.pdf

S. Yu, Hidden semi-Markov models, Artificial Intelligence, vol.174, issue.2, pp.215-243, 2010.
DOI : 10.1016/j.artint.2009.11.011

F. Yvon, Une petite introduction au Traitement Automatique des Langues Naturelles, 2010.

H. Zen, Acoustic Modeling in Statistical Parametric Speech Synthesis: From HMM to LSTM-RNN, Proceedings of Workshop on Machine Learning in Spoken Language Processing (MLSLP), Aizu-Wakamatsu city, 2015.

H. Zen and H. Sak, Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015.
DOI : 10.1109/ICASSP.2015.7178816

URL : http://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/43266.pdf

H. Zen, A. Senior, and M. Schuster, Statistical parametric speech synthesis using deep neural networks, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.7962-7966, 2013.

T. Baumann, D. Schlangen-alessandro, T. Wang, S. Chen, and I. , Incremental speech synthesis; The INPROTK 2012 release; MAGE: A Platform for Performative Speech Synthesis, New Approach in Exploring Applications Beyond Text-To-Speech; Syntax, information structure and prosody, in The Cambridge Handbook of Generative Syntax; Proceedings of Swedish Language Technology Conference; Proceedings of NAACL-HLT Workshop on Future Directions and Needs in the Spoken Dialog Community: Tools and Data; Proceedings of The Listening Talker Workshop, pp.53-54, 2008.

C. Liao, T. Chiang, T. Baumann, T. Kobayashi, and T. Kitamura, Hierarchical prosody modeling of English speech and its application to TTS; Decision tree usage for incremental parametric speech synthesis; Partial Representations Improve the Prosody of Incremental Speech Synthesis; Français, langue sans accent?; Speech parameter generation algorithms for HMM-based speech synthesis; 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA); Proceedings of ICASSP; Proceedings of Interspeech, pp.1-6, 1980.

T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, Proceedings of Eurospeech, pp.2347-2350, 1999.

M. Alissali and G. Bailly, COMPOST: a client-server model for applications using text-to-speech systems, Proceedings of European Conference on Speech Communication and Technology, pp.2095-2098, 1993.

T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, An adaptive algorithm for mel-cepstral analysis of speech, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.137-140, 1992.
DOI : 10.1109/ICASSP.1992.225953

N. W. Campbell, Segment durations in a syllable frame, J. Phon, vol.19, issue.1, pp.37-47, 1991.

N. Beuck, A. Köhn, and W. Menzel, Predictive incremental parsing and its evaluation, Computational Dependency Theory, p.186, 2013.

D. Schlangen and G. Skantze, A general, abstract model of incremental dialogue processing, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pp.710-718, 2009.

T. Baumann and D. Schlangen, The INPROTK 2012 release, Proceedings of the 12th Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp.29-32, 2012.

C. D. Manning and H. Schütze, Foundations of statistical natural language processing, 1999.

K. Toutanova and C. D. Manning, Enriching the knowledge sources used in a maximum entropy part-of-speech tagger, Proceedings of the 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, pp.63-70, 2000.
DOI : 10.3115/1117794.1117802
