Luc Ardaillon, Gilles Degottex and Axel Roebel. A multilayer F0 model for singing voice synthesis using a B-spline representation with intuitive controls, Interspeech 2015, pp.2015-138, 2015. ,
Reactive and continuous control of HMM-based speech synthesis, 2012 IEEE Spoken Language Technology Workshop (SLT), pp.252-257, 2012. ,
DOI : 10.1109/SLT.2012.6424231
URL : http://tcts.fpms.ac.be/~drugman/files/SLT12-Astrinaki.pdf
Speech Analysis and Synthesis by Linear Prediction of the Speech Wave, The Journal of the Acoustical Society of America, vol.50, issue.2B, pp.637-655, 1971. ,
DOI : 10.1121/1.1912679
Analyzing linguistic data : A practical introduction to statistics using R, 2008. ,
DOI : 10.1017/CBO9780511801686
Theory, implementation and applications of nonstationary Gabor frames, Journal of Computational and Applied Mathematics, vol.236, issue.6, pp.1481-1496, 2011. ,
DOI : 10.1016/j.cam.2011.09.011
URL : https://doi.org/10.1016/j.cam.2011.09.011
Introducing the Geneva Multimodal expression corpus for experimental research on emotion perception, Emotion, vol.12, issue.5, pp.1161-1179, 2012. ,
DOI : 10.1037/a0025827
Characterisation of rhythmic patterns for text-to-speech synthesis, Speech Communication, vol.15, issue.1-2, pp.127-137, 1994. ,
DOI : 10.1016/0167-6393(94)90047-7
URL : http://www.icp.inpg.fr/~bailly/publis/synthese/_pb/rythme_pb_SCOM94.ps
Abstractness in speech-metronome synchronisation : P-centres as cyclic attractors, Interspeech, pp.1441-1444, 2005. ,
A model of segmental duration for speech synthesis in French, Speech Communication, vol.6, issue.3, pp.245-260, 1987. ,
DOI : 10.1016/0167-6393(87)90029-X
Steven Walker et al. lme4 : Linear mixed-effects models using Eigen and S4. R package version, pp.1-23 ,
, Designing interaction, not interfaces, Proceedings of the working conference on Advanced visual interfaces, pp.15-22, 2004.
, Bibliographie
, Mieux penser les interfaces informatiques. 2016.
, Alain Berthoz. Le sens du mouvement. Odile Jacob, p.63, 1997.
Paul Boersma et al. Praat, a system for doing phonetics by computer, pp.341-345, 2002. ,
, Jordi Bonada. Automatic technique in frequency domain for near-lossless time-scale modification of audio, International Computer Music Conference, 2000.
Remarques sur le siège de la faculté du langage articulé, suivies d'une observation d'aphémie (perte de la parole) Bulletin et, pp.330-357, 1861. ,
Gestural specification using dynamically-defined articulatory structures, pp.95-144, 1990. ,
Tiers in articulatory phonology, with some implications for casual speech. Papers in laboratory phonology I : Between the grammar and physics of speech, pp.341-376, 1990. ,
Articulatory Phonology: An Overview, Phonetica, vol.49, issue.3-4, pp.155-180, 1992. ,
DOI : 10.1159/000261913
, Mélanie Canault. L'émergence du contrôle articulatoire au stade du babillage. Une étude acoustique et cinématique, p.53, 2007.
The digital age and speech technology for Chinese language teaching and learning, Journal-Chinese Language Teachers Association, vol.38, issue.2, pp.49-86, 2003. ,
, Quinghai Chen. Toward a Sequential Approach for Tonal Error Analysis, Deseret Language and Linguistic Society Symposium, pp.8-1993, 1993.
, Th Chiang. Some interferences of English intonation with Chinese tones, IRAL : International Review of Applied Linguistics in Language Teaching, vol.17, issue.3, p.245, 1979.
Visualization of tone for learning Mandarin Chinese, Proceedings of the 4th Pronunciation in Second Language Learning and Teaching Conference, pp.77-89 ,
, Perry Cook. Identification of control parameters in an articulatory vocal tract model, with applications to the synthesis of singing, 1991.
SPASM, a real-time vocal tract physical model controller ; and singer, the companion software synthesis system, Computer Music Journal, vol.17, issue.1, pp.30-44, 1993. ,
Real-time performance controllers for synthesized singing, Proceedings of the 2005 conference on New interfaces for musical expression, pp.236-237, 2005. ,
Handsketch bi-manual controller : Investigation on expressive control issues of an augmented tablet, Proceedings of the 7th international conference on New interfaces for musical expression, pp.78-81, 2007. ,
Advanced techniques for vertical tablet playing : an overview of two years of practicing the HandSketch, In NIME, pp.173-174, 2009. ,
Feride Çetin and Hannes Pirker. The speech conductor : gestural control of speech synthesis, eINTERFACE'05-Summer Workshop on Multimodal Interfaces, 2005. ,
Real-time CALM synthesizer new approaches in hands-controlled voice synthesis, Proceedings of the 2006 conference on New interfaces for musical expression, pp.266-271, 2006. ,
Ramcess : Realtime and accurate musical control of expression in singing synthesis, eINTERFACE'06-SIMILAR NoE Summer Workshop on Multimodal Interfaces, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-01712501
Chironomic stylization of intonation, The Journal of the Acoustical Society of America, vol.129, issue.72, pp.1594-1604, 2011. ,
Drawing melodies : Evaluation of chironomic singing synthesis, The Journal of the Acoustical Society of America, vol.135, issue.6, pp.3601-3612, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01621771
, Realtime and Accurate Musical Control of Expression in Voice Synthesis, 2009.
, João Antônio de Moraes and Albert Rilliard. Illocution, attitudes and prosody. Spoken Corpora and Linguistic Studies, pp.233-2014, 2014.
, Monika Dörfler. Quilted Gabor frames - A new concept for adaptive time-frequency representation, Advances in Applied Mathematics, vol.47, issue.4, pp.668-687, 2011.
Boris Doval and Christophe d'Alessandro. Spectral correlates of glottal waveform models : an analytic study, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1295-1298, 1997. ,
The voice source as a causal/anticausal linear filter, ISCA Tutorial and Research Workshop on Voice Quality : Functions, 2003. ,
URL : https://hal.archives-ouvertes.fr/hal-00371680
The spectrum of glottal flow models, Acta acustica united with acustica, vol.92, issue.6, pp.1026-1046, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-00368131
A synthetic speaker, Journal of the Franklin Institute, vol.227, issue.6, pp.739-764, 1939. ,
DOI : 10.1016/S0016-0032(39)90816-1
, Homer Dudley. Remaking speech, The Journal of the Acoustical Society of America, vol.11, issue.2, pp.169-177, 1939.
François Bataille and Olivier Van der Vrecken. The MBROLA project : Towards a set of high quality speech synthesizers free of use for non commercial purposes, Spoken Language ICSLP 96. Proceedings., Fourth International Conference on, pp.1393-1396, 1996. ,
Improved time-scaling of musical audio using phase locking at transients. In Audio Engineering Society Convention 112, Audio Engineering Society, 2002. ,
Phase vocoders with arbitrary frequency band selection, Proceedings of the 9th Sound and Music Computing Conference (SMC'12), pp.2012-2029 ,
Comparison of chironomic stylization versus statistical modeling of prosody for expressive speech synthesis, Sixteenth Annual Conference of the International Speech Communication Association, pp.2015-165 ,
URL : https://hal.archives-ouvertes.fr/hal-01621843
, Marc Evrard. Synthèse de parole expressive à partir du texte : Des phonostyles au contrôle gestuel pour la synthèse paramétrique statistique, pp.19-165, 2015.
, Gunnar Fant. Acoustic theory of speech production. Mouton, pp.14-15, 1970.
Glove-Talk: a neural network interface between a data-glove and a speech synthesizer, IEEE Transactions on Neural Networks, vol.4, issue.1, pp.2-8, 1993. ,
DOI : 10.1109/72.182690
Glove-talk II - a neural-network interface which maps gestures to parallel formant speech synthesizer controls, IEEE transactions on neural networks, pp.205-212, 1998. ,
DOI : 10.1109/72.623199
Cantor Digitalis: chironomic parametric synthesis of singing, EURASIP Journal on Audio, Speech, and Music Processing, vol.22, issue.1, pp.2017-78, 2017. ,
DOI : 10.2307/3681043
, Lionel Feugere. Synthèse par règles de la voix chantée contrôlée par le geste et applications musicales, p.2013, 2013.
The Wekinator : a system for real-time, interactive machine learning in music, Proceedings of The Eleventh International Society for Music Information Retrieval Conference, pp.2010-63, 2010. ,
Phase Vocoder, Bell System Technical Journal, vol.45, issue.9, pp.1493-1509, 1966. ,
DOI : 10.1002/j.1538-7305.1966.tb01706.x
URL : https://asa.scitation.org/doi/pdf/10.1121/1.1939800
EasyAlign : an automatic phonetic alignment tool under Praat, Interspeech, pp.2011-129, 2011. ,
Beyond arousal: Valence and potency/control cues in the vocal expression of emotion, The Journal of the Acoustical Society of America, vol.128, issue.3, pp.1322-1336 ,
DOI : 10.1121/1.3466853
A trial of communicative prosody generation based on control characteristic of one word utterance observed in real conversational speech, Proc. Speech Prosody, pp.37-40, 2006. ,
Moulines and Francis Charpentier. A diphone synthesis system based on time-domain prosodic modifications of speech, Acoustics, Speech, and Signal Processing International Conference on, pp.238-241, 1989. ,
, Nathalie Henrich. Etude de la source glottique en voix parlée et chantée : modélisation et estimation, mesures acoustiques et électroglottographiques, perception, p.15, 2001.
The Importance of Parameter Mapping in Electronic Instrument Design, Journal of New Music Research, vol.32, issue.4, pp.429-440, 2003. ,
DOI : 10.1076/jnmr.32.4.429.18853
, International MIDI Association IMA. MIDI musical instrument digital interface specification 1.0, 1983.
, Satoshi Imai. Cepstral analysis synthesis on the mel frequency scale, Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'83, pp.93-96, 1983.
, Fumitada Itakura. A statistical method for estimation of speech spectral density and formant frequency, IEICE Trans, vol.53, issue.1, pp.35-42, 1970.
, Fumitada Itakura. Line spectrum representation of linear predictor coefficients of speech signals, The Journal of the Acoustical Society of America, vol.57, issue.S1, pp.35-35, 1975.
TIME-FREQUENCY JIGSAW PUZZLE: ADAPTIVE MULTIWINDOW AND MULTILAYERED GABOR EXPANSIONS, Multiresolution and Information Processing, pp.293-315, 2007. ,
DOI : 10.1006/acha.1997.0209
URL : https://hal.archives-ouvertes.fr/hal-00350152
, Francis Katamba. An introduction to phonology, 1989.
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, Speech Communication, vol.27, issue.3-4, pp.187-207, 1999. ,
DOI : 10.1016/S0167-6393(98)00085-5
URL : https://hal.archives-ouvertes.fr/hal-01105608
, Prosodie und emotionen, pp.40-49, 2002.
Hideki Kenmochi and Hayato Ohshita. VOCALOID - commercial singing synthesizer based on sample concatenation, Interspeech, pp.4009-4010, 2007. ,
, Loic Kessous. Contrôles gestuels bi-manuels de processus sonores, 2004.
, Loic Kessous. Gestural control of singing voice, a musical instrument, Proceedings of Sound and Music Computing, pp.78-154, 2004.
On the Auditory Perception of Tones in Mandarin, Phonetica, vol.20, issue.2-4, pp.63-67, 1969. ,
DOI : 10.1159/000259274
Human frequency-following response: representation of pitch contours in Chinese tones, Hearing Research, vol.189, issue.1-2, pp.1-12, 2004. ,
DOI : 10.1016/S0378-5955(03)00402-7
, Acoustic properties of phonemes in continuous speech for different speaking rate, Spoken Language ICSLP 96. Proceedings., Fourth International Conference on, pp.2435-2438, 1996.
F0 range in signaling speaker affect, The Journal of the Acoustical Society of America, vol.78, issue.2, pp.435-444, 1985. ,
DOI : 10.1121/1.392466
Jean Laroche and Mark Dolson. New phase-vocoder techniques for pitch-shifting, harmonizing and other exotic effects, Applications of Signal Processing to Audio and Acoustics, pp.91-94, 1999. ,
Calliphony : a real-time intonation controller for expressive speech synthesis, SSW, pp.345-350, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-01621891
Sylvain Le Beux, Boris Doval and Christophe d'Alessandro. Issues and solutions related to real-time TD-PSOLA implementation, Audio Engineering Society Convention 128, pp.14-26, 2010. ,
Sylvain Le Beux. Contrôle gestuel de la prosodie et de la qualité vocale, pp.9-10, 2009. ,
Evaluation of contextual descriptors for HMM-based speech synthesis in French, SSW, pp.153-158 ,
URL : https://hal.archives-ouvertes.fr/hal-00987809
Précis de phonostylistique : parole et expressivité, 1993. ,
A sines+ transients+ noise audio representation for data compression and time/pitch scale modifications, Audio Engineering Society Convention 105. Audio Engineering Society, 1998. ,
DOI : 10.1121/1.428679
Reiterant speech as a test of non-native speakers' mastery of the timing of French, The Journal of the Acoustical Society of America, vol.90, issue.6, pp.3008-3018, 1991. ,
Fine-grain voice strength estimation from vowel spectral cues, INTERSPEECH, pp.128-132 ,
On the Rôle of Formant Transitions in Vowel Recognition, The Journal of the Acoustical Society of America, vol.42, issue.4, pp.830-843, 1967. ,
DOI : 10.1121/1.1910655
Phase vocoder and beyond, pp.73-120 ,
URL : https://hal.archives-ouvertes.fr/hal-01250848
Sound analysis and synthesis adaptive in time and two frequency bands. arXiv preprint, pp.2011-2028 ,
URL : https://hal.archives-ouvertes.fr/hal-00626914
A reduced multiple Gabor frame for local time adaptation of the spectrogram. arXiv preprint, pp.2011-2028 ,
URL : https://hal.archives-ouvertes.fr/hal-00626839
The frame/content theory of evolution of speech production, Behavioral and brain sciences, vol.21, issue.53, pp.499-511, 1998. ,
, Paolo Mairano. Rhythm typology : acoustic and perceptive studies, pp.2011-2059, 2011.
Speech analysis/Synthesis based on a sinusoidal representation, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.34, issue.4, pp.744-754, 1986. ,
DOI : 10.1109/TASSP.1986.1164910
WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications, IEICE Transactions on Information and Systems, vol.99, issue.7, pp.1877-1884, 2016. ,
DOI : 10.1587/transinf.2015EDP7457
, Eric Moulines and Francis Charpentier. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech communication, vol.9, issue.18, pp.453-467, 1990.
Non-parametric techniques for pitch-scale and time-scale modification of speech, Speech Communication, vol.16, issue.2, pp.175-205, 1995. ,
DOI : 10.1016/0167-6393(94)00054-E
Audition dominates vision in duration perception irrespective of salience, attention, and temporal discriminability, Attention, Perception, & Psychophysics, vol.88, issue.1, pp.1485-1502 ,
DOI : 10.1037/0033-2909.88.3.638
, Jane Orton. Educating Chinese language teachers - Some fundamentals. Teaching and learning Chinese in global contexts : CFL worldwide, pp.151-164, 2011.
, Olivier Perrotin and Christophe d'Alessandro. Adaptive mapping for improved pitch accuracy on touch user interfaces, NIME, pp.186-189, 2013.
Seeing, Listening, Drawing, ACM Transactions on Applied Perception, vol.14, issue.2, pp.10-56, 2016. ,
DOI : 10.1145/1279740.1279758
URL : https://hal.archives-ouvertes.fr/hal-01672241
, Olivier Perrotin and Christophe d'Alessandro. Vocal effort modification for singing synthesis, Spectrum, vol.100, pp.50-2016
, Olivier Perrotin. Chanter avec les mains : interfaces chironomiques pour les instruments de musique numériques, pp.121-139, 2015.
, Bernd Pompino-Marschall. On the psychoacoustic nature of the P-center phenomenon, Journal of phonetics, 1989.
, Ernst Pöppel. Temporal mechanisms in perception. Selectionism and the Brain, vol.37, p.185, 1994.
Real-time audio analysis tools for Pd and MSP, 1998. ,
Speech transformations based on a sinusoidal representation, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.34, issue.6, pp.1449-1464, 1986. ,
DOI : 10.1109/TASSP.1986.1164985
, Analyse, représentation et traitement du geste instrumental : application aux instruments à clavier, 1991.
Sensorimotor synchronization : a review of the tapping literature, Psychonomic bulletin & review, vol.12, issue.6, pp.969-992, 2005. ,
Cerebellar nuclear topography of simple and synergistic movements in the alert baboon (Papio papio), Experimental Brain Research, vol.47, issue.3, pp.365-380, 1982. ,
DOI : 10.1007/BF00239355
, Axel Röbel. A new approach to transient processing in the phase vocoder, 6th International Conference on Digital Audio Effects (DAFx), pp.344-349, 2003.
, Gestures : Their role in teaching and learning, Review of educational research, vol.71, issue.3, pp.365-392, 2001.
Instrumental gestural mapping strategies as expressivity determinants in computer music performance, Proceedings of Kansei - The Technology of Emotion Workshop, pp.3-4, 1997. ,
Superposition Frames for Adaptive Time-Frequency Analysis and Fast Reconstruction, IEEE Transactions on Signal Processing, vol.58, issue.5, pp.2581-2596 ,
DOI : 10.1109/TSP.2010.2041604
A versatile software parallel-formant speech synthesizer, Joint Speech Res. Unit, 1982. ,
System for Automatic Formant Analysis of Voiced Speech, The Journal of the Acoustical Society of America, vol.47, issue.2B, pp.634-648, 1970. ,
DOI : 10.1121/1.1911939
From singing to speaking : why singing may lead to recovery of expressive language function in patients with Broca's aphasia. Music perception : An interdisciplinary journal, pp.315-323, 2008. ,
Evidence for plasticity in white-matter tracts of patients with chronic Broca's aphasia undergoing intense intonation-based speech therapy, Annals of the New York Academy of Sciences, vol.1169, issue.1, pp.385-394, 2009. ,
, Perceptual centers in speech -An acoustic analysis, 1993.
Spectral Modeling Synthesis: A Sound Analysis/Synthesis System Based on a Deterministic Plus Stochastic Decomposition, Computer Music Journal, vol.14, issue.4, pp.12-24, 1990. ,
DOI : 10.2307/3680788
Toward a register approach in teaching Mandarin tones, Journal of Chinese Language Teachers Association, vol.24, issue.3, pp.27-47, 1989. ,
, Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification, 1996.
, The Journal of the Acoustical Society of America, vol.87, issue.1, pp.462-463, 1990.
DOI : 10.1121/1.399243
Happy talk : Perceptual and acoustic effects of smiling on speech, Perception & psychophysics, vol.27, issue.170, pp.24-27, 1980. ,
Vocal intensity in speakers and singers, The Journal of the Acoustical Society of America, vol.91, issue.5, pp.2936-2946, 1992. ,
DOI : 10.1121/1.402929
Mel-generalized cepstral analysis-a unified approach to speech spectral estimation, ICSLP, pp.18-22, 1994. ,
Speech signal processing toolkit (SPTK) [Online], recent version, pp.2012-166 ,
Speech Synthesis Based on Hidden Markov Models, Proceedings of the IEEE, pp.1234-1252 ,
DOI : 10.1109/JPROC.2013.2251852
Expression Control in Singing Voice Synthesis: Features, approaches, evaluation, and challenges, IEEE Signal Processing Magazine, vol.32, issue.6, pp.55-73 ,
DOI : 10.1109/MSP.2015.2424572
Measuring perceptual centers using the phase correction response, Attention, Perception, & Psychophysics, vol.133, issue.5, pp.1614-1629 ,
DOI : 10.1037/0033-2909.133.2.273
, Mechanismus der menschlichen Sprache. Degen, 1791
The rhythm of language and speech : Constraining factors, models, metrics and applications, pp.44-71, 2008. ,
The Role of Tapping in Improving Connected Speech Comprehension of a Non-Native Variety of English, 2017. ,
Gestural Control of Sound Synthesis, Proceedings of the IEEE, vol.92, issue.4, pp.632-644, 2004. ,
DOI : 10.1109/JPROC.2004.825882
On the Choice of Transducer Technologies for Specific Musical Functions, International Computer Music Conference, 2000. ,
URL : https://hal.archives-ouvertes.fr/hal-01105531
Training American listeners to perceive Mandarin tones, The Journal of the Acoustical Society of America, vol.106, issue.6, pp.3649-3658, 1999. ,
DOI : 10.1121/1.428217
The phonological status of syllabic consonants in English RP, Phonetica, vol.13, issue.12, pp.110-113, 1965. ,
Tonal perception errors and interference from English intonation, Journal of Chinese Language Teachers Association, vol.16, issue.2, pp.27-56, 1981. ,
Timothy Wise et al. Yodel species : a typology of falsetto effects in popular music vocal styles, Radical Musicology, vol.2, p.57, 2007. ,
New Musical Control Structures from Standard Gestural Controllers, International Computer Music Conference, 1997. ,
Inspect, Embody, Invent, Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, CHI '16, pp.5397-5408 ,
DOI : 10.1145/2468356.2479564
, ACM, 2016.
MirrorFugue iii, CHI '13 Extended Abstracts on Human Factors in Computing Systems on, CHI EA '13, pp.2891-2892 ,
DOI : 10.1145/2468356.2479564
Andantino : Teaching Children Piano with Projected Animated Characters, Proceedings of the 15th International Conference on Interaction Design and Children, pp.37-45 ,