K. Alho, J. F. Connolly, M. Cheour, A. Lehtokoski, M. Huotilainen et al., Hemispheric lateralization in preattentive processing of speech sounds, Neuroscience Letters, vol.258, issue.1, p.24, 1998.

J. L. Anderson, J. L. Morgan, and K. S. Et-white, A statistical basis for speech sound discrimination, Language and Speech, vol.46, issue.2-3, p.33, 2003.

V. Aubanel, Variation phonologique régionale en interaction conversationnelle, p.15, 2011.

M. E. Babel, Phonetic and social selectivity in speech accommodation, 2009.

L. Badino, C. Canevari, L. Fadiga, and G. Et-metta, An auto-encoder based approach to unsupervised learning of subword units, IEEE International Conference on Acoustics, Speech and Signal Processing, p.44, 2014.

L. Badino, C. Canevari, L. Fadiga, and G. Et-metta, Integrating articulatory data in deep neural network-based acoustic modeling, Computer Speech & Language, vol.36, pp.173-195, 2016.

G. Bailly, Learning to speak. Sensori-motor control of speech movements, Speech Communication, vol.22, issue.2-3, pp.251-267, 1997.

E. Baker, S. E. Blumstein, and H. Et-goodglass, Interaction between phonological and semantic factors in auditory comprehension, Neuropsychologia, vol.19, issue.1, p.13, 1981.

A. Baranes and P. Oudeyer, Intrinsically motivated goal exploration for active motor learning in robots : A case study, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010), p.57, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00541769

A. Baranes and P. Oudeyer, Active learning of inverse models with intrinsically motivated goal exploration in robots, Robotics and Autonomous Systems, vol.61, issue.1, p.57, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00788440

M. Barnaud, P. Bessière, J. Diard, and J. Schwartz, Reanalyzing neurocognitive data on the role of the motor system in speech perception within COSMO, a Bayesian perceptuo-motor model of speech communication, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01669961

M. Barnaud, J. Diard, P. Bessière, and J. Schwartz, Modeling the concurrent development of speech perception and production in a Bayesian framework, The 5th Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics (ICDL-Epirob 2015), pp.248-249, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01202420

M. Barnaud, J. Diard, P. Bessière, and J. Schwartz, Modeling the concurrent development of speech perception and production in a Bayesian framework, Workshop on Probabilistic Inference and the Brain, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01202420

M. Barnaud, J. Diard, P. Bessière, and J. Schwartz, Assessing Idiosyncrasies in a Bayesian Model of Speech Communication, Proceedings of Interspeech 2016, pp.2080-2084, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01371722

M. Barnaud, J. Diard, P. Bessière, and J. Schwartz, Assessing phonological learning in COSMO, a Bayesian model of speech communication, The 7th Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01614145

M. Barnaud, J. Diard, P. Bessière, and J. Schwartz, Perceptuo-motor speech units in the brain with COSMO, a Bayesian model of communication, Proceedings of the 11th International Seminar on Speech Production, p.196, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01614179

M. Barnaud, J. Diard, P. Bessière, and J. Schwartz, Computational simulations of perceptuo-motor idiosyncrasies support the involvement of motor knowledge in speech perception, p.202

M. Barnaud, R. Laurent, P. Bessière, J. Diard, and J. Schwartz, Modeling concurrent development of speech perception and production in a Bayesian framework, Workshop on Infant Language Development (WILD), 2015.
URL : https://hal.archives-ouvertes.fr/hal-01202417

M. Barnaud, J. Schwartz, J. Diard, and P. Et-bessière, Sensorimotor learning in a Bayesian computational model of speech communication, The 6th Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01371719

M. Bastiaansen and P. Hagoort, Oscillatory neuronal dynamics during language comprehension, Progress in Brain Research, vol.159, pp.179-196, 2006.

I. Bazzi and J. Glass, Heterogeneous lexical units for automatic speech recognition : preliminary investigations, IEEE International Conference on Acoustics, Speech and Signal Processing, vol.3, p.53, 2000.

F. Bell-berti, L. J. Raphael, D. B. Pisoni, and J. R. Sawusch, Some relationships between speech production and perception, Phonetica, vol.36, issue.6, p.19, 1979.

H. K. Beller, Priming : effects of advance information on matching, Journal of Experimental Psychology : General, vol.87, issue.2, pp.176-182, 1971.

M. Benzeghiba, R. De-mori, O. Deroo, S. Dupont, T. Erbes et al., Automatic speech recognition and speech variability : A review, Speech Communication, vol.49, issue.10, p.44, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00616506

P. Bertelson, The onset of literacy : Cognitive processes in reading acquisition, p.22, 1986.

J. Bertoncini, R. Bijeljac-babic, P. W. Jusczyk, L. J. Kennedy, and J. Et-mehler, An investigation of young infants' perceptual representations of speech sounds, Journal of Experimental Psychology : General, vol.117, issue.1, pp.21-33, 1988.

J. Bertoncini and J. Mehler, Syllables as units in infant speech perception, Infant Behavior and Development, vol.4, p.39, 1981.

P. Bessière, E. Mazer, J. M. Ahuactzin, and K. Et-mekhnacha, Bayesian Programming, 2013.

C. T. Best, G. W. Mcroberts, R. Lafleur, and J. Et-silver-isenstadt, Divergent developmental patterns for infants' perception of two nonnative consonant contrasts, Infant Behavior and Development, vol.18, issue.3, p.29, 1995.

R. Bijeljac-babic, J. Bertoncini, and J. Et-mehler, How do 4-day-old infants categorize multisyllabic utterances ?, Developmental Psychology, vol.29, issue.4, p.39, 1993.

J. A. Bilmes, A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models, 1998.

J. R. Binder, E. Liebenthal, E. T. Possing, D. A. Medler, and B. D. Ward, Neural correlates of sensory and decision processes in auditory object identification, Nature Neuroscience, vol.7, issue.3, pp.295-301, 2004.

K. Bloom, Social elicitation of infant vocal behavior, Journal of Experimental Child Psychology, vol.20, issue.1, p.36, 1975.

K. Bloom, Quality of adult vocalizations affects the quality of infant vocalizations, Journal of Child Language, vol.15, issue.3, p.36, 1988.

K. Bloom, A. Russell, and K. Et-wassenberg, Turn taking affects the quality of infant vocalizations, Journal of Child Language, vol.14, issue.2, p.36, 1987.

S. E. Blumstein, The neurobiology of the sound structure of language, The Cognitive Neurosciences, p.13, 1995.

J. W. Bohland, D. Bullock, and F. H. Et-guenther, Neural representations and mechanisms for the performance of simple speech sequences, Journal of Cognitive Neuroscience, vol.22, issue.7, p.51, 2010.

L. Bosch and N. Et-sebastián-gallés, Language experience and the perception of a voicing contrast in fricatives : Infant and adult data, Proceedings of the 15th International Conference of Phonetic Sciences, p.29, 2003.

N. J. Bourguignon, S. R. Baum, and D. M. Shiller, Lexical-perceptual integration influences sensorimotor adaptation in speech, Frontiers in Human Neuroscience, vol.8, issue.208, p.16, 2014.

H. Brandl, B. Wrede, F. Joublin, and C. Et-goerick, A self-referential childlike model to acquire phones, syllables and words from acoustic speech, IEEE International Conference on Developmental and Learning, vol.60, pp.31-36, 2008.

M. Bruck, R. Treiman, and M. Et-caravolas, Role of the syllable in the processing of spoken English : Evidence from a nonword comparison task, Journal of Experimental Psychology : Human Perception and Performance, vol.21, issue.3, p.22, 1995.

B. R. Buchsbaum, G. Hickok, and C. Et-humphries, Role of left posterior superior temporal gyrus in phonological processing for speech perception and production, Cognitive Science, vol.25, issue.5, p.25, 2001.

D. E. Callan, A. Callan, and J. A. Et-jones, Speech motor brain regions are differentially recruited during perception of native and foreign-accented phonemes for first and second language listeners, Frontiers in Neuroscience, vol.8, issue.275, p.13, 2014.
DOI : 10.3389/fnins.2014.00275

URL : https://www.frontiersin.org/articles/10.3389/fnins.2014.00275/pdf

D. E. Callan, J. A. Jones, A. M. Callan, and R. Et-akahane-yamada, Phonetic perceptual identification by native-and second-language speakers differentially activates brain regions involved with acoustic phonetic processing and those involved with articulatory-auditory/orosensory internal models, NeuroImage, vol.22, issue.3, p.13, 2004.
DOI : 10.1016/j.neuroimage.2004.03.006

C. Canevari, L. Badino, A. D'ausilio, L. Fadiga, and G. Et-metta, Modeling speech imitation and ecological learning of auditory-motor maps, Frontiers in Psychology, vol.4, issue.364, pp.1-12, 2013.
DOI : 10.3389/fpsyg.2013.00364

URL : https://www.frontiersin.org/articles/10.3389/fpsyg.2013.00364/pdf

J. M. Carroll, M. J. Snowling, J. Stevenson, and C. Et-hulme, The development of phonological awareness in preschool children, Developmental Psychology, vol.39, issue.5, pp.913-923, 2003.

C. Castellini, L. Badino, G. Metta, G. Sandini, M. Tavella et al., The use of phonetic motor invariants can improve automatic phoneme discrimination, PLoS One, vol.6, issue.9, 2011.
DOI : 10.1371/journal.pone.0024055

URL : https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0024055&type=printable

M. Cheour, R. Ceponiene, A. Lehtokoski, A. Luuk, J. Allik et al., Development of language-specific phoneme representations in the infant brain, Nature Neuroscience, vol.1, issue.5, pp.351-353, 1998.

C. Cheung, L. S. Hamilton, K. Johnson, and E. F. Chang, The auditory representation of speech sounds in human motor cortex, vol.5, p.12577, 2016.

N. Chomsky, A review of BF Skinner's verbal behavior, Language, vol.35, issue.1, pp.26-58, 1959.

M. Clayards, M. K. Tanenhaus, R. N. Aslin, and R. A. Jacobs, Perception of speech reflects optimal use of probabilistic speech cues, Cognition, vol.108, issue.3, pp.804-809, 2008.

M. H. Coen, Self-supervised acquisition of vowels in American English, Proceedings of the National Conference on Artificial Intelligence, vol.21, pp.1451-1456, 2006.

F. Colas, J. Diard, and P. Et-bessière, Common Bayesian models for common cognitive issues, Acta Biotheoretica, vol.58, issue.2-3, pp.191-216, 2010.
DOI : 10.1007/s10441-010-9101-1

URL : https://hal.archives-ouvertes.fr/hal-00530356

B. T. Conboy, M. Rivera-gaxiola, L. Klarman, E. Aksoylu, and P. K. Et-kuhl, Associations between native and nonnative speech sound discrimination and language development at the end of the first year, Supplement to the Proceedings of the 29th Boston University Conference on Language Development, p.29, 2005.

B. T. Conboy, M. Rivera-gaxiola, J. Silva-pereyra, and P. K. Et-kuhl, Event-related potential studies of early language processing at the phoneme, word, and sentence levels, Early Language Development : Bridging Brain and Behaviour, vol.5, p.29, 2008.
DOI : 10.1075/tilar.5.04con

B. T. Conboy, J. A. Sommerville, and P. K. Et-kuhl, Cognitive control factors in speech perception at 11 months, Developmental Psychology, vol.44, issue.5, p.35, 2008.
DOI : 10.1037/a0012975

URL : http://europepmc.org/articles/pmc2562344?pdf=render

A. Content and U. H. Frauenfelder, La syllabe comme unité de perception de la parole : un état de la question, 2002.

A. Content, C. Meunier, R. K. Kearns, and U. H. Et-frauenfelder, Sequence detection in pseudo, French : Where is the syllable effect ? Language and Cognitive Processes, vol.16, p.22, 2001.

F. S. Cooper, P. C. Delattre, A. M. Liberman, J. M. Borst, and L. J. Et-gerstman, Some experiments on the perception of synthetic speech sounds, The Journal of the Acoustical Society of America, vol.24, issue.6, pp.597-606, 1952.

W. E. Cooper, Speech perception and production : Studies in selective adaptation, p.15, 1979.

W. E. Cooper, R. R. Ebert, and R. A. Et-cole, Speech perception and production of the consonant cluster, Journal of Experimental Psychology : Human Perception and Performance, vol.2, issue.1, p.15, 1976.

W. E. Cooper and M. R. Et-lauritsen, Feature processing in the perception and production of speech, Nature, vol.252, issue.5479, p.15, 1974.

A. Crompton, Syllables and segments in speech production, Linguistics, vol.19, issue.7-8, pp.663-716, 1981.

A. Cutler, J. M. Mcqueen, D. Norris, and A. Et-somejuan, The roll of the silly ball, Language, Brain and Cognitive Development : Essays in honor of Jacques Mehler, p.23, 2001.

A. Cutler, J. Mehler, D. Norris, and J. Et-segui, A language-specific comprehension strategy, Nature, vol.304, issue.5922, p.22, 1983.

A. Cutler, J. Mehler, D. Norris, and J. Et-segui, The syllable's differing role in the segmentation of French and English, Journal of Memory and Language, vol.25, issue.4, pp.385-400, 1986.

A. Cutler and D. Norris, Monitoring sentence comprehension, Sentence processing : Psycholinguistic studies presented to Merrill Garrett, p.51, 1979.

D. 'ausilio, A. Bufalari, I. Salmas, P. Et-fadiga, and L. , The role of the motor system in discriminating normal and degraded speech sounds, Cortex, vol.48, issue.7, p.13, 2012.

D. 'ausilio, A. Craighero, L. Et-fadiga, and L. , The contribution of the frontal lobe to the perception of speech, Journal of Neurolinguistics, vol.25, issue.5, p.14, 2012.

D. 'ausilio, A. Pulvermüller, F. Salmas, P. Bufalari, I. Begliomini et al., The motor somatotopy of speech perception, Current Biology, vol.19, issue.5, p.13, 2009.

B. L. Davis and P. F. Et-macneilage, The articulatory basis of babbling, Journal of Speech, Language, and Hearing Research, vol.38, issue.6, p.31, 1995.

B. De-boer and P. K. Kuhl, Investigating the role of infant-directed speech with a computer model, Acoustics Research Letters Online, vol.4, issue.4, pp.129-134, 2003.

B. De-boysson-bardies and P. Hallé, Des « capacités précoces » à l'élaboration du premier lexique, Psycholinguistique cognitve, chapter 15, p.29, 2004.

B. De-boysson-bardies, P. Hallé, L. Sagart, and C. Et-durand, A crosslinguistic investigation of vowel formants in babbling, Journal of Child Language, vol.16, issue.1, p.35, 1989.

B. De-boysson-bardies, L. Sagart, and C. Et-durand, Discernible differences in the babbling of infants according to target language, Journal of Child Language, vol.11, issue.1, p.35, 1984.

B. De-boysson-bardies and M. M. Et-vihman, Adaptation to language : Evidence from babbling and first words in four languages, Language, vol.67, issue.2, pp.297-319, 1991.

S. Decoene, Testing the speech unit hypothesis with the primed matching task : phoneme categories are perceptually basic. Attention, Perception, & Psychophysics, vol.53, pp.601-616, 1993.

G. Dehaene-lambertz and S. Baillet, A phonological representation in the infant brain, NeuroReport, vol.9, issue.8, pp.1885-1888, 1998.

G. Dehaene-lambertz and M. Peña, Electrophysiological evidence for automatic phonetic processing in neonates, NeuroReport, vol.12, issue.14, pp.3155-3158, 2001.

P. C. Delattre, A. M. Liberman, and F. S. Cooper, Acoustic loci and transitional cues for consonants, The Journal of the Acoustical Society of America, vol.27, issue.4, pp.769-773, 1955.

G. S. Dell, A spreading-activation theory of retrieval in sentence production, Psychological Review, vol.93, issue.3, p.283, 1986.

A. P. Dempster, N. M. Laird, and D. B. Et-rubin, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society : Series B (Statistical Methodology), vol.39, issue.1, p.55, 1977.

R. A. Depaolis, M. M. Vihman, and T. Et-keren-portnoy, Do production patterns influence the processing of speech in prelinguistic infants ? Infant Behavior and Development, vol.34, p.37, 2011.

J. Diard, Bayesian Algorithmic Modeling in Cognitive Science. Habilitation à diriger des recherches (HDR), 2015.
URL : https://hal.archives-ouvertes.fr/tel-01237127

R. L. Diehl, A. J. Lotto, and L. L. Holt, Speech perception, Annual Review of Psychology, vol.55, pp.149-179, 2004.

B. Dillon, E. Dunbar, and W. J. Et-idsardi, A single-stage approach to learning phonological categories : Insights from Inuktitut, Cognitive Science, vol.37, issue.2, pp.344-377, 2013.

M. Dole, C. Vilain, A. Vilain, H. Et-loevenbruck, and J. Schwartz,

E. Dupoux, G. Beraud-sudreau, and S. Et-sagayama, Templatic features for modeling phoneme acquisition, Proceedings of the 31st Annual Conference of the Cognitive Science Society, vol.60, pp.219-224, 2011.

C. Eckers, B. J. Kröger, K. Sass, and S. Heim, Neural representation of the sensorimotor speech-action-repository, Frontiers in Human Neuroscience, vol.7, issue.121, pp.1-10, 2013.

R. E. Eilers, W. Gavin, and W. R. Et-wilson, Linguistic experience and phonemic perception in infancy : A crosslinguistic study, Child Development, vol.50, issue.1, p.28, 1979.

R. E. Eilers and F. D. Et-minifie, Fricative discrimination in early infancy, Journal of Speech, Language, and Hearing Research, vol.18, issue.1, p.27, 1975.

P. D. Eimas, Auditory and phonetic coding of the cues for speech : Discrimination of the [rl] distinction by young infants. Attention, Perception, & Psychophysics, vol.18, p.27, 1975.

P. D. Eimas and J. D. Et-corbit, Selective adaptation of linguistic feature detectors, Cognitive Psychology, vol.4, issue.1, p.15, 1973.

P. D. Eimas and J. L. Miller, Discrimination of information for manner of articulation, Infant Behavior and Development, vol.3, p.27, 1980.

P. D. Eimas, E. R. Siqueland, P. Juscyk, and J. Et-vigorito, Speech perception in infants, Science, vol.171, issue.3968, p.27, 1971.

K. Ejiri, Relationship between rhythmic behavior and canonical babbling in infant vocal development, Phonetica, vol.55, issue.4, p.31, 1998.

K. Ejiri and N. Masataka, Co-occurences of preverbal vocal behavior and motor action in early infancy, Developmental Science, vol.4, issue.1, p.31, 2001.

L. Elbers, Operating principles in repetitive babbling : A cognitive continuity approach, Cognition, vol.12, issue.1, p.31, 1982.

L. Fadiga, L. Craighero, G. Buccino, and G. Et-rizzolatti, Speech listening specifically modulates the excitability of tongue muscles : a TMS study, European Journal of Neuroscience, vol.15, issue.2, p.13, 2002.

M. K. Fagan, Mean length of utterance before words and grammar : Longitudinal trends and developmental implications of infant vocalizations, Journal of Child Language, vol.36, issue.3, p.31, 2009.

N. H. Feldman, T. L. Griffiths, S. Goldwater, and J. L. Et-morgan, A role for the developing lexicon in phonetic category acquisition, Psychological Review, vol.120, issue.4, pp.751-778, 2013.

N. H. Feldman, T. L. Griffiths, and J. L. Et-morgan, The influence of categories on perception : Explaining the perceptual magnet effect as optimal statistical inference, Psychological Review, vol.116, issue.4, pp.752-782, 2009.

N. H. Feldman, T. L. Griffiths, and J. L. Et-morgan, Learning phonetic categories by learning a lexicon, Proceedings of the 31st Annual Conference of the Cognitive Science Society, pp.2208-2213, 2009.

N. H. Feldman, E. B. Myers, and K. S. Et-white, Learners use word-level statistics in phonetic category acquisition, Proceedings of the 35th Boston University Conference on Language Development, p.56, 2011.

N. H. Feldman, E. B. Myers, K. S. White, T. L. Griffiths, and J. L. Et-morgan, Word-level information influences phonetic learning in adults and infants, Cognition, vol.127, issue.3, p.184, 2013.
DOI : 10.1016/j.cognition.2013.02.007

URL : http://europepmc.org/articles/pmc3646897?pdf=render

T. M. Field, R. Woodson, R. Greenberg, and D. Et-cohen, Facial expression by neonates. Annual Progress in Child Psychiatry and Child Development, vol.16, p.36, 1983.

S. A. Finney, A. Protopapas, and P. D. Et-eimas, Attentional allocation to syllables in American English, Journal of Memory and Language, vol.35, issue.6, p.22, 1996.
DOI : 10.1006/jmla.1996.0046

D. J. Foss and D. A. Swinney, On the psychological reality of the phoneme : Perception, identification, and consciousness, Journal of Verbal Learning and Verbal Behavior, vol.12, issue.3, p.21, 1973.

A. E. Fowler, S. Brady, and D. P. Et-shankweiler, How early phonological development might set the stage for phoneme awareness, vol.106, pp.97-117, 1991.

C. A. Fowler, Segmentation of coarticulated speech in perception, Attention, Perception, & Psychophysics, vol.36, issue.4, p.21, 1984.
DOI : 10.3758/bf03202790

URL : https://link.springer.com/content/pdf/10.3758%2FBF03202790.pdf

C. A. Fowler, An event approach to the study of speech perception from a direct-realist perspective, Journal of Phonetics, vol.14, issue.1, pp.3-28, 1986.

C. A. Fowler, J. M. Brown, L. Sabadini, and J. Et-weihing, Rapid access to speech gestures in perception : Evidence from choice and simple response time tasks, Journal of Memory and Language, vol.49, issue.3, p.18, 2003.
DOI : 10.1016/s0749-596x(03)00072-x

URL : http://europepmc.org/articles/pmc2901126?pdf=render

C. A. Fowler, D. P. Shankweiler, and M. Studdert-kennedy, Perception of the speech code revisited : Speech is alphabetic after all, Psychological Review, vol.123, issue.2, p.23, 2016.

R. A. Fox, Individual variation in the perception of vowels : Implications for a perceptionproduction link, Phonetica, vol.39, issue.1, p.19, 1982.

M. K. Franken, J. M. Mcqueen, P. Hagoort, and D. J. Et-acheson, Assessing the link between speech perception and production through individual differences, Proceedings of the 18th International Congress of Phonetic Sciences, p.19, 2015.

A. D. Friederici and J. M. Wessels, Phonotactic knowledge of word boundaries and its use in infant speech perception. Attention, Perception, & Psychophysics, vol.54, pp.287-295, 1993.

V. A. Fromkin, Speech errors as linguistic evidence. Mouton de Gruyter, The Hague, vol.23, 1984.

B. Galantucci, C. A. Fowler, and M. T. Et-turvey, The motor theory of speech perception reviewed, Psychonomic Bulletin & Review, vol.13, issue.3, pp.361-377, 2006.
DOI : 10.3758/bf03193857

URL : https://link.springer.com/content/pdf/10.3758%2FBF03193857.pdf

A. Ganapathiraju, J. Hamaker, J. Picone, M. Ordowski, and G. R. Et-doddington, Syllablebased large vocabulary continuous speech recognition, IEEE Transactions on Speech and Audio Processing, vol.9, issue.4, p.53, 2001.
DOI : 10.1109/89.917681

M. Garnier, L. Lamalle, and M. Sato, Neural correlates of phonetic convergence and speech imitation, Frontiers in Psychology, vol.4, issue.600, p.15, 2013.
DOI : 10.3389/fpsyg.2013.00600

URL : https://hal.archives-ouvertes.fr/hal-00576055

M. G. Gaskell and W. D. Et-marslen-wilson, Integrating form and meaning : A distributed model of speech perception, Language and Cognitive Processes, vol.12, issue.5-6, pp.613-656, 1997.

B. Gauthier, R. Shi, and Y. Xu, Learning phonetic categories by tracking movements, Cognition, vol.103, issue.1, p.56, 2007.

H. S. Gauvin, W. De-baene, M. Brass, and R. J. Et-hartsuiker, Conflict monitoring in speech processing : an fMRI study of error detection in speech production and perception, NeuroImage, vol.126, pp.96-105, 2016.

J. R. Gelfand and S. Y. Bookheimer, Dissociating neural mechanisms of temporal sequencing and processing phonemes, Neuron, vol.38, issue.5, p.26, 2003.

O. Ghitza, Linking speech perception and neurophysiology : speech decoding guided by cascaded oscillators locked to the input rhythm, Frontiers in Psychology, vol.2, issue.130, pp.1-13, 2011.

O. Ghitza, The theta-syllable : a unit of speech information defined by cortical function, Frontiers in Psychology, vol.4, issue.138, pp.1-5, 2013.

E. Gilet, J. Diard, and P. Et-bessière, Bayesian action-perception computational model : interaction of production and recognition of cursive letters, PLoS One, vol.6, issue.6, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00645868

A. Giraud and D. Et-poeppel, Cortical oscillations and speech processing : emerging computational principles and operations, Nature Neuroscience, vol.15, issue.4, pp.511-517, 2012.

S. D. Goldinger, Words and voices : episodic traces in spoken word identification and recognition memory, Journal of Experimental Psychology : Learning, Memory, and Cognition, vol.22, issue.5, p.55, 1996.

S. D. Goldinger, Echoes of echoes ? An episodic theory of lexical access, Psychological Review, vol.105, issue.2, pp.251-279, 1998.

S. D. Goldinger and T. Azuma, Puzzle-solving science : The quixotic quest for units in speech perception, Journal of Phonetics, vol.31, issue.3, p.24, 2003.

J. A. Goldsmith and A. Et-xanthos, Learning phonological categories. Language, vol.85, pp.4-38, 2009.

M. H. Goldstein, A. P. King, and M. J. West, Social interaction shapes babbling : Testing parallels between birdsong and speech, Proceedings of the National Academy of Sciences, vol.100, issue.13, p.36, 2003.

M. H. Goldstein and J. A. Et-schwade, Social feedback to infants' babbling facilitates rapid phonological learning, Psychological Science, vol.19, issue.5, p.36, 2008.

K. Grabski, Les cartes sensorimotrices de la parole : Corrélats neurocognitifs et couplage fonctionnel des systèmes de perception et de production des voyelles du Français, 2012.

K. Grabski, P. Tremblay, V. L. Gracco, L. Girin, and M. Sato, A mediating role of the auditory dorsal pathway in selective adaptation to speech : A state-dependent transcranial magnetic stimulation study, Brain Research, vol.1515, p.13, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00915287

S. T. Grafton, M. A. Arbib, L. Fadiga, and G. Et-rizzolatti, Localization of grasp representations in humans by positron emission tomography, Experimental Brain Research, vol.112, issue.1, p.12, 1996.

S. Guediche, S. E. Blumstein, J. A. Fiez, and L. L. Holt, Speech perception under adverse conditions : insights from behavioral, computational, and neuroscience research, Frontiers in Systems Neuroscience, vol.7, p.51, 2014.

F. H. Guenther, Speech sound acquisition, coarticulation, and rate effects in a neural network model of speech production, Psychological Review, vol.102, issue.3, pp.594-621, 1995.

F. H. Guenther, Cortical interactions underlying the production of speech sounds, Journal of Communication Disorders, vol.39, issue.5, p.48, 2006.

F. H. Guenther and T. Vladusich, A neural theory of speech acquisition and production, Journal of Neurolinguistics, vol.25, issue.5, pp.408-422, 2012.

P. Hallé, A. Cristia, S. Fuchs, M. Weirich, D. Pape et al., Global and detailed speech representations in early language acquisition, Speech production and perception : Planning and dynamics, pp.11-38, 2012.

R. J. Hartsuiker and H. H. Kolk, Error monitoring in speech production : A computational test of the perceptual loop theory, Cognitive Psychology, vol.42, issue.2, pp.113-157, 2001.

B. Hayes and J. Et-white, Phonological naturalness and phonotactic learning, Linguistic Inquiry, vol.44, issue.1, p.56, 2013.

B. Hayes and C. Wilson, A maximum entropy model of phonotactics and phonotactic learning, Linguistic Inquiry, vol.39, issue.3, p.56, 2008.

A. F. Healy and J. E. Et-cutting, Units of speech perception : Phoneme and syllable, Journal of Verbal Learning and Verbal Behavior, vol.15, issue.1, p.24, 1976.

I. Heintz, M. E. Beckman, E. Fosler-lussier, and L. Et-ménard, Evaluating parameters for mapping adult vowels to imitative babbling, Proceedings of Interspeech, p.58, 2009.

G. Hickok, The functional neuroanatomy of language, Physics of Life Reviews, vol.6, issue.3, p.14, 2009.

G. Hickok, The role of mirror neurons in speech perception and action word semantics, Language and Cognitive Processes, vol.25, p.14, 2010.

G. Hickok, Computational neuroanatomy of speech production, Nature Reviews Neuroscience, vol.13, issue.2, p.26, 2012.

G. Hickok, M. Costanzo, R. Capasso, and G. Et-miceli, The role of Broca's area in speech perception : Evidence from aphasia revisited, Brain and Language, vol.119, issue.3, p.13, 2011.

G. Hickok and D. Poeppel, The cortical organization of speech processing, Nature Reviews Neuroscience, vol.8, issue.5, pp.393-402, 2007.

J. Hillenbrand, F. D. Minifie, and T. J. Edwards, Tempo of spectrum change as a cue in speechsound discrimination by infants, Journal of Speech, Language, and Hearing Research, vol.22, issue.1, p.27, 1979.

J. Hochmann and L. Et-papeo, The invariance problem in infancy : A pupillometry study, Psychological science, vol.25, issue.11, p.41, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01103725

H. Hollien, On vocal registers, Journal of Phonetics, vol.2, pp.125-143, 1974.

J. Hornstein and J. Santos-victor, A unified approach to speech production and recognition based on articulatory motor representations, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2007), p.58, 2007.

J. F. Houde and M. I. Jordan, Sensorimotor adaptation in speech production, Science, vol.279, issue.5354, pp.1213-1216, 1998.

J. F. Houde and M. I. Jordan, Sensorimotor adaptation of speech I : Compensation and adaptation, Journal of Speech, Language, and Hearing Research, vol.45, issue.2, pp.295-310, 2002.

J. F. Houde and S. S. Et-nagarajan, Speech production as state feedback control, Frontiers in Human Neuroscience, vol.5, issue.82, pp.1-14, 2011.

J. F. Houde, S. S. Nagarajan, and T. Et-heinks-maldonado, Dynamic cortical imaging of speech compensation for auditory feedback perturbations, The Journal of the Acoustical Society of America, vol.121, issue.5, pp.3045-3045, 2007.

I. S. Howard and P. Messum, Modeling the development of pronunciation in infant speech acquisition, Motor Control, vol.15, issue.1, pp.85-117, 2011.

A. Hyafil, L. Fontolan, C. Kabdebon, B. Gutkin, and A. Et-giraud, Speech encoding by coupled cortical theta and gamma oscillations, 2015.

M. Iacoboni, R. P. Woods, M. Brass, H. Bekkering, J. C. Mazziotta et al., Cortical mechanisms of human imitation, Science, vol.286, issue.5449, p.12, 1999.

H. Ishihara, Y. Yoshikawa, K. Miura, and M. Et-asada, Caregiver's sensorimotor magnets lead infant's vowel acquisition through auto mirroring, IEEE International Conference on Developmental and Learning (ICDL 2008), pp.49-54, 2008.

T. Ito, M. Tiede, and D. J. Et-ostry, Somatosensory function in speech perception, Proceedings of the National Academy of Sciences, vol.106, issue.4, p.203, 2009.

J. M. Iverson and M. K. Et-fagan, Infant vocal-motor coordination : precursor to the gesturespeech system ?, Child Development, vol.75, issue.4, p.31, 2004.

J. M. Iverson, A. J. Hall, L. Nickel, and R. H. Et-wozniak, The relationship between reduplicated babble onset and laterality biases in infant rhythmic arm movements, Brain and Language, vol.101, issue.3, p.31, 2007.

J. M. Iverson and E. Et-thelen, Hand, mouth and brain. the dynamic emergence of speech and gesture, Journal of Consciousness Studies, vol.6, p.31, 1999.

R. A. Jacobs, M. I. Jordan, S. J. Nowlan, and G. E. Et-hinton, Adaptive mixtures of local experts, Neural Computation, vol.3, issue.1, pp.79-87, 1991.

C. Jacquemot, E. Dupoux, and A. Et-bachoud-lévi, Breaking the mirror : Asymmetrical disconnection between the phonological input and output codes, Cognitive Neuropsychology, vol.24, issue.1, p.181, 2007.
URL : https://hal.archives-ouvertes.fr/hal-02326793

D. G. Jamieson and M. F. Et-cheesman, The adaptation of produced voice-onset time, Journal of Phonetics, vol.15, issue.1, p.15, 1987.

L. Jäncke, T. Wüstenberg, H. Scheich, and H. Et-heinze, Phonetic perception and the temporal cortex, NeuroImage, vol.15, issue.4, pp.733-746, 2002.

E. T. Jaynes, Probability theory : The logic of science, 2003.

E. K. Johnson and M. D. Tyler, Testing the limits of statistical learning for word segmentation, Developmental Science, vol.13, issue.2, pp.339-345, 2010.

K. Johnson, Talker variability in vowel perception, The Journal of the Acoustical Society of America, vol.98, issue.5, p.178, 1995.

K. Johnson, Speech perception without speaker normalization : An exemplar model, Talker variability in speech processing, pp.145-165, 1997.

J. A. Jones and D. E. Et-callan, Brain activity during audiovisual speech perception : an fMRI study of the McGurk effect, NeuroReport, vol.14, issue.8, p.13, 2003.

P. W. Jusczyk, From general to language-specific capacities : The WRAPSA model of how speech perception develops, Journal of Phonetics, vol.21, pp.3-28, 1993.

P. W. Jusczyk, The Discovery of Spoken Language, 1997.

P. W. Jusczyk and C. Derrah, Representation of speech sounds by young infants, Developmental Psychology, vol.23, issue.5, p.648, 1987.

P. W. Jusczyk, A. D. Friederici, J. M. Wessels, V. Y. Svenkerud, and A. M. Et-jusczyk, Infants' sensitivity to the sound patterns of native language words, Journal of Memory and Language, vol.32, issue.3, p.34, 1993.

P. W. Jusczyk and P. A. Luce, Infants' sensitivity to phonotactic patterns in the native language, Journal of Memory and Language, vol.33, issue.5, pp.630-645, 1994.

P. W. Jusczyk, J. Murray, and J. Bayly, Perception of place of articulation in fricatives and stops by infants, Proceedings of the Biennial Meeting of the Society for Research in Child Development, p.27, 1979.

P. W. Jusczyk, B. S. Rosner, J. E. Cutting, C. F. Foard, and L. B. Smith, Categorical perception of nonspeech sounds by 2-month-old infants, Perception, & Psychophysics, vol.21, issue.1, p.27, 1977.

H. Kanda, T. Ogata, K. Komatani, and H. G. Et-okuno, Segmenting acoustic signal with articulatory movement using recurrent neural network for phoneme acquisition, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2008), p.58, 2008.

H. Kanda, T. Ogata, T. Takahashi, K. Komatani, and H. G. Et-okuno, Continuous vocal imitation with self-organized vowel spaces in recurrent neural network, IEEE International Conference on Robotics and Automation (ICRA 2009), p.58, 2009.

M. Kawato, Internal models for motor control and trajectory planning, Current Opinion in Neurobiology, vol.9, issue.6, p.67, 1999.

R. D. Kent, Psychobiology of speech development : Coemergence of language and a movement system, American Journal of Physiology : Regulatory, Integrative and Comparative Physiology, vol.246, issue.6, p.31, 1984.

W. Kessen, J. Levine, and K. A. Et-wendrich, The imitation of pitch in infants, Infant Behavior and Development, vol.2, p.36, 1979.

E. Kidd, Implicit statistical learning is directly associated with the acquisition of syntax, Developmental Psychology, vol.48, issue.1, pp.171-184, 2012.

S. J. Kiebel, K. Von-kriegstein, J. Daunizeau, and K. J. Et-friston, Recognizing sequences of sequences, PLoS Computational Biology, vol.5, issue.8, 2009.

S. King, J. Frankel, K. Livescu, E. Mcdermott, K. Richmond et al., Speech production knowledge in automatic speech recognition, The Journal of the Acoustical Society of America, vol.121, issue.2, p.44, 2007.

K. Kirchhoff, Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments, Proceedings of the 5th International Conference on Spoken Language Processing, p.44, 1998.

D. H. Klatt, Speech perception : A model of acoustic-phonetic analysis and lexical access, pp.243-288, 1980.

D. F. Kleinschmidt and T. F. Jaeger, A bayesian belief updating model of phonetic recalibration and selective adaptation, Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics, vol.46, p.45, 2011.

D. F. Kleinschmidt and T. F. Jaeger, Robust speech perception : Recognize the familiar, generalize to the similar, and adapt to the novel, Psychological Review, vol.122, issue.2, pp.148-203, 2015.

K. R. Kluender, R. L. Diehl, and P. R. Et-killeen, Japanese quail can learn phonetic categories, Science, vol.237, issue.4819, pp.1195-1197, 1987.

B. J. Kröger, P. Birkholz, J. Kannampuzha, C. Et-neuschaefer-rube, A. Esposito et al., Categorical perception of consonants and vowels : evidence from a neurophonetic model of speech production and perception, pp.354-361, 2011.

B. J. Kröger, P. Birkholz, A. Lowit, and C. Et-neuschaefer-rube, Phonemic, sensory, and motor representations in an action-based neurocomputational model of speech production, Speech motor control : New developments in basic and applied research, pp.23-36, 2010.

B. J. Kröger and M. Cao, The emergence of phonetic-phonological features in a biologically inspired model of speech processing, Journal of Phonetics, vol.53, pp.88-100, 2015.

B. J. Kröger, J. Kannampuzha, and E. Et-kaufmann, Associative learning and self-organization as basic principles for simulating speech acquisition, speech production, and speech perception, EPJ Nonlinear Biomedical Physics, vol.2, issue.1, pp.1-28, 2014.

B. J. Kröger, J. Kannampuzha, and C. Et-neuschaefer-rube, Towards a neurocomputational model of speech production and perception, Speech Communication, vol.51, issue.9, pp.793-809, 2009.

P. K. Kuhl, Language, mind, and brain : Experience alters perception, The New Cognitive Neurosciences, vol.2, p.38, 2000.

P. K. Kuhl, Early language acquisition : cracking the speech code, Nature Reviews Neuroscience, vol.5, issue.11, pp.831-843, 2004.

P. K. Kuhl, Is speech learning "gated" by the social brain ?, Developmental Science, vol.10, issue.1, 2007.

P. K. Kuhl, Brain mechanisms in early language acquisition, Neuron, vol.67, issue.5, p.29, 2010.

P. K. Kuhl, S. Kiritani, T. Deguchi, A. Hayashi, E. B. Stevens et al., Effects of language experience on speech perception : American and Japanese infants' perception of /ra/ and /la, The Journal of the Acoustical Society of America, vol.102, issue.5, p.29, 1997.

P. K. Kuhl and A. N. Et-meltzoff, The bimodal perception of speech in infancy, Science, vol.218, issue.4577, p.36, 1982.

P. K. Kuhl and A. N. Et-meltzoff, Infant vocalizations in response to speech : Vocal imitation and developmental change, The Journal of the Acoustical Society of America, vol.100, issue.4, p.36, 1996.

P. K. Kuhl and J. D. Miller, Speech perception by the chinchilla : Voiced-voiceless distinction in alveolar plosive consonants, Science, vol.190, issue.4209, pp.69-72, 1975.

P. K. Kuhl and D. M. Et-padden, Enhanced discriminability at the phonetic boundaries for the place feature in macaques, The Journal of the Acoustical Society of America, vol.73, issue.3, p.38, 1983.

P. K. Kuhl, R. R. Ramirez, A. Bosseler, J. L. Lin, and T. Et-imada, Infants' brain responses to speech suggest analysis by synthesis, vol.111, pp.11238-11245, 2014.

P. K. Kuhl, E. B. Stevens, A. Hayashi, T. Deguchi, S. Kiritani et al., Infants show a facilitation effect for native language phonetic perception between 6 and 12 months, Developmental Science, vol.9, issue.2, 2006.

P. K. Kuhl, F. Tsao, and H. Liu, Foreign-language experience in infancy : Effects of short-term exposure and social interaction on phonetic learning, Proceedings of the National Academy of Sciences, vol.100, issue.15, pp.9096-9101, 2003.

P. K. Kuhl, K. A. Williams, F. Lacerda, K. N. Stevens, and B. Et-lindblom, Linguistic experience alters phonetic perception in infants by 6 months of age, Science, vol.255, issue.5044, p.41, 1992.

J. R. Lackner and L. M. Et-goldstein, The psychological representation of speech sounds, The Quarterly Journal of Experimental Psychology, vol.27, issue.2, pp.173-185, 1975.

D. R. Lametti, A. Rochet-capellan, E. Neufeld, D. M. Shiller, and D. J. Ostry, Plasticity in the human speech motor system drives changes in speech perception, Journal of Neuroscience, vol.34, issue.31, pp.10339-10346, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01053445

H. Lane and B. Tranel, The lombard sign and the role of hearing in speech, Journal of Speech, Language, and Hearing Research, vol.14, issue.4, p.15, 1971.

R. E. Lasky, A. Syrdal-lasky, and R. E. Et-klein, VOT discrimination by four to six and a half month old infants from Spanish environments, Journal of Experimental Child Psychology, vol.20, issue.2, p.28, 1975.

R. Laurent, COSMO : un modèle bayésien des interactions sensori-motrices dans la perception de la parole, 2014.

R. Laurent, M. Barnaud, J. Schwartz, P. Bessière, and J. Et-diard, The complementary roles of auditory and motor information evaluated in a Bayesian perceptuo-motor model of speech perception, Psychological Review, vol.124, issue.5, pp.572-602, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01484383

R. Laurent, J. Schwartz, P. Bessière, and J. Et-diard, A computational model of perceptuomotor processing in speech perception : learning to imitate and categorize synthetic CV syllables, International Speech Communication Association (ISCA). (Cité en, p.62, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00827885

O. Lebeltel, P. Bessière, J. Diard, and E. Et-mazer, Bayesian robot programming, Autonomous Robots, vol.16, issue.1, pp.49-79, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00189723

A. Lelong, Convergence phonétique en interaction, p.15, 2012.

W. J. Levelt, Models of word production, Trends in Cognitive Sciences, vol.3, issue.6, p.51, 1999.

W. J. Levelt, A. Roelofs, and A. S. Meyer, A theory of lexical access in speech production, Behavioral and Brain Sciences, vol.22, issue.1, pp.1-38, 1999.

W. J. Levelt and L. Et-wheeldon, Do speakers have access to a mental syllabary ?, Cognition, vol.50, issue.1, pp.239-269, 1994.

A. Levitt, P. W. Jusczyk, J. Murray, and G. Et-carden, The perception of place of articulation contrasts in voiced and voiceless fricatives by two-month-old infants, Journal of Experimental Psychology : Human Perception and Performance, vol.14, p.27, 1988.

A. M. Liberman, Some results of research on speech perception, The Journal of the Acoustical Society of America, vol.29, issue.1, p.21, 1957.

A. M. Liberman, Speech : A special code, vol.10, 1996.

A. M. Liberman, F. S. Cooper, D. P. Shankweiler, and M. Studdert-kennedy, Perception of the speech code, Psychological Review, vol.74, issue.6, pp.431-461, 1967.

A. M. Liberman, P. C. Delattre, F. S. Cooper, and L. J. Et-gerstman, The role of consonant-vowel transitions in the perception of the stop and nasal consonants, Psychological Monographs : General and Applied, vol.68, issue.8, p.21, 1954.

A. M. Liberman and I. G. Mattingly, The motor theory of speech perception revised, Cognition, vol.21, issue.1, pp.1-36, 1985.

M. Lopes, F. S. Melo, B. Kenward, and J. Santos-victor, A computational model of sociallearning mechanisms, Adaptive Behavior, vol.17, issue.6, pp.467-483, 2009.

A. J. Lotto, G. Hickok, and L. L. Et-holt, Reflections on mirror neurons and speech perception, Trends in Cognitive Sciences, vol.13, issue.3, p.13, 2009.

P. A. Luce, S. D. Goldinger, E. T. Auer, and M. S. Vitevitch, Phonetic priming, neighborhood activation, and PARSYN. Attention, Perception, & Psychophysics, vol.62, p.45, 2000.

R. D. Luce, Response times : Their role in inferring elementary mental organization, p.18, 1986.

P. F. Macneilage, The frame/content theory of evolution of speech production, Behavioral and Brain Sciences, vol.21, issue.4, p.31, 1998.

P. F. Macneilage and B. L. Davis, Attention and Performance 13 : Motor representations and control, p.140, 1990.

P. F. Macneilage, B. L. Davis, and C. L. Et-matyear, Babbling and first words : Phonetic similarities and differences, Speech Communication, vol.22, issue.2-3, pp.269-277, 1997.

S. Maeda, Compensatory articulation during speech : Evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model, Speech Production and Speech Modelling, p.96, 1990.

G. Magri, Error-driven versus batch models of the acquisition of phonotactics : David defeats Goliath, Proceedings of the Annual Meetings on Phonology, p.56, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01691010

B. Mampe, A. D. Friederici, A. Christophe, and K. Et-wermke, Newborns' cry melody is shaped by their native language, Current Biology, vol.19, issue.23, pp.1994-1997, 2009.

V. Mann and H. Wimmer, Phoneme awareness and pathways into literacy : A comparison of German and American children, Reading and Writing, vol.15, issue.7, pp.653-682, 2002.

K. L. Markey, The sensorimotor foundations of phonology : a computational model of early childhood articulatory and phonetic development, vol.58, 1994.

C. J. Markiewicz and J. W. Bohland, Mapping the cortical representation of speech sounds in a syllable repetition task, NeuroImage, vol.141, pp.174-190, 2016.

D. Marr, Vision : A computational investigation into the human representation and processing of visual information, vol.52, 1982.

W. D. Marslen-wilson and P. Warren, Levels of perceptual representation and process in lexical access : Words, phonemes, and features, Psychological Review, vol.101, issue.4, p.24, 1994.

A. Martin, S. Peperkamp, and E. Et-dupoux, Learning phonemes with a proto-lexicon, Cognitive Science, vol.37, issue.1, pp.103-124, 2013.

A. Martinet, Éléments de linguistique générale, p.64, 1970.

N. Masataka, Effects of contingent and noncontingent maternal stimulation on the vocal behaviour of three-to four-month-old Japanese infants, Journal of Child Language, vol.20, issue.2, p.36, 1993.

D. W. Massaro, Preperceptual images, processing time, and perceptual units in auditory perception, Psychological Review, vol.79, issue.2, p.21, 1972.

S. L. Mattys and J. F. Et-melhorn, How do syllables contribute to the perception of spoken English ? Insight from the migration paradigm, Language and Speech, vol.48, issue.2, p.22, 2005.

D. Maurer and J. F. Werker, Perceptual narrowing during infancy : A comparison of language and faces, Developmental Psychobiology, vol.56, issue.2, p.29, 2014.

L. Max, M. E. Wallace, and I. Vincent, Sensorimotor adaptation to auditory perturbations during speech : Acoustic and kinematic experiments, Proceedings of the 15th International Congress of Phonetic Sciences, p.16, 2003.

J. Maye, D. J. Weiss, and R. N. Et-aslin, Statistical phonetic learning in infants : Facilitation and feature generalization, Developmental Science, vol.11, issue.1, pp.122-134, 2008.

J. Maye, J. F. Werker, and L. Et-gerken, Infant sensitivity to distributional information can affect phonetic discrimination, Cognition, vol.82, issue.3, pp.101-111, 2002.

J. L. Mcclelland and J. L. Et-elman, The TRACE model of speech perception, Cognitive Psychology, vol.18, issue.1, pp.1-86, 1986.

C. Mcgettigan and P. Tremblay, Links between perception and production : examining the roles of motor and premotor cortices in understanding speech, Oxford Handbook of Psycholinguistics, p.14, 2017.

B. Mcmurray, R. N. Aslin, and J. C. Toscano, Statistical learning of phonetic categories : insights from a computational approach, Developmental Science, vol.12, issue.3, pp.369-378, 2009.

D. Mcneill and K. Lindig, The perceptual reality of phonemes, syllables, words, and sentences, Journal of Verbal Learning and Verbal Behavior, vol.12, issue.4, p.21, 1973.

J. M. Mcqueen and A. Cutler, Spoken word access processes : An introduction. Language and Cognitive Processes, vol.16, p.24, 2001.

J. M. Mcqueen, A. Cutler, and D. Norris, Why Merge really is autonomous and parsimonious, 2000 ISCA Tutorial and Research Workshop (ITRW) on Spoken Word Access Processes, pp.47-50, 2000.

J. Mehler, J. Y. Dommergues, U. H. Frauenfelder, and J. Segui, The syllable's role in speech segmentation, Journal of Verbal Learning and Verbal Behavior, vol.20, issue.3, pp.298-305, 1981.

J. Mehler and R. Hayes, The role of syllables in speech processing : Infant and adult data, Philosophical Transactions of the Royal Society of London B : Biological Sciences, vol.295, pp.333-352, 1077.

R. P. Meier, L. Mcgarvin, R. A. Zakia, and R. Et-willerman, Silent mandibular oscillations in vocal babbling, Phonetica, vol.54, issue.3-4, p.31, 1997.

I. G. Meister, S. M. Wilson, C. Deblieck, A. D. Wu, and M. Et-iacoboni, The essential role of premotor cortex in speech perception, Current Biology, vol.17, issue.19, pp.1692-1696, 2007.

A. N. Meltzoff and M. K. Moore, Imitation of facial and manual gestures by human neonates, Science, vol.198, issue.4312, p.36, 1977.

L. Ménard and J. Schwartz, Perceptuo-motor biases in the perceptual organization of the height feature in french vowels, Acta Acustica united with Acustica, vol.100, issue.4, pp.676-689, 2014.

L. Ménard, J. Schwartz, and J. Et-aubin, Invariance and variability in the production of the height feature in French vowels, Speech Communication, vol.50, issue.1, pp.14-28, 2008.

L. Ménard, J. Schwartz, and L. Et-boë, Role of vocal tract morphology in speech development : Perceptual targets and sensorimotor maps for synthesized French vowels from birth to adulthood, Journal of Speech, Language, and Hearing Research, vol.47, issue.5, pp.1059-1080, 2004.

L. Ménard, J. Schwartz, L. Boë, S. Kandel, and N. Et-vallée, Auditory normalization of French vowels synthesized by an articulatory model simulating growth from birth to adulthood, The Journal of the Acoustical Society of America, vol.111, issue.4, p.178, 2002.

R. Meringer and K. Mayer, Versprechen und Verlesen : eine Psychologish-linguistische Studie, vol.23, 1895.

P. Messum, The role of imitation in learning to pronounce, p.57, 2008.

P. Messum and I. S. Howard, Creating the cognitive form of phonological units : The speech sound correspondence problem in infancy could be solved by mirrored vocal interactions rather than by imitation, Journal of Phonetics, vol.53, pp.125-140, 2015.

C. ;. Meunier, V. Rolland-monnoury, S. Pinto, and C. Et-ozsancak, Phonétique acoustique, Les dysarthries, pp.164-173, 2007.

G. Miceli, G. Gainotti, C. Caltagirone, and C. Et-masullo, Some aspects of phonological impairment in aphasia, Brain and Language, vol.11, issue.1, p.13, 1980.

C. B. Mills, Effects of context on reaction time to phonemes, Journal of Verbal Learning and Verbal Behavior, vol.19, issue.1, p.21, 1980.

D. Mirman, J. L. Mcclelland, and L. L. Holt, An interactive hebbian account of lexically guided tuning of speech perception, Psychonomic Bulletin & Review, vol.13, issue.6, pp.958-965, 2006.

P. R. Mitchell and R. D. Kent, Phonetic variation in multisyllable babbling, Journal of Child Language, vol.17, issue.2, p.31, 1990.

H. Mitterer, O. Scharenborg, and J. M. Et-mcqueen, Phonological abstraction without phonemes in speech perception, Cognition, vol.129, issue.2, p.24, 2013.

K. Miura, Y. Yoshikawa, and M. Et-asada, Unconscious anchoring in maternal imitation that helps find the correspondence of a caregiver's vowel categories, Advanced Robotics, vol.21, issue.13, p.59, 2007.

K. Miura, Y. Yoshikawa, and M. Et-asada, Realizing being imitated : Vowel mapping with clearer articulation, IEEE International Conference on Developmental and Learning (ICDL 2008), p.59, 2008.

K. Miura, Y. Yoshikawa, and M. Et-asada, Vowel acquisition based on an auto-mirroring bias with a less imitative caregiver, Advanced Robotics, vol.26, issue.1-2, pp.23-44, 2012.

J. Morais, Literacy and awareness of the units of speech : Implications for research on the units of perception, Linguistics, vol.23, issue.5, p.22, 1985.

J. Morais, L. Cary, J. Alegria, and P. Et-bertelson, Does awareness of speech as a sequence of phones arise spontaneously ?, Cognition, vol.7, issue.4, pp.323-331, 1979.

J. Morais, A. Content, L. Cary, J. Mehler, and J. Et-segui, Syllabic segmentation and literacy, Language and Cognitive Processes, vol.4, issue.1, p.22, 1989.

B. Morillon, C. Liégeois-chauvel, L. H. Arnal, C. Bénar, and A. Et-giraud, Asymmetric function of theta and gamma activity in syllable processing : an intra-cortical study, Frontiers in Psychology, vol.3, p.25, 2012.
URL : https://hal.archives-ouvertes.fr/hal-02088050

R. Möttönen, R. Dutton, and K. E. Et-watkins, Auditory-motor processing of speech sounds, Cerebral Cortex, vol.23, issue.5, p.13, 2012.

R. Möttönen and K. E. Watkins, Motor representations of articulators contribute to categorical perception of speech sounds, Journal of Neuroscience, vol.29, issue.31, pp.9819-9825, 2009.

C. Moulin-frier, Rôle des relations perception-action dans la communication parlée et l'émergence des systèmes phonologiques : étude, modélisation computationnelle et simulations, 2011.

C. Moulin-frier, J. Diard, J. Schwartz, and P. Et-bessière, COSMO ("Communicating about Objects using Sensory-Motor Operations") : A Bayesian modeling framework for studying speech communication and the emergence of phonological systems, Journal of Phonetics, vol.53, pp.5-41, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01230175

C. Moulin-frier, R. Laurent, P. Bessière, J. Schwartz, and J. Et-diard, Adverse conditions improve distinguishability of auditory, motor, and perceptuo-motor theories of speech perception : An exploratory Bayesian modelling study, Language and Cognitive Processes, vol.27, issue.7-8, pp.1240-1263, 2012.

C. Moulin-frier, S. M. Nguyen, and P. Oudeyer, Self-organization of early vocal development in infants and machines : the role of intrinsic motivation, Frontiers in Psychology, vol.4, p.58, 1006.
URL : https://hal.archives-ouvertes.fr/hal-00927940

C. Moulin-frier and P. Oudeyer, Curiosity-driven phonetic learning, The 2nd Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics (ICDL-Epirob 2012), p.57, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00762795

C. Moulin-frier and P. Oudeyer, Exploration strategies in developmental robotics : a unified probabilistic framework, The 3rd Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics (ICDL-Epirob 2013), pp.1-6, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00860641

D. E. Mowrer, Phonological development during the first year of life. Speech and Language : Advances in Basic Research and Practice, vol.4, p.29, 1980.

B. Munson, J. Edwards, and M. E. Et-beckman, Phonological representations in language acquisition : Climbing the ladder of abstraction, The Oxford Handbook of Laboratory Phonology, p.36, 2011.

M. Murakami, B. J. Kröger, P. Birkholz, and J. Et-triesch, Seeing [u] aids vocal learning : Babbling and imitation of vowels using a 3D vocal tract model, reinforcement learning, and reservoir computing, The 5th Joint IEEE International Conference Developmental Learning and Epigenetic Robotics, pp.208-213, 2015.

R. Näätänen, T. Kujala, and I. Et-winkler, Auditory processing that leads to conscious perception : a unique window to central auditory processing opened by the mismatch negativity and related responses, Psychophysiology, vol.48, issue.1, p.24, 2011.

R. Näätänen, A. Lehtokoski, M. Lennes, M. Cheour, M. Huotilainen et al., Language-specific phoneme representations revealed by electric and magnetic brain responses, Nature, vol.385, issue.6615, pp.432-434, 1997.

R. Näätänen, P. Paavilainen, T. Rinne, and K. Et-alho, The mismatch negativity (MMN) in basic research of central auditory processing : a review, Clinical Neurophysiology, vol.118, issue.12, p.24, 2007.

S. Najnin and B. Banerjee, Emergence of vocal developmental sequences in a predictive coding model of speech acquisition, Proceedings of Interspeech 2016, vol.60, pp.1113-1117, 2016.

D. G. Nelson, P. W. Jusczyk, D. R. Mandel, J. Myers, A. Turk et al., The head-turn preference procedure for testing auditory perception, Infant Behavior and Development, vol.18, issue.1, p.28, 1995.

R. S. Newman, Using links between speech perception and speech production to evaluate different acoustic metrics : A preliminary report, The Journal of the Acoustical Society of America, vol.113, issue.5, p.19, 2003.

S. Nittrouer, Challenging the notion of innate phonetic boundaries, The Journal of the Acoustical Society of America, vol.110, issue.3, p.27, 2001.

D. Norris, Shortlist : A connectionist model of continuous speech recognition, Cognition, vol.52, issue.3, pp.189-234, 1994.

D. Norris and A. Cutler, The relative accessibility of phonemes and syllables, Perception & Psychophysics, vol.43, issue.6, pp.541-550, 1988.

D. Norris and J. M. Et-mcqueen, Shortlist B : a Bayesian model of continuous speech recognition, Psychological Review, vol.115, issue.2, p.357, 2008.

D. Norris, J. M. Mcqueen, and A. Cutler, Merging information in speech recognition : Feedback is never necessary, Behavioral and Brain Sciences, vol.23, issue.3, pp.299-325, 2000.

N. Nozari, G. S. Dell, and M. F. Schwartz, Is comprehension necessary for error detection ? A conflict-based account of monitoring in speech production, Cognitive Psychology, vol.63, issue.1, pp.1-33, 2011.

J. J. Ohala, B. L. Derwing, T. M. Nearey, and M. L. Dow, On the phoneme as the unit of the "second articulation, Phonology, vol.3, issue.1, p.23, 1986.

V. Ojanen, R. Möttönen, J. Pekkola, I. P. Jääskeläinen, R. Joensuu et al., Processing of audiovisual speech in Broca's area, NeuroImage, vol.25, issue.2, p.13, 2005.

D. K. Oller, G. Yeni-komshian, J. F. Kavanagh, and C. A. Et-ferguson, The emergence of the sounds of speech in infancy, Child Phonology, vol.1, pp.93-112, 1980.

D. K. Oller, The emergence of the capacity for speech, 2000.

D. K. Oller and R. E. Et-eilers, The role of audition in infant babbling, Child Development, vol.59, issue.2, pp.441-449, 1988.

T. Otake, G. Hatano, A. Cutler, and J. Et-mehler, Mora or syllable ? Speech segmentation in Japanese, Journal of Memory and Language, vol.32, issue.2, p.22, 1993.

P. Oudeyer, Phonemic coding might result from sensory-motor coupling dynamics, Proceedings of the 7th International Conference on Simulation of Adaptive Behavior : From Animals to Animats, p.58, 2002.

P. Oudeyer, A. Baranes, and F. Et-kaplan, Intrinsically motivated exploration for developmental and active sensorimotor learning, p.185, 2010.
DOI : 10.1007/978-3-642-05181-4_6

URL : http://www.pyoudeyer.com/OudeyerBaranesKaplanMotorLearning09.pdf

C. Pallier, Phonemes and syllables in speech perception : size of the attentional focus in French, Proceedings of Eurospeech-97, vol.23, 1997.

M. Papou?ek and H. Papou?ek, Musical elements in the infant's vocalization : Their significance for communication, cognition, and creativity, Advances in Infancy Research, p.36, 1981.

J. S. Pardo, Measuring phonetic convergence in speech production, Frontiers in Psychology, vol.4, issue.559, p.15, 2013.

J. Patri, J. Diard, and P. Et-perrier, Modélisation bayésienne de la planification motrice des gestes de parole : Evaluation du rôle des différentes modalités sensorielles, 31ème Journées d'Études sur la Parole, pp.419-427, 2016.

J. Patri, J. Diard, J. Schwartz, and P. Pascal, What drives the perceptual change resulting from speech motor adaptation ? Evaluation of hypotheses in a Bayesian modeling framework, PLoS Computational Biology, p.203
URL : https://hal.archives-ouvertes.fr/hal-01701562

M. G. Peeva, F. H. Guenther, J. A. Tourville, A. Nieto-castanon, J. Anton et al., Distinct representations of phonemes, syllables, and supra-syllabic sequences in the speech production network, NeuroImage, vol.50, issue.2, pp.626-638, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01440436

B. Pelucchi, J. F. Hay, and J. R. Et-saffran, Statistical learning in a natural language by 8-monthold infants, Child Development, vol.80, issue.3, pp.674-685, 2009.

S. Peperkamp, Phonological acquisition : Recent attainments and new challenges, Language and Speech, vol.46, issue.2-3, pp.87-113, 2003.

S. Peperkamp, R. Le-calvez, J. Nadal, and E. Et-dupoux, The acquisition of allophonic rules : Statistical learning with linguistic constraints, Cognition, vol.101, issue.3, pp.31-41, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00143852

J. S. Perkell, F. H. Guenther, H. Lane, M. L. Matthies, E. Stockmann et al., The distinctness of speakers' productions of vowel contrasts is related to their discrimination of the contrasts, The Journal of the Acoustical Society of America, vol.116, issue.4, p.19, 2004.

J. S. Perkell and D. H. Et-klatt, Invariance and variability in speech processes, p.14, 1986.

J. S. Perkell, H. Lane, S. Ghosh, M. L. Matthies, M. Tiede et al., Mechanisms of vowel production : auditory goals and speaker acuity, Proceedings of the 8th International Seminar on Speech Production, p.19, 2008.

J. S. Perkell, M. L. Matthies, M. Tiede, H. Lane, M. Zandipour et al., The distinctness of speakers' /s/-/S/ contrast is related to their auditory discrimination and use of an articulatory saturation effect, Journal of Speech, Language, and Hearing Research, vol.47, issue.6, p.19, 2004.

A. K. Philippsen, F. Reinhart, and B. Et-wrede, Efficient bootstrapping of vocalization skills using active goal babbling, International Workshop on Speech Robotics at Interspeech. (Cité en, p.57, 2015.

A. K. Philippsen, R. F. Reinhart, and B. Et-wrede, Learning how to speak : Imitation-based refinement of syllable production in an articulatory-acoustic model, The 4th Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics, pp.195-200, 2014.

A. K. Philippsen, R. F. Reinhart, and B. Et-wrede, Goal babbling of acoustic-articulatory models with adaptive exploration noise, The 6th Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics, pp.72-78, 2016.

M. J. Pickering and S. Garrod, An integrated theory of language production and comprehension, Behavioral and Brain Sciences, vol.36, issue.4, pp.329-347, 2013.

J. B. Pierrehumbert, Exemplar dynamics : word frequency, lenition, and contrast, Frequency effects and the emergence of linguistic structure, p.55, 2001.

J. B. Pierrehumbert, Phonetic diversity, statistical learning, and acquisition of phonology, Language and Speech, vol.46, issue.2-3, pp.115-154, 2003.

J. B. Pierrehumbert, Phonological representation : beyond abstract versus episodic, Annual Review of Linguistics, vol.2, pp.33-52, 2016.

M. A. Pitt and A. G. Samuel, Attentional allocation during speech perception : How fine is the focus ?, Journal of Memory and Language, vol.29, issue.5, pp.611-632, 1990.

D. Poeppel, The analysis of speech in different temporal integration windows : cerebral lateralization as "asymmetric sampling in time, Speech Communication, vol.41, issue.1, pp.245-255, 2003.

D. Poeppel, T. Overath, A. N. Popper, and R. R. Fay, The Human Auditory Cortex, p.176, 2012.

L. Polka, S. Rvachew, and K. Et-mattock, Experiential influences on speech perception and speech production in infancy, Blackwell Handbook of Language Development, p.36, 2007.

L. Polka and J. F. Werker, Developmental changes in perception of nonnative vowel contrasts, Journal of Experimental Psychology : Human Perception and Performance, vol.20, issue.2, p.421, 1994.

R. J. Porter and F. X. Castellanos, Speech-production measures of speech perception : Rapid shadowing of VCV syllables, The Journal of the Acoustical Society of America, vol.67, issue.4, p.18, 1980.

A. Postma, Detection of errors during speech production : A review of speech monitoring models, Cognition, vol.77, issue.2, p.51, 2000.

C. J. Price, A review and synthesis of the first 20 years of PET and fMRI studies of heard speech, spoken language and reading, NeuroImage, vol.62, issue.2, p.26, 2012.

F. Pulvermüller and L. Fadiga, Active perception : sensorimotor circuits as a cortical basis for language, Nature Reviews Neuroscience, vol.11, issue.5, p.14, 2010.

F. Pulvermüller, M. Huss, F. Kherif, F. M. Del-prado-martin, O. Hauk et al., Motor cortex maps articulatory features of speech sounds, Proceedings of the National Academy of Sciences, vol.103, issue.20, p.13, 2006.

D. W. Purcell and K. G. Et-munhall, Adaptive control of vowel formant frequency : Evidence from real-time formant manipulation, The Journal of the Acoustical Society of America, vol.120, issue.2, p.16, 2006.

L. Rapin, J. Schwartz, and L. Et-ménard, Are idiosyncrasies in vowel production free or learned ? A study of variants of the French vowel system in biological brothers, The Journal of the Acoustical Society of America, vol.141, issue.5, pp.3582-3582, 2017.

J. P. Rauschecker and S. K. Scott, Maps and streams in the auditory cortex : nonhuman primates illuminate human speech processing, Nature Neuroscience, vol.12, issue.6, pp.718-724, 2009.

R. E. Remez, Analogy and disanalogy in production and perception of speech. Language, Cognition and Neuroscience, vol.30, pp.273-286, 2015.

C. Richter, N. H. Feldman, H. Salgado, and A. Et-jansen, A framework for evaluating speech representations, Proceedings of the 38th Annual Conference of the Cognitive Science Society, p.51, 2016.

G. Rizzolatti, L. Fadiga, V. Gallese, and L. Et-fogassi, Premotor cortex and the recognition of motor actions, Cognitive Brain Research, vol.3, issue.2, pp.131-141, 1996.

G. Rizzolatti, L. Fadiga, M. Matelli, V. Bettinardi, E. Paulesu et al., Localization of grasp representations in humans by PET : 1. Observation versus execution, Experimental Brain Research, vol.111, issue.2, p.12, 1996.

C. Rogalsky, T. Love, D. Driscoll, S. W. Anderson, and G. Et-hickok, Are mirror neurons the basis of speech perception ? Evidence from five cases with damage to the purported human mirror system, Neurocase, vol.17, issue.2, p.13, 2011.

J. C. Rogers, R. Möttönen, R. Boyles, and K. E. Et-watkins, Discrimination of speech and non-speech sounds following theta-burst stimulation of the motor cortex, Frontiers in Psychology, vol.5, p.13, 2014.

M. Rolf and J. J. Steil, Goal babbling : a new concept for early sensorimotor exploration, Humanoid Robots, Workshop on Developmental Robotics : Can developmental robotics yield human-like cognitive abilities ? (Cité en, p.57, 2012.

M. Rolf, J. J. Steil, and M. Et-gienger, Goal babbling permits direct learning of inverse kinematics, IEEE Transactions on Speech and Audio Processing, vol.2, issue.3, pp.216-229, 2010.

M. Rolf, J. J. Steil, and M. Et-gienger, Online goal babbling for rapid bootstrapping of inverse models in high dimensions, IEEE International Conference on Developmental and Learning, vol.2, p.57, 2011.

L. Roug, I. Landberg, and L. Et-lundberg, Phonetic development in early infancy : A study of four Swedish children during the first eighteen months of life, Journal of Child Language, vol.16, issue.1, pp.19-40, 1989.

P. Rubin, M. T. Turvey, and P. Et-van-gelder, Initial phonemes are detected faster in spoken words than in spoken nonwords, Perception & Psychophysics, vol.19, issue.5, p.21, 1976.

J. R. Saffran, Statistical language learning mechanisms and constraints, Current Directions in Psychological Science, vol.12, issue.4, pp.110-114, 2003.

J. R. Saffran, R. N. Aslin, and E. L. Newport, Statistical learning by 8-month-old infants, Science, vol.274, pp.1926-1928, 1996.

A. Saghiran, COSMO WordPhon, une modélisation de l'influence de l'information lexicale dans l'apprentissage des catégories phonémiques, 2017.

N. H. Salminen, H. Tiitinen, and P. J. Et-may, Modeling the categorical perception of speech sounds : A step toward biological plausibility, Cognitive, Affective, & Behavioral Neuroscience, vol.9, issue.3, p.56, 2009.

B. Samlowski, The syllable as a processing unit in speech production : evidence from frequency effects on coarticulation, p.22, 2016.

A. G. Samuel, Speech perception, Annual Review of Psychology, vol.62, p.14, 2011.

B. D. Sarma and S. M. Prasanna, Acoustic-phonetic analysis for speech recognition : A review, IETE Technical Review, p.44, 2017.

M. Sato, K. Grabski, M. Garnier, L. Granjon, J. Schwartz et al., Converging toward a common speech code : imitative and perceptuo-motor recalibration processes in speech production, Frontiers in Psychology, vol.4, issue.422, p.15, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00874984

M. Sato, K. Grabski, A. M. Glenberg, A. Brisebois, A. Basirat et al., Articulatory bias in speech categorization : Evidence from use-induced motor plasticity, Cortex, vol.47, issue.8, p.13, 2011.

M. Sato, P. Tremblay, and V. L. Et-gracco, A mediating role of the premotor cortex in phoneme segmentation, Brain and Language, vol.111, issue.1, p.13, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00438181

H. B. Savin and T. G. Bever, The nonperceptual reality of the phoneme, Journal of Verbal Learning and Verbal Behavior, vol.9, issue.3, pp.295-302, 1970.

O. Scharenborg, Fine-phonetic variation in a computational model of word recognition, The Journal of the Acoustical Society of America, vol.123, issue.5, p.45, 2008.

O. Scharenborg and L. Boves, Computational modelling of spoken-word recognition processes : Design choices and evaluation, Pragmatics & Cognition, vol.18, issue.1, p.44, 2010.

O. Scharenborg, D. Norris, L. Bosch, and J. M. Et-mcqueen, How should a speech recognizer work ?, Cognitive Science, vol.29, issue.6, p.45, 2005.

N. O. Schiller, Phonological encoding in speech production, 2006 ISCA Tutorial and Research Workshop (ITRW) on Experimental Linguistics, p.51, 2006.

M. R. Schomers and F. Pulvermüller, Is the sensorimotor cortex relevant for speech perception and understanding ? an integrative review, Frontiers in Human Neuroscience, vol.10, issue.435, pp.1-18, 2016.

M. Schroeder, B. Atal, and J. Hall, Objective measure of certain speech signal degradations based on masking properties of human auditory perception, Frontiers of Speech Communication Research, p.96, 1979.

J. Schroeter and M. M. Sondhi, Techniques for estimating vocal-tract shapes from the speech signal, IEEE Transactions on Speech and Audio Processing, vol.2, issue.1, p.176, 1994.
DOI : 10.1109/89.260356

W. L. Schuerman, A. S. Meyer, and J. M. Et-mcqueen, Mapping the speech code : Cortical responses linking the perception and production of vowels, Frontiers in Human Neuroscience, vol.11, issue.161, p.17, 2017.

J. Schwartz, A. Basirat, L. Ménard, and M. Sato, The Perception-for-Action-Control Theory (PACT) : A perceptuo-motor theory of speech perception, Journal of Neurolinguistics, vol.25, issue.5, p.12, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00442367

J. Schwartz, L. Boë, N. Vallée, and C. Et-abry, The dispersion-focalization theory of vowel systems, Journal of Phonetics, vol.25, issue.3, p.113, 1997.
DOI : 10.1006/jpho.1997.0043

J. Schwartz, P. Escudier, P. ;. Et-teissier, . Iste, and U. K. London, Multimodal speech : Two or three senses are better than one, Language and Speech Processing, pp.377-415, 2010.
DOI : 10.1002/9780470611180.ch11

J. Schwartz, J. Robert-ribes, P. Escudier, B. Burnham, D. Campbell et al., Ten years after Summerfield : a taxonomy of models for audio-visual fusion in speech perception, pp.85-108, 1998.

S. K. Scott and I. S. Et-johnsrude, The neuroanatomical and functional organization of speech perception, Trends in Neurosciences, vol.26, issue.2, p.14, 2003.
DOI : 10.1016/s0166-2236(02)00037-1

URL : http://pspl.technion.ac.il/~karniel/CMCC/Neuroanatomy_of_speech_perception.pdf

J. Segui, U. H. Frauenfelder, and J. Et-mehler, Phoneme monitoring, syllable monitoring and lexical access, British Journal of Psychology, vol.72, issue.4, p.21, 1981.
DOI : 10.1111/j.2044-8295.1981.tb01776.x

URL : https://archive-ouverte.unige.ch/unige:83840/ATTACHMENT01

A. Seidl and A. Cristia, Infants' learning of phonological status, Frontiers in Psychology, vol.3, p.448, 2012.
DOI : 10.3389/fpsyg.2012.00448

URL : https://www.frontiersin.org/articles/10.3389/fpsyg.2012.00448/pdf

W. Serniclaes and L. Sprenger-charolles, Categorical perception of speech sounds and dyslexia. Current psychology letters, Behaviour, brain & cognition, vol.1, issue.10, pp.1-8, 2003.

A. Sharma and M. F. Dorman, Cortical auditory evoked potential correlates of categorical perception of voice-onset time, The Journal of the Acoustical Society of America, vol.106, issue.2, p.24, 1999.

D. M. Shiller, M. Sato, V. L. Gracco, and S. R. Baum, Perceptual recalibration of speech sounds following speech motor learning, The Journal of the Acoustical Society of America, vol.125, issue.2, pp.1103-1113, 2009.
DOI : 10.1121/1.3058638

URL : https://hal.archives-ouvertes.fr/hal-00366425

Y. Shtyrov, T. Kujala, J. Ahveninen, M. Tervaniemi, P. Alku et al., Background acoustic noise and the hemispheric lateralization of speech processing in the human brain : magnetic mismatch negativity study, Neuroscience Letters, vol.251, issue.2, p.24, 1998.

Y. Shtyrov, T. Kujala, S. Palva, R. J. Ilmoniemi, and R. Et-näätänen, Discrimination of speech and of complex nonspeech sounds of different temporal structure in the left and right cerebral hemispheres, NeuroImage, vol.12, issue.6, p.24, 2000.

W. T. Siok, Z. Jin, P. Fletcher, and L. H. Et-tan, Distinct brain regions associated with syllable and phoneme, Human Brain Mapping, vol.18, issue.3, pp.201-207, 2003.
DOI : 10.1002/hbm.10094

B. F. Skinner, Verbal Behavior. BF Skinner Foundation, vol.33, 1957.

J. I. Skipper, J. T. Devlin, and D. R. Et-lametti, The hearing ear is always found close to the speaking tongue : Review of the role of the motor system in speech perception, Brain and Language, vol.164, p.14, 2017.

J. I. Skipper, V. Van-wassenhove, H. C. Nusbaum, and S. L. Small, Hearing lips and seeing voices : How cortical areas supporting speech production mediate audiovisual speech perception, Cerebral Cortex, vol.17, issue.10, pp.2387-2399, 2007.
DOI : 10.1093/cercor/bhl147

URL : https://academic.oup.com/cercor/article-pdf/17/10/2387/888794/bhl147.pdf

B. L. Smith, S. Brown-sweeney, and C. Et-stoel-gammon, A quantitative analysis of reduplicated and variegated babbling, First Language, vol.9, issue.6, p.31, 1989.
DOI : 10.1177/014272378900900605

R. E. Stark, Stages of speech development in the first year of life, Child phonology, vol.1, pp.73-90, 1980.

A. Stasenko, F. E. Garcea, and B. Z. Et-mahon, What happens to the motor theory of perception when the motor system is damaged ?, Language and Cognition, vol.5, issue.2-3, p.14, 2013.

R. Stetson, Motor Phonetics : a study of speech movements in articulation, p.21, 1951.

K. N. Stevens, On the quantal nature of speech, Journal of Phonetics, vol.17, issue.1, p.79, 1989.

K. N. Stevens and S. J. Keyser, Quantal theory, enhancement and overlap, Journal of Phonetics, vol.38, issue.1, p.79, 2010.

M. Studdert-kennedy, From continuous signal to discrete message : Syllable to phoneme, The role of speech in language, p.21, 1975.

Q. Summerfield, Some preliminaries to a compre hensive account of audio-visual speech perception, pp.3-226, 1987.

H. M. Sussman, A neuronal model for syllable representation, Brain and Language, vol.22, issue.1, pp.167-177, 1984.

D. Swingley, Statistical clustering and the contents of the infant vocabulary, Cognitive Psychology, vol.50, issue.1, pp.86-132, 2005.

T. Taniguchi, S. Nagasaka, and R. Et-nakashima, Nonparametric bayesian double articulation analyzer for direct language acquisition from continuous speech signals, IEEE Transactions on Cognitive and Developmental Systems, vol.8, issue.3, p.56, 2016.

T. Teinonen, R. N. Aslin, P. Alku, and G. Et-csibra, Visual speech contributes to phonetic learning in 6-month-old infants, Cognition, vol.108, issue.3, p.35, 2008.

E. Thelen, Rhythmical behavior in infancy : An ethological perspective, Developmental Psychology, vol.17, issue.3, p.31, 1981.

S. P. Thompson and E. L. Newport, Statistical learning of syntax : The role of transitional probability, Language Learning and Development, vol.3, issue.1, pp.1-42, 2007.

I. Toni, F. P. De-lange, M. L. Noordzij, and P. Et-hagoort, Language beyond action, Journal of Physiology-Paris, vol.102, issue.1, p.13, 2008.

J. A. Tourville and F. H. Guenther, The DIVA model : A neural theory of speech acquisition and production, Language and Cognitive Processes, vol.26, pp.952-981, 2011.

S. E. Trehub, Infants' sensitivity to vowel and tonal contrasts, Developmental Psychology, vol.9, issue.1, p.27, 1973.

S. E. Trehub, The discrimination of foreign speech contrasts by infants and adults, Child Development, vol.47, issue.2, p.27, 1976.

A. Treille, Percevoir et agir : La nature sensorimotrice, multisensorielle et prédictive de la perception de la parole, 2017.

F. Tsao, H. Liu, P. K. Kuhl, and C. Tseng, Perceptual discrimination of a Mandarin fricative-affricate contrast by English-learning and Mandarin-learning infants, Poster presented at the International Meeting of the Society on Infant Studies, p.29, 2000.

G. K. Vallabha, J. L. Mcclelland, F. Pons, J. F. Werker, and S. Et-amano, Unsupervised learning of vowel categories from infant-directed speech, Proceedings of the National Academy of Sciences, vol.104, issue.33, pp.13273-13278, 2007.

B. Varadarajan, S. Khudanpur, and E. Et-dupoux, Unsupervised learning of acoustic sub-word units, Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies, p.56, 2008.

M. Vaz, H. Brandl, F. Joublin, and C. Et-goerick, Learning from a tutor : Embodied speech acquisition and imitation learning, IEEE International Conference on Developmental and Learning (ICDL 2009), pp.1-6, 2009.
DOI : 10.1109/devlrn.2009.5175543

M. M. Vihman and U. K. Chichester, Phonological development : The first two years, vol.27, 2013.

M. M. Vihman and S. Nakai, Experimental evidence for an effect of vocal experience on infant speech perception, Proceedings of the 15th International Congress of Phonetic Sciences, p.36, 2003.

V. M. Villacorta, J. S. Perkell, and F. H. Guenther, Sensorimotor adaptation to feedback perturbations of vowel acoustics and its relation to perception, The Journal of the Acoustical Society of America, vol.122, issue.4, pp.2306-2319, 2007.
DOI : 10.1121/1.2773966

A. Vinter, The role of movement in eliciting early imitations, Child Development, vol.57, issue.1, p.36, 1986.

G. Walker, Computational Modeling of Speech Production and Aphasia, p.51, 2016.

A. S. Warlaumont, A spiking neural network model of canonical babbling development, The 2nd Joint IEEE International Conference on Developmental and Learning and on Epigenetic Robotics (ICDL-Epirob 2012), pp.1-6, 2012.
DOI : 10.1109/devlrn.2012.6400842

A. S. Warlaumont, G. Westermann, E. H. Buder, and D. K. Oller, Prespeech motor learning in a neural network using reinforcement, Neural Networks, vol.38, pp.64-75, 2013.
DOI : 10.1016/j.neunet.2012.11.012

URL : http://europepmc.org/articles/pmc3541464?pdf=render

A. S. Warlaumont, G. Westermann, and D. K. Et-oller, Self-production facilitates and adult input interferes in a neural network model of infant vowel imitation, AISB 2011 Computational Models of Cognitive Development. Society for the Study of Artificial Intelligence and the Simulation of Behaviour, p.58, 2011.

K. E. Watkins, A. P. Strafella, and T. Et-paus, Seeing and hearing speech excites the motor system involved in speech production, Neuropsychologia, vol.41, issue.8, p.13, 2003.
DOI : 10.1016/s0028-3932(02)00316-0

A. Weber and O. Scharenborg, Models of spoken-word recognition, Wiley Interdisciplinary Reviews : Cognitive Science, vol.3, issue.3, p.44, 2012.
DOI : 10.1002/wcs.1178

URL : http://pubman.mpdl.mpg.de/pubman/item/escidoc:1169572/component/escidoc:1451148/Weber_WIREs_2012.pdf

J. F. Werker, Cross-language speech perception : Development change does not involve loss, The Development of Speech Perception, pp.95-120, 1994.

J. F. Werker, J. H. Gilbert, K. Humphrey, and R. C. Et-tees, Developmental aspects of crosslanguage speech perception, Child Development, vol.52, issue.1, p.28, 1981.

J. F. Werker and T. K. Hensch, Critical periods in speech perception : new directions, Annual Review of Psychology, vol.66, p.182, 2015.
DOI : 10.1146/annurev-psych-010814-015104

J. F. Werker, L. Polka, and J. E. Et-pegg, The conditioned head turn procedure as a method for testing infant speech perception, Early Development and Parenting, vol.6, issue.34, p.28, 1997.
DOI : 10.1002/(sici)1099-0917(199709/12)6:3/4<171::aid-edp156>3.3.co;2-8

J. F. Werker and R. C. Et-tees, Cross-language speech perception : Evidence for perceptual reorganization during the first year of life, Infant Behavior and Development, vol.7, issue.1, p.28, 1984.

G. Westermann and E. R. Miranda, A new model of sensorimotor coupling in the development of speech, Brain and Language, vol.89, issue.2, pp.393-400, 2004.

K. S. White, S. Peperkamp, C. Kirk, and J. L. Et-morgan, Rapid acquisition of phonological alternations by infants, Cognition, vol.107, issue.1, pp.238-265, 2008.
DOI : 10.1016/j.cognition.2007.11.012

URL : http://europepmc.org/articles/pmc2941201?pdf=render

S. M. Wilson and M. Iacoboni, Neural responses to non-native phonemes varying in producibility : Evidence for the sensorimotor nature of speech perception, NeuroImage, vol.33, issue.1, p.13, 2006.

S. M. Wilson, A. P. Saygin, M. I. Sereno, and M. Et-iacoboni, Listening to speech activates motor areas involved in speech production, Nature Neuroscience, vol.7, issue.7, p.13, 2004.
DOI : 10.1038/nn1263

D. M. Wolpert, R. C. Miall, and M. Et-kawato, Internal models in the cerebellum, Trends in Cognitive Sciences, vol.2, issue.9, p.67, 1998.

H. H. Yeung and J. F. Werker, Learning words' sounds before learning how words sound : 9month-olds use distinct objects as cues to categorize speech information, Cognition, vol.113, issue.2, p.35, 2009.

K. A. Yoshida, F. Pons, J. Maye, and J. F. Et-werker, Distributional phonetic learning at 10 months of age, Infancy, vol.15, issue.4, pp.420-433, 2010.

Y. Yoshikawa, M. Asada, K. Hosoda, and J. Et-koga, A constructivist approach to infants' vowel acquisition through mother-infant interaction, Connection Science, vol.15, issue.4, p.59, 2003.

M. Yu, C. Mo, Y. Li, and L. Et-mo, Distinct representations of syllables and phonemes in Chinese production : Evidence from fMRI adaptation, Neuropsychologia, vol.77, p.26, 2015.

A. A. Zekveld, D. J. Heslenfeld, J. M. Festen, and R. Et-schoonhoven, Top-down and bottom-up processes in speech comprehension, NeuroImage, vol.32, issue.4, pp.1826-1836, 2006.

Z. Zheng, Perceptual processing of auditory feedback during speech production and its neural substrates, 2012.

J. C. Ziegler and U. Goswami, Reading acquisition, developmental dyslexia, and skilled reading across languages : a psycholinguistic grain size theory, Psychological Bulletin, vol.131, issue.1, p.3, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00201391

A. Zolnay, R. Schluter, and H. Et-ney, Acoustic feature combination for robust speech recognition, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005), p.44, 2005.