To date, the field of lip-based identification remains relatively unexplored.

Our algorithm has already been integrated into an expression recognition system, according to Hammal, 2003.

R. Amini, Using dynamic programming for solving variational problems in vision, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.12, issue.9, pp.855-867, 1990.
DOI : 10.1109/34.57681

P. Anandan, A computational framework and an algorithm for the measurement of visual motion, International Journal of Computer Vision, vol.2, issue.3, pp.283-310, 1989.
DOI : 10.1007/BF00158167

G. Bailly, Audiovisual speech synthesis, ETRW on Speech Synthesis, 2001.
URL : https://hal.archives-ouvertes.fr/hal-00169556

Barron, Performance of optical flow techniques, International Journal of Computer Vision, vol.12, issue.1, pp.43-77, 1994.
DOI : 10.1007/BF01420984

. Belhumeur, Eigenfaces vs. Fisherfaces: recognition using class specific linear projection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.7, pp.711-720, 1997.
DOI : 10.1109/34.598228

. Benoît, A set of French visemes for visual speech synthesis, Talking Machines: Theories, Models and Designs, pp.485-501, 1992.

M. O. Berger and R. Mohr, Towards autonomy in active contour models, Proceedings. 10th International Conference on Pattern Recognition, 1990.
DOI : 10.1109/ICPR.1990.118228

URL : https://hal.archives-ouvertes.fr/inria-00548463

Beskow, The Teleface project - Multimodal Speech Communication for the Hearing Impaired, Proc. of Eurospeech '97, 1997.

J. Brand, Visual speech for speaker recognition and robust face detection, 2001.

. Caselles, Geodesic active contours, Proceedings of IEEE International Conference on Computer Vision, pp.694-699, 1995.
DOI : 10.1109/ICCV.1995.466871

Chakraborty, Deformable boundary finding influenced by region homogeneity, Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pp.624-627, 1994.
DOI : 10.1109/cvpr.1994.323790

T. Chen and R. R. Rao, Audio-Visual Integration in Multimodal Communication, IEEE Special Issue on Multimedia Signal Processing, 1998.

C. Chibelushi, Automatic Audio-Visual Person Recognition, 1997.

G. I. Chiou and J. Hwang, Lipreading from color video, IEEE Transactions on Image Processing, vol.6, issue.8, pp.1192-1195, 1997.
DOI : 10.1109/83.605417

L. Cohen, Note on active contour models and balloons, Proc. CVGIP: Image Understanding, pp.211-218, 1991.

L. D. Cohen and I. Cohen, Finite-element methods for active contour models and balloons for 2-D and 3-D images, IEEE Transactions on Pattern Analysis and Machine Intelligence, pp.1131-1147, 1993.
DOI : 10.1109/34.244675

Cohen, Perception of Synthetic Visual Speech, in D. Stork and M. Hennecke (eds.), Speechreading by Humans and Machines, NATO ASI Series, pp.153-168, 1995.
DOI : 10.1007/978-3-662-13015-5_11

. Coianiz, 2D Deformable Models for Visual Speech Analysis, NATO Advanced Study Institute: Speech reading by Man and Machine, 1995.
DOI : 10.1007/978-3-662-13015-5_29

T. F. Cootes and C. J. Taylor, Active Shape Models - «Smart Snakes», Proc. British Machine Vision Conference, pp.266-275, 1992.

. Cootes, Building and using flexible models incorporating grey-level information, 1993 (4th) International Conference on Computer Vision, pp.242-246, 1993.
DOI : 10.1109/ICCV.1993.378212

. Cootes, Active Shape Models-Their Training and Application, Computer Vision and Image Understanding, vol.61, issue.1, pp.38-59, 1995.
DOI : 10.1006/cviu.1995.1004

. Cootes, Active appearance models, Proc. European Conference on Computer Vision, pp.484-498, 1998.

Cootes, Comparing Active Shape Models with Active Appearance Models, Proceedings of the British Machine Vision Conference 1999, pp.173-182, 1999.
DOI : 10.5244/C.13.18

E. Cosatto and H. P. Graf, Sample-based synthesis of photo-realistic talking heads, Computer Animation, pp.103-110, 1998.

P. Daubias, Modèles a posteriori de la forme et de l'apparence des lèvres pour la reconnaissance automatique de la parole audiovisuelle, Doctoral thesis, 2002.

P. Delmas, Extraction des contours de lèvres d'un visage parlant par contours actifs, Doctoral thesis, 2000.

B. Dodd and R. Campbell, Hearing by Eye: The Psychology of Lipreading, 1987.

R. D. Easton and M. Basala, Perceptual dominance during lipreading, Perception & Psychophysics, vol.32, issue.6, pp.562-570, 1982.
DOI : 10.3758/BF03204211

P. Ekman and W. Friesen, Facial action coding system, 1978.

N. Erber, Interaction of Audition and Vision in the Recognition of Oral Speech Stimuli, Journal of Speech Language and Hearing Research, vol.12, issue.2, pp.423-425, 1969.
DOI : 10.1044/jshr.1202.423

I. Essa, Analysis, Interpretation and Synthesis of Facial Expressions, 1995.

I. A. Essa and A. Pentland, A vision system for observing and extracting facial action parameters, Proc. of Computer Vision and Pattern Recognition (CVPR 94), pp.76-83, 1994.

K. Etemad and R. Chellappa, Discriminant Analysis for Recognition of Human Face Images, Proc. AVBPA, Lecture Notes in Computer Science 1206, pp.127-142, 1997.

. Eveno, A parametric model for realistic lip segmentation, 7th International Conference on Control, Automation, Robotics and Vision, 2002. ICARCV 2002., 2002.
DOI : 10.1109/ICARCV.2002.1234982

. Faruquie, Large vocabulary audio-visual speech recognition using active shape models, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000, 2000.
DOI : 10.1109/ICPR.2000.903496

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.21.4247

K. E. Finn and A. A. Montgomery, Automatic optically-based recognition of speech, Pattern Recognition Letters, vol.8, issue.3, pp.159-164, 1988.
DOI : 10.1016/0167-8655(88)90094-3

C. Fisher, Confusions Among Visually Perceived Consonants, Journal of Speech Language and Hearing Research, vol.11, issue.4, pp.796-804, 1968.
DOI : 10.1044/jshr.1104.796

D. J. Fleet and A. D. Jepson, Computation of component image velocity from local phase information, International Journal of Computer Vision, vol.5, issue.1, pp.77-104, 1990.
DOI : 10.1007/BF00056772

P. Fua and C. Brechbühler, Imposing hard constraints on soft snakes, Proc. European Conf. Computer Vision '96, pp.495-506, 1996.
DOI : 10.1007/3-540-61123-1_164

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.20.1557

. Gao, A deformable model for human organ extraction, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), 1998.
DOI : 10.1109/ICIP.1998.999022

Geiger, Dynamic programming for detecting, tracking, and matching deformable contours, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.17, issue.3, pp.294-302, 1995.
DOI : 10.1109/34.368194

A. Goldschen, Continuous optical automatic speech recognition by lipreading, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994.
DOI : 10.1109/ACSSC.1994.471517

Hammal, Extraction réaliste des traits caractéristiques du visage à l'aide de modèles paramétriques adaptés, Colloque GRETSI sur le traitement du Signal et des Images (GRETSI'03), 2003.

D. Heeger, Optical flow using spatiotemporal filters, International Journal of Computer Vision, vol.1, issue.4, pp.279-302, 1988.
DOI : 10.1007/BF00133568

. Hennecke, Using deformable templates to infer visual speech dynamics, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994.
DOI : 10.1109/ACSSC.1994.471518

E. Hildreth, The measurement of visual motion, 1984.

. Himer, Computer-Based Analysis of Facial Action: A New Approach, Journal of Psychophysiology, vol.5, issue.2, pp.189-195, 1991.

S. Horbelt and J. L. Dugelay, Active contours for lipreading - combining snakes with templates, GRETSI symposium on Signal and Image Processing, 1995.

B. Horn and B. G. Schunck, Determining optical flow, Artificial Intelligence, vol.17, issue.1-3, pp.185-204, 1981.
DOI : 10.1016/0004-3702(81)90024-2

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.185.1651

. Hsu, Face detection in color images, IEEE Trans. Pattern Analysis and Machine Intelligence, vol.24, issue.5, pp.696-706, 2002.

A. Hurlbert and T. Poggio, Synthesizing a color algorithm from examples, Science, vol.239, issue.4839, pp.482-485, 1988.
DOI : 10.1126/science.3340834

. Jourlin, Acoustic-labial speaker verification, Pattern Recognition Letters, vol.18, issue.9, pp.319-334, 1997.
DOI : 10.1016/S0167-8655(97)00070-6

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.39.2620

. Kähler, Head shop, Proceedings of the 2002 ACM SIGGRAPH/Eurographics symposium on Computer animation , SCA '02, pp.55-64, 2002.
DOI : 10.1145/545261.545271

T. Koga, Motion compensated interframe coding for video conferencing, National Telecommunications Conference, 1981.

T. Lallouache, Un poste Visage-Parole. Acquisition et traitement automatique des contours des lèvres, Doctoral thesis, 1991.

F. Lavagetto, Converting speech into lip movements: a multimedia telephone for hard of hearing people, IEEE Transactions on Rehabilitation Engineering, vol.3, issue.1, pp.1-14, 1995.
DOI : 10.1109/86.372898

B. Leroy, Modèles déformables et modèles de déformation appliqués à la reconnaissance de visages, Doctoral thesis, 1996.

F. Leymarie and M. Levine, Tracking deformable objects in the plane using an active contour model, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.15, issue.6, pp.617-634, 1993.
DOI : 10.1109/34.216733

. Li, 3-D motion estimation in model-based facial image coding, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.15, issue.6, pp.545-555, 1993.
DOI : 10.1109/34.216724

J. Lien, Automatic Recognition of Facial Expressions Using Hidden Markov Models and Estimation of Expression Intensity, 1998.

M. Lievin and F. Luthon, Unsupervised lip segmentation under natural conditions, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.3065-3068, 1999.
DOI : 10.1109/ICASSP.1999.757488

URL : https://hal.archives-ouvertes.fr/hal-00961012

. Liew, Fuzzy segmentation of lip image using cluster analysis, European Conference on Speech Communication and Technology (EUROSPEECH'99), Hungary, 1999.

B. Lucas, Generalized Image Matching by the Method of Differences, 1984.

. Lucey, Chromatic lip tracking using a connectivity based fuzzy thresholding technique, ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications (IEEE Cat. No.99EX359), 1999.
DOI : 10.1109/ISSPA.1999.815761

. Lucey, Face and lip tracking using chromatic based AVQ, 2000.

. Luettin, Active Shape Models for Visual Speech Feature Extraction, Electronic System Group Report N°95, 1995.
DOI : 10.1007/978-3-662-13015-5_28

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.416.5901

. Luettin, Visual speech recognition using active shape models and hidden Markov models, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp.817-820, 1996.
DOI : 10.1109/ICASSP.1996.543246

. Luettin, Speaker identification by lipreading, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.62-64, 1996.
DOI : 10.1109/ICSLP.1996.607030

J. Luettin, Visual Speech and Speaker Recognition, 1997.

. Lyons, Designing, Playing, and Performing with a Vision-Based Mouth Interface, Proc. Conference on New Interfaces for Musical Expression (NIME-03), pp.116-121, 2003.

K. Mase and A. Pentland, Automatic lipreading by optical-flow analysis, Systems and Computers in Japan, vol.22, issue.6, pp.67-75, 1991.
DOI : 10.1002/scj.4690220607

. Matthews, Lipreading using shape, shading and scale, Proc. Auditory-Visual Speech Processing (AVSP), pp.73-78, 1998.

. Matthews, A comparison of active shape model and scale decomposition based features for visual speech recognition, LNCS, vol.1407, pp.514-528, 1998.
DOI : 10.1007/BFb0054762

H. McGurk and J. MacDonald, Hearing lips and seeing voices, Nature, vol.264, issue.5588, pp.746-748, 1976.
DOI : 10.1038/264746a0

. Morishima, An intelligent facial image coding driven by speech and phoneme, International Conference on Acoustics, Speech, and Signal Processing, p.1795, 1989.
DOI : 10.1109/ICASSP.1989.266799

S. Morishima, Emotion model, International Workshop on Automatic Face and Gesture Recognition, pp.284-289, 1995.

Nagel, On the estimation of optical flow: Relations between different approaches and some new results, Artificial Intelligence, vol.33, issue.3, pp.299-324, 1987.
DOI : 10.1016/0004-3702(87)90041-5

. Nefian, A coupled HMM for audio-visual speech recognition, Proc. ICASSP, volume II, pp.2013-2016, 2002.

. Neuenschwander, Ziplock Snakes, International Journal of Computer Vision, vol.26, issue.3, pp.191-201, 1997.

S. Nishida, Speech recognition enhancement by lip information, ACM SIGCHI Bulletin, vol.17, issue.4, pp.198-204, 1986.
DOI : 10.1145/22339.22371

G. Bailly, Shape and appearance models of talking faces for model-based tracking, Auditory-visual Speech Processing Workshop, 2003.

. Oliver, LAFTER: a real-time face and lips tracker with facial expression recognition, Pattern Recognition, vol.33, issue.8, pp.1369-1382, 2000.
DOI : 10.1016/S0031-3203(99)00113-2

. Pantic, A hybrid approach to mouth features detection, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236), pp.1188-1193, 2001.
DOI : 10.1109/ICSMC.2001.973081

F. Parke, Parameterised models for facial animation, IEEE Computer Graphics and Applications, vol.2, pp.61-68, 1982.

. Patterson, Moving-Talker, Speaker-Independent Feature Study, and Baseline Results Using the CUAVE Multimodal Speech Corpus, EURASIP Journal on Advances in Signal Processing, vol.2002, issue.11, 2003.
DOI : 10.1155/S1110865702206101

. Petajan, An improved automatic lipreading system to enhance speech recognition, Proceedings of the SIGCHI conference on Human factors in computing systems , CHI '88, pp.19-25, 1988.
DOI : 10.1145/57167.57170

. Potamianos, Speaker independent audio-visual database for bimodal ASR, Proc. of the European Tutorial Workshop on Audio-Visual Speech Processing, 1997.

. Potamianos, Audiovisual automatic speech recognition, 2004.
DOI : 10.1017/CBO9780511843891.011

L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, 1993.

P. Radeva and E. Marti, Facial features segmentation by model-based snakes, International Conference on Computer Analysis of Images and Patterns, 1995.

. Radeva, A snake for model-based segmentation, Proceedings of IEEE International Conference on Computer Vision, 1995.
DOI : 10.1109/ICCV.1995.466854

R. R. Rao and R. M. Mersereau, On merging hidden Markov models with deformable templates, Proceedings, International Conference on Image Processing, 1995.
DOI : 10.1109/ICIP.1995.537695

Revéret, An hybrid approach to orientation-free liptracking, pp.97-117, 1997.

M. Rydfalk, CANDIDE: A Parameterized Face, Report LiTH-ISY-I-0866, 1987.

. Stork, Neural network lipreading system for improved speech recognition, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks, pp.285-295, 1992.
DOI : 10.1109/IJCNN.1992.226994

W. Sumby and I. Pollack, Visual Contribution to Speech Intelligibility in Noise, The Journal of the Acoustical Society of America, vol.26, issue.2, pp.212-215, 1954.
DOI : 10.1121/1.1907309

D. Terzopoulos and K. Waters, Physically-based facial modelling, analysis, and animation, The Journal of Visualization and Computer Animation, vol.1, issue.2, pp.73-80, 1990.
DOI : 10.1002/vis.4340010208

D. Terzopoulos and K. Waters, Analysis and synthesis of facial image sequences using physical and anatomical models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.15, issue.6, pp.569-579, 1993.
DOI : 10.1109/34.216726

. Tian, Robust Lip Tracking by Combining Shape, Color and Motion, 4th Asian Conference on Computer Vision (ACCV'00), 2000.

. Tsapatsoulis, Efficient face detection for multimedia applications, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101), 2000.
DOI : 10.1109/ICIP.2000.899289

C. Tomasi and T. Kanade, Detection and tracking of point features, 1991.

Uras, A computational approach to motion perception, Biological Cybernetics, vol.60, issue.2, pp.79-97, 1988.
DOI : 10.1007/BF00202895

F. Vignoli and C. Braccini, A text-speech synchronization technique with applications to talking heads, AVSP'99, 1999.

T. Wark and S. Sridharan, A syntactic approach to automatic lip feature extraction for speaker identification, ICASSP'98, pp.3693-3696, 1998.
DOI : 10.1109/icassp.1998.679685

. Welsh, Synthetic face generation for enhancing a user interface, Proc. Image'Com Conf, pp.177-182, 1990.

G. Wolberg, Digital Image Warping, 1990.

C. Xu and J. L. Prince, Snakes, shapes and gradient vector flow, IEEE Transactions on Image Processing, vol.7, issue.3, pp.359-369, 1998.

Y. Yacoob and L. Davis, Recognizing Human Facial Expression, 1994.

H. Yamada, Dimensions of visual information for categorizing facial expressions, Japanese Psychol. Res, vol.35, issue.4, pp.172-181, 1993.

J. Yang and A. Waibel, A Real-Time Face Tracker, Proc. of WACV'96, pp.142-147, 1996.

Zarit, Comparison of five color models in skin pixel classification, Proceedings International Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems. In Conjunction with ICCV'99 (Cat. No.PR00378), 1999.
DOI : 10.1109/RATFG.1999.799224

L. Zhang, Estimation of the mouth features using deformable templates, Proceedings of International Conference on Image Processing, pp.328-331, 1997.
DOI : 10.1109/ICIP.1997.632107

X. Zhang and R. M. Mersereau, Lip feature extraction towards an automatic speechreading system, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101), 2000.
DOI : 10.1109/ICIP.2000.899336

Eveno, Automatic and Accurate Lip Tracking, IEEE Transactions on Circuits and Systems for Video Technology, 2004.

. Eveno, Jumping snakes and parametric model for lip segmentation, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429), 2003.
DOI : 10.1109/ICIP.2003.1246818

. Eveno, Keypoints Based Segmentation of Lips, International Conference On Multimedia and Expo (ICME'02), 2002.

. Eveno, New color transformation for lips segmentation, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564), 2001.
DOI : 10.1109/MMSP.2001.962702

Eveno, Vers l'Extraction Automatique des Lèvres d'un Visage Parlant, Colloque GRETSI sur le traitement du Signal et des Images (GRETSI'01), 2001.