E. Aganj, J. P. Pons, F. Segonne, and R. Keriven, Spatio-Temporal Shape from Silhouette using Four-Dimensional Delaunay Meshing, 2007 IEEE 11th International Conference on Computer Vision, pp.1-8, 2007.
DOI : 10.1109/ICCV.2007.4409016

A. Agarwal and B. Triggs, Recovering 3D human pose from monocular images, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.1, pp.44-58, 2006.
DOI : 10.1109/TPAMI.2006.21

URL : https://hal.archives-ouvertes.fr/inria-00548619

J. K. Aggarwal and Q. Cai, Human Motion Analysis: A Review, Computer Vision and Image Understanding, vol.73, issue.3, pp.428-440, 1999.
DOI : 10.1006/cviu.1998.0744

J. K. Aggarwal and S. Park, Human motion: modeling and recognition of actions and interactions, Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004., pp.640-647, 2004.
DOI : 10.1109/TDPVT.2004.1335299

M. Ahmad and S. W. Lee, HMM-based Human Action Recognition Using Multiview Image Sequences, 18th International Conference on Pattern Recognition (ICPR'06), pp.263-266, 2006.
DOI : 10.1109/ICPR.2006.630

S. Ali, A. Basharat, and M. Shah, Chaotic Invariants for Human Action Recognition, 2007 IEEE 11th International Conference on Computer Vision, p.136, 2007.
DOI : 10.1109/ICCV.2007.4409046

J. Aloimonos, Purposive and qualitative active vision, International Conference on Pattern Recognition, pp.346-360, 1990.

J. Alon, V. Athitsos, and S. Sclaroff, Accurate and Efficient Gesture Spotting via Pruning and Subgesture Reasoning, International Workshop on Human-Computer Interaction, pp.189-198, 2005.
DOI : 10.1007/11573425_19

O. Arikan, D. A. Forsyth, and J. F. O'Brien, Motion synthesis from annotations, ACM Transactions on Graphics, vol.22, issue.3, pp.402-408, 2003.
DOI : 10.1145/882262.882284

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.133.6928

V. Athitsos and S. Sclaroff, Estimating 3D hand pose from a cluttered image, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., pp.432-439, 2003.
DOI : 10.1109/CVPR.2003.1211500

J. R. Bellegarda and D. Nahamoo, Tied mixture continuous parameter modeling for speech recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.38, issue.12, pp.2033-2045, 1990.
DOI : 10.1109/29.61531

S. Belongie and J. Malik, Matching with shape contexts, 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries, pp.20-26, 2000.
DOI : 10.1109/IVL.2000.853834

A. Bissacco, A. Chiuso, Y. Ma, and S. Soatto, Recognition of human gaits, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, pp.52-57, 2001.
DOI : 10.1109/CVPR.2001.990924

M. Blank, L. Gorelick, E. Shechtman, M. Irani, and R. Basri, Actions as space-time shapes, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1395-1402, 2005.
DOI : 10.1109/ICCV.2005.28

A. Bobick and J. Davis, An appearance-based representation of action, Proceedings of 13th International Conference on Pattern Recognition, pp.307-342, 1996.
DOI : 10.1109/ICPR.1996.546039

A. Bobick and J. Davis, Real-time recognition of activity using temporal templates, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96, pp.39-42, 1996.
DOI : 10.1109/ACV.1996.571995

A. F. Bobick and J. W. Davis, The recognition of human movement using temporal templates, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.23, issue.3, pp.257-267, 2001.
DOI : 10.1109/34.910878

A. F. Bobick and A. D. Wilson, A state-based approach to the representation and recognition of gesture, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.12, pp.1325-1337, 1997.
DOI : 10.1109/34.643892

A. Bobick and Y. Ivanov, Action recognition using probabilistic parsing, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231), pp.196-202, 1998.
DOI : 10.1109/CVPR.1998.698609

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.126.5607

R. Bodor, B. Jackson, O. Masoud, and N. Papanikolopoulos, Image-based reconstruction for view-independent human motion recognition, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453), pp.1548-1553, 2003.
DOI : 10.1109/IROS.2003.1248864

O. Boiman and M. Irani, Detecting irregularities in images and in video, IEEE International Conference on Computer Vision, pp.462-469, 2005.

M. Brand, N. Oliver, and A. Pentland, Coupled hidden Markov models for complex action recognition, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.994-999, 1997.
DOI : 10.1109/CVPR.1997.609450

M. Brand, Shadow puppetry, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1237-1244, 1999.
DOI : 10.1109/ICCV.1999.790422

M. Brand and V. Kettnaker, Discovery and segmentation of activities in video, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.8, pp.844-851, 2000.
DOI : 10.1109/34.868685

M. Bray, P. Kohli, and P. H. S. Torr, PoseCut: Simultaneous Segmentation and 3D Pose Estimation of Humans Using Dynamic Graph-Cuts, European Conference on Computer Vision, pp.642-655, 2006.
DOI : 10.1007/3-540-47977-5_5

C. Bregler, Learning and recognizing human dynamics in video sequences, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.568-574, 1997.
DOI : 10.1109/CVPR.1997.609382

L. W. Campbell and A. F. Bobick, Recognition of human body motion using phase space constraints, Proceedings of IEEE International Conference on Computer Vision, pp.624-630, 1995.
DOI : 10.1109/ICCV.1995.466880

L. W. Campbell, D. A. Becker, A. Azarbayejani, A. F. Bobick, and A. Pentland, Invariant features for 3-D gesture recognition, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp.157-163, 1996.
DOI : 10.1109/AFGR.1996.557258

C. Canton-Ferrer, J. R. Casas, and M. Pardàs, Human model and motion based 3D action recognition in multiple view scenarios, European Signal Processing Conference, 2006.

S. Carlsson and J. Sullivan, Action recognition by shape matching to key frames, Workshop on Models versus Exemplars in Computer Vision, pp.126-129, 2001.

C. Cedras and M. Shah, A survey of motion analysis from moving light displays, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR-94, pp.214-221, 1994.
DOI : 10.1109/CVPR.1994.323832

C. Cedras and M. Shah, Motion-based recognition: a survey, Image and Vision Computing, vol.13, issue.2, pp.129-155, 1995.
DOI : 10.1016/0262-8856(95)93154-K

Q. Chen, M. Defrise, and F. Deconinck, Symmetric phase-only matched filtering of Fourier-Mellin transforms for image registration and recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.16, issue.12, pp.1156-1168, 1994.
DOI : 10.1109/34.387491

I. Cohen and H. Li, Inference of human postures by classification of 3D human body shape, IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), pp.74-81, 2003.
DOI : 10.1109/AMFG.2003.1240827

D. Cremers, T. Kohlberger, and C. Schnörr, Nonlinear shape statistics in Mumford-Shah based segmentation, European Conference on Computer Vision, pp.93-108, 2002.

R. Cutler and M. Turk, View-based interpretation of real-time optical flow for gesture recognition, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition, pp.416-421, 1998.
DOI : 10.1109/AFGR.1998.670984

J. Cutting and L. Kozlowski, Recognizing friends by their walk: Gait perception without familiarity cues, Bulletin of the Psychonomic Society, vol.18, issue.6, pp.353-356, 1977.
DOI : 10.3758/BF03337021

F. Cuzzolin, A. Sarti, and S. Tubaro, Action modeling with volumetric data, 2004 International Conference on Image Processing, 2004. ICIP '04., pp.881-884, 2004.
DOI : 10.1109/ICIP.2004.1419440

T. Darrell and A. Pentland, Space-time gestures, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp.335-340, 1993.
DOI : 10.1109/CVPR.1993.341109

J. W. Davis, Hierarchical motion history images for recognizing human motion, Proceedings IEEE Workshop on Detection and Recognition of Events in Video, pp.39-46, 2001.
DOI : 10.1109/EVENT.2001.938864

J. Deutscher, A. Blake, and I. Reid, Articulated body motion capture by annealed particle filtering, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.126-133, 2000.
DOI : 10.1109/CVPR.2000.854758

P. Dollar, V. Rabaud, G. Cottrell, and S. Belongie, Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp.65-72, 2005.
DOI : 10.1109/VSPETS.2005.1570899

A. A. Efros, A. Berg, G. Mori, and J. Malik, Recognizing action at a distance, Proceedings Ninth IEEE International Conference on Computer Vision, pp.726-733, 2003.
DOI : 10.1109/ICCV.2003.1238420

A. M. Elgammal, V. D. Shet, Y. Yacoob, and L. S. Davis, Learning dynamics for exemplar-based gesture recognition, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., pp.571-578, 2003.
DOI : 10.1109/CVPR.2003.1211405

P. Felzenszwalb and D. Huttenlocher, Efficient matching of pictorial structures, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.66-73, 2000.
DOI : 10.1109/CVPR.2000.854739

Z. Feng and T. J. Cham, Video-based human action classification with ambiguous correspondences, IEEE Conference on Computer Vision and Pattern Recognition, pp.82-54, 2005.

M. Fischler and R. Elschlager, The Representation and Matching of Pictorial Structures, IEEE Transactions on Computers, vol.22, issue.1, pp.67-92, 1973.
DOI : 10.1109/T-C.1973.223602

W. Forstner and E. Gulch, A fast operator for detection and precise location of distinct points, corners and centres of circular features, Intercommission Conference on Fast Processing of Photogrammetric Data, pp.281-305, 1987.

D. Forsyth, Human motion tutorial: Activity recognition, Tutorial, vol.5, p.20, 2006.

D. Forsyth, O. Arikan, L. Ikemoto, J. O'Brien, and D. Ramanan, Computational Studies of Human Motion: Part 1, Tracking and Motion Synthesis, Foundations and Trends in Computer Graphics and Vision, vol.1, issue.2/3, pp.77-254, 2005.
DOI : 10.1561/0600000005

Y. Freund and R. E. Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, European Conference on Computational Learning Theory, pp.23-37, 1995.
DOI : 10.1006/jcss.1997.1504

B. J. Frey and N. Jojic, Learning graphical models of images, videos and their spatial transformations, UAI, pp.184-191, 2000.

D. M. Gavrila and L. S. Davis, 3-D model-based tracking of humans in action: a multi-view approach, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p.73, 1996.
DOI : 10.1109/CVPR.1996.517056

D. M. Gavrila, The Visual Analysis of Human Movement: A Survey, Computer Vision and Image Understanding, vol.73, issue.1, pp.82-98, 1999.
DOI : 10.1006/cviu.1998.0716

D. Gavrila and L. Davis, Towards 3-d model-based tracking and recognition of human movement, International Workshop on Face and Gesture Recognition, pp.272-277, 1995.

D. Gavrila and V. Philomin, Real-time object detection for "smart" vehicles, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.87-93, 1999.
DOI : 10.1109/ICCV.1999.791202

Z. Ghahramani, Learning dynamic Bayesian networks, Lecture Notes in Computer Science, vol.1387, pp.168-197, 1998.
DOI : 10.1007/BFb0053999

N. H. Goddard, The interpretation of visual motion: recognizing moving light displays, [1989] Proceedings. Workshop on Visual Motion, pp.212-220, 1989.
DOI : 10.1109/WVM.1989.47112

N. H. Goddard, The Perception of Articulated Motion: Recognizing Moving Light Displays, p.33, 1992.

A. E. Grace and M. Spann, A comparison between Fourier-Mellin descriptors and moment based features for invariant object recognition using neural networks, Pattern Recognition Letters, vol.12, issue.10, pp.635-643, 1991.
DOI : 10.1016/0167-8655(91)90018-H

G. H. Granlund, Fourier Preprocessing for Hand Print Character Recognition, IEEE Transactions on Computers, vol.21, issue.2, pp.195-201, 1972.
DOI : 10.1109/TC.1972.5008926

R. D. Green and L. Guan, Quantifying and Recognizing Human Movement Patterns From Monocular Video Images - Part I: A New Framework for Modeling Human Motion, IEEE Transactions on Circuits and Systems for Video Technology, vol.14, issue.2, pp.179-190, 2004.
DOI : 10.1109/TCSVT.2003.821976

A. Gritai, Y. Sheikh, and M. Shah, On the use of anthropometry in the invariant analysis of human actions, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., pp.923-926, 2004.
DOI : 10.1109/ICPR.2004.1334410

G. Guerra-Filho and Y. Aloimonos, A Language for Human Action, Computer, vol.40, issue.5, pp.42-51, 2007.
DOI : 10.1109/MC.2007.154

Y. Guo, G. Xu, and S. Tsuji, Understanding human motion patterns, Proceedings of the 12th IAPR International Conference on Pattern Recognition (Cat. No.94CH3440-5), pp.325-329, 1994.
DOI : 10.1109/ICPR.1994.576929

Y. Guo, Y. Shan, H. Sawhney, and R. Kumar, PEET: Prototype Embedding and Embedding Transition for Matching Vehicles over Disparate Viewpoints, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383069

I. Guyon and A. Elisseeff, An introduction to variable and feature selection, Journal of Machine Learning Research, vol.3, issue.102, pp.1157-1182, 2003.

C. Harris and M. Stephens, A Combined Corner and Edge Detector, Proceedings of the Alvey Vision Conference 1988, pp.147-152, 1988.
DOI : 10.5244/C.2.23

D. Heesch and S. M. Rueger, Combining Features for Content-Based Sketch Retrieval - A Comparative Evaluation of Retrieval Performance, Proceedings of the 24th BCS-IRSG European Colloquium on IR Research, pp.41-52, 2002.
DOI : 10.1007/3-540-45886-7_3

D. Hogg, Model-based vision: a program to see a walking person, Image and Vision Computing, vol.1, issue.1, pp.5-20, 1983.
DOI : 10.1016/0262-8856(83)90003-3

M. K. Hu, Visual pattern recognition by moment invariants, IRE Transactions on Information Theory, vol.8, issue.63, pp.179-187, 1962.

N. Ikizler and D. Forsyth, Searching Video for Complex Activities with Finite State Models, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383168

Y. A. Ivanov and A. F. Bobick, Recognition of visual activities and interactions by stochastic parsing, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.8, pp.852-872, 2000.
DOI : 10.1109/34.868686

O. Jenkins and M. Mataric, Deriving action and behavior primitives from human motion data, IEEE/RSJ International Conference on Intelligent Robots and System, pp.2551-2556, 2002.
DOI : 10.1109/IRDS.2002.1041654

H. Jhuang, T. Serre, L. Wolf, and T. Poggio, A biologically inspired system for action recognition, IEEE International Conference on Computer Vision, pp.132-146, 2007.

G. Johansson, Visual perception of biological motion and a model for its analysis, Perception & Psychophysics, vol.4, issue.2, pp.201-211, 1973.
DOI : 10.3758/BF03212378

G. H. John, R. Kohavi, and K. Pfleger, Irrelevant Features and the Subset Selection Problem, ICML, pp.121-129, 1994.
DOI : 10.1016/B978-1-55860-335-6.50023-4

N. Jojic, N. Petrovic, B. Frey, and T. Huang, Transformed hidden Markov models: estimating mixture models of images and inferring spatial transformations in video sequences, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.26-33, 2000.
DOI : 10.1109/CVPR.2000.854728

I. Junejo, E. Dexter, I. Laptev, and P. Pérez, Cross-View Action Recognition from Temporal Self-similarities, European Conference on Computer Vision, p.28, 2008.
DOI : 10.1007/978-3-540-88688-4_22

URL : https://hal.archives-ouvertes.fr/inria-00289708

K. Kahol, P. Tripathi, S. Panchanathan, and T. Rikakis, Gesture segmentation in complex motion sequences, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429), pp.105-113, 2003.
DOI : 10.1109/ICIP.2003.1246627

M. Kazhdan, Shape Representations and Algorithms for 3D Model Retrieval, p.44, 2004.

M. Kazhdan, T. Funkhouser, and S. Rusinkiewicz, Rotation invariant spherical harmonic representation of 3D shape descriptors, Eurographics Symposium on Geometry Processing, p.66, 2003.

Y. Ke, R. Sukthankar, and M. Hebert, Efficient visual event detection using volumetric features, IEEE International Conference on Computer Vision, pp.166-173, 2005.

Y. Ke, R. Sukthankar, and M. Hebert, Event Detection in Crowded Videos, 2007 IEEE 11th International Conference on Computer Vision, p.57, 2007.
DOI : 10.1109/ICCV.2007.4409011

D. Knossow, R. Ronfard, and R. P. Horaud, Human Motion Tracking with a Kinematic Parameterization of Extremal Contours, International Journal of Computer Vision, vol.48, issue.1, pp.247-269, 2008.
DOI : 10.1007/s11263-007-0116-2

URL : https://hal.archives-ouvertes.fr/inria-00104098

R. Kohavi and G. H. John, Wrappers for feature subset selection, Artificial Intelligence, vol.97, issue.1-2, pp.273-324, 1997.
DOI : 10.1016/S0004-3702(97)00043-X

A. Kojima, T. Tamura, and K. Fukunaga, Natural language description of human activities from video images based on concept hierarchy of actions, International Journal of Computer Vision, vol.50, issue.2, pp.171-184, 2002.
DOI : 10.1023/A:1020346032608

L. Kozlowski and J. Cutting, Recognizing the sex of a walker from a dynamic point-light display, Perception & Psychophysics, vol.38, issue.6, pp.575-580, 1977.
DOI : 10.3758/BF03198740

I. Laptev and T. Lindeberg, Space-time interest points, IEEE International Conference on Computer Vision, pp.432-439, 2003.
DOI : 10.1109/iccv.2003.1238378

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.4359

I. Laptev and P. Pérez, Retrieving actions in movies, 2007 IEEE 11th International Conference on Computer Vision, p.41, 2007.
DOI : 10.1109/ICCV.2007.4409105

A. Laurentini, The visual hull concept for silhouette-based image understanding, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.16, issue.2, pp.150-162, 1994.
DOI : 10.1109/34.273735

H. K. Lee and J. Kim, An HMM-based threshold model approach for gesture recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.21, issue.56, pp.961-973, 1999.

L. J. Li and L. Fei-Fei, What, where and who? Classifying events by scene and object recognition, 2007 IEEE 11th International Conference on Computer Vision, pp.1-8, 2007.
DOI : 10.1109/ICCV.2007.4408872

J. Liu and M. Shah, Learning human actions via information maximization, IEEE Conference on Computer Vision and Pattern Recognition, p.28, 2008.

C. Lo and H. Don, 3-D moment forms: their construction and application to object identification and positioning, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.11, issue.10, pp.1053-1064, 1989.
DOI : 10.1109/34.42836

D. G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

F. Lv and R. Nevatia, Single View Human Action Recognition using Key Pose Matching and Viterbi Path Searching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383131

F. Lv and R. Nevatia, Recognition and Segmentation of 3-D Human Action Using HMM and Multi-class AdaBoost, European Conference on Computer Vision, pp.359-372, 2006.
DOI : 10.1007/11744085_28

D. Marr and H. K. Nishihara, Representation and Recognition of the Spatial Organization of Three-Dimensional Shapes, Proceedings of the Royal Society B: Biological Sciences, vol.200, issue.1140, pp.269-294, 1978.
DOI : 10.1098/rspb.1978.0020

D. Marr and L. Vaina, Representation and Recognition of the Movements of Shapes, Proceedings of the Royal Society B: Biological Sciences, vol.214, issue.1197, pp.501-524, 1982.
DOI : 10.1098/rspb.1982.0024

O. Masoud and N. Papanikolopoulos, A method for human action recognition, Image and Vision Computing, vol.21, issue.8, pp.729-743, 2003.
DOI : 10.1016/S0262-8856(03)00068-4

S. McCloud, Understanding Comics: The Invisible Art, p.106, 1993.

H. Meng, N. Pears, and C. Bailey, A Human Action Recognition System for Embedded Computer Vision Application, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-6, 2007.
DOI : 10.1109/CVPR.2007.383420

T. Minka, Exemplar-based likelihoods using the pdf projection theorem, 2004.

T. B. Moeslund and E. Granum, A Survey of Computer Vision-Based Human Motion Capture, Computer Vision and Image Understanding, vol.81, issue.3, pp.231-268, 2001.
DOI : 10.1006/cviu.2000.0897

T. B. Moeslund, A. Hilton, and V. Krüger, A survey of advances in vision-based human motion capture and analysis, Computer Vision and Image Understanding, vol.104, issue.2-3, pp.90-126, 2006.
DOI : 10.1016/j.cviu.2006.08.002

L. P. Morency, A. Quattoni, and T. Darrell, Latent-Dynamic Discriminative Models for Continuous Gesture Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383299

P. Morguet and M. Lang, Spotting dynamic hand gestures in video image sequences using hidden Markov models, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), pp.193-197, 1998.
DOI : 10.1109/ICIP.1998.999009

N. Nguyen, D. Phung, S. Venkatesh, and H. Bui, Learning and Detecting Activities from Movement Trajectories Using the Hierarchical Hidden Markov Models, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.955-960, 2005.
DOI : 10.1109/CVPR.2005.203

J. C. Niebles, H. Wang, and L. Fei-Fei, Unsupervised learning of human action categories using spatial-temporal words, British Machine Vision Conference, pp.1249-1287, 2006.

J. C. Niebles and L. Fei-Fei, A Hierarchical Model of Shape and Appearance for Human Action Classification, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383132

S. Niyogi and E. Adelson, Analyzing and recognizing walking figures in XYT, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition CVPR-94, pp.469-474, 1994.
DOI : 10.1109/CVPR.1994.323868

S. Nowozin, G. Bakir, and K. Tsuda, Discriminative Subsequence Mining for Action Classification, 2007 IEEE 11th International Conference on Computer Vision, p.44, 2007.
DOI : 10.1109/ICCV.2007.4409049

A. Ogale, A. Karapurkar, G. Guerra-Filho, and Y. Aloimonos, View-invariant identification of pose sequences for action recognition, VACE, pp.40-50, 2004.

A. S. Ogale, A. Karapurkar, and Y. Aloimonos, View-Invariant Modeling and Recognition of Human Actions Using Grammars, Workshop on Dynamical Vision, pp.115-126, 2005.
DOI : 10.1007/978-3-540-70932-9_9

N. M. Oliver, B. Rosario, and A. Pentland, A Bayesian computer vision system for modeling human interactions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.8, pp.831-843, 2000.
DOI : 10.1109/34.868684

P. J. Otterloo, A contour-oriented approach to shape analysis, 1991.

V. Parameswaran and R. Chellappa, View invariants for human action recognition, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., pp.613-632, 2003.
DOI : 10.1109/CVPR.2003.1211523

V. Parameswaran and R. Chellappa, Human action-recognition using mutual invariants, Computer Vision and Image Understanding, vol.98, issue.2, pp.295-325, 2005.
DOI : 10.1016/j.cviu.2004.09.002

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.89.1326

V. Parameswaran and R. Chellappa, View Invariance for Human Action Recognition, International Journal of Computer Vision, vol.36, issue.3, pp.83-101, 2006.
DOI : 10.1007/s11263-005-3671-4

S. Park and J. K. Aggarwal, Recognition of two-person interactions using a hierarchical Bayesian network, First ACM SIGMM international workshop on Video surveillance , IWVS '03, pp.65-76, 2003.
DOI : 10.1145/982452.982461

P. Peursum, H. Bui, S. Venkatesh, and G. West, Human action segmentation via controlled use of missing data in HMMs, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., pp.440-445, 2004.
DOI : 10.1109/ICPR.2004.1333797

P. Peursum, G. West, and S. Venkatesh, Combining image regions and human activity for indirect object recognition in indoor wide-angle views, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.82-89, 2005.
DOI : 10.1109/ICCV.2005.57

P. Peursum, S. Venkatesh, and G. West, Tracking-as-recognition for articulated full-body human motion analysis, IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.

M. Pierobon, M. Marcon, A. Sarti, and S. Tubaro, 3-D Body Posture Tracking For Human Action Template Matching, 2006 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings, p.47, 2006.
DOI : 10.1109/ICASSP.2006.1660389

R. Polana and R. Nelson, Detecting activities, Proc. CVPR '93. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2-7, 1993.

R. Polana and R. Nelson, Low level recognition of human motion (or how to get your man without finding his body parts), Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects, p.37, 1994.
DOI : 10.1109/MNRAO.1994.346251

R. Polana and R. Nelson, Recognition of motion from temporal texture, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.129-134, 1992.
DOI : 10.1109/CVPR.1992.223216

R. Poppe and M. Poel, Comparison of Silhouette Shape Descriptors for Example-based Human Pose Recovery, 7th International Conference on Automatic Face and Gesture Recognition (FGR06), pp.541-546, 2006.
DOI : 10.1109/FGR.2006.32

L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.267-296, 1990.

D. Ramanan and D. A. Forsyth, Automatic annotation of everyday movements, p.34, 2003.

C. Rao, A. Gritai, M. Shah, and T. Syeda-Mahmood, View-invariant alignment and matching of video sequences, Proceedings Ninth IEEE International Conference on Computer Vision, pp.939-945, 2003.
DOI : 10.1109/ICCV.2003.1238449

C. Rao, M. Shah, and T. Syeda-Mahmood, Invariance in motion analysis of videos, Proceedings of the eleventh ACM international conference on Multimedia, MULTIMEDIA '03, pp.518-527, 2003.
DOI : 10.1145/957013.957125

C. Rao, A. Yilmaz, and M. Shah, View-invariant representation and recognition of actions, International Journal of Computer Vision, vol.50, issue.2, pp.203-226, 2002.
DOI : 10.1023/A:1020350100748

J. Rittscher and A. Blake, Classification of human body motion, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.634-639, 1999.
DOI : 10.1109/ICCV.1999.791284

N. Robertson and I. Reid, Behaviour understanding in video: a combined method, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.808-815, 2005.
DOI : 10.1109/ICCV.2005.47

G. Rogez, J. Guerrero, J. Martinez-del-Rincon, and C. Orrite-Urunuela, Viewpoint Independent Human Motion Analysis in Man-made Environments, Proceedings of the British Machine Vision Conference 2006, pp.659-704, 2006.
DOI : 10.5244/C.20.68

M. C. Roh, H. K. Shin, S. W. Lee, and S. W. Lee, Volume motion template for view-invariant gesture recognition, International Conference on Pattern Recognition, pp.1229-1232, 2006.

J. Rohlicek, W. Russell, S. Roukos, and H. Gish, Continuous hidden Markov modeling for speaker-independent word spotting, International Conference on Acoustics, Speech, and Signal Processing, pp.627-630, 1989.
DOI : 10.1109/ICASSP.1989.266505

K. Rohr, Towards model-based recognition of human movements in image sequences, Graphical Model and Image Processing, vol.59, issue.1, pp.94-115, 1994.

R. Rosales and S. Sclaroff, Inferring body pose without tracking body parts, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.721-727, 2000.
DOI : 10.1109/CVPR.2000.854946

C. Rose, M. Cohen, and B. Bodenheimer, Verbs and adverbs: multidimensional motion interpolation, IEEE Computer Graphics and Applications, vol.18, issue.5, pp.32-40, 1998.
DOI : 10.1109/38.708559

R. Rose and D. Paul, A hidden Markov model based keyword recognition system, International Conference on Acoustics, Speech, and Signal Processing, pp.129-132, 1990.
DOI : 10.1109/ICASSP.1990.115555

J. M. Rubin and W. A. Richards, Boundaries of visual motion, Massachusetts Institute of Technology, vol.53, p.81, 1985.

Y. Rui and P. Anandan, Segmenting visual actions based on spatio-temporal motion patterns, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.1111-1118, 2000.
DOI : 10.1109/CVPR.2000.855807

F. Sadjadi and E. Hall, Three-Dimensional Moment Invariants, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.2, issue.2, pp.127-136, 1980.
DOI : 10.1109/TPAMI.1980.4766990

H. Sakoe and S. Chiba, Dynamic programming algorithm optimization for spoken word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.26, issue.1, pp.43-49, 1978.
DOI : 10.1109/TASSP.1978.1163055

C. Schmid, R. Mohr, and C. Bauckhage, Evaluation of interest point detectors, International Journal of Computer Vision, vol.37, issue.2, pp.151-172, 2000.
DOI : 10.1023/A:1008199403446

URL : https://hal.archives-ouvertes.fr/inria-00548302

C. Schuldt, I. Laptev, and B. Caputo, Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., pp.32-36, 2004.
DOI : 10.1109/ICPR.2004.1334462

P. Scovanner, S. Ali, and M. Shah, A 3-dimensional SIFT descriptor and its application to action recognition, Proceedings of the 15th international conference on Multimedia, MULTIMEDIA '07, pp.357-360, 2007.
DOI : 10.1145/1291233.1291311

S. M. Seitz and C. R. Dyer, View-invariant analysis of cyclic motion, International Journal of Computer Vision, vol.25, issue.3, pp.231-251, 1997.
DOI : 10.1023/A:1007928103394

M. Sheikh and M. Shah, Exploring the space of a human action, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.144-149, 2005.
DOI : 10.1109/ICCV.2005.90

D. Shen and H. H. Ip, Discriminative wavelet shape descriptors for recognition of 2-D patterns, Pattern Recognition, vol.32, issue.2, pp.151-165, 1999.
DOI : 10.1016/S0031-3203(98)00137-X

H. Sidenbladh, M. J. Black, and D. J. Fleet, Stochastic Tracking of 3D Human Figures Using 2D Image Motion, European Conference on Computer Vision, pp.702-718, 2000.
DOI : 10.1007/3-540-45053-X_45

C. Sminchisescu, A. Kanaujia, Z. Li, and D. Metaxas, Conditional models for contextual human motion recognition, IEEE International Conference on Computer Vision, pp.1808-1815, 2005.

C. Sminchisescu and B. Triggs, Covariance scaled sampling for monocular 3D body tracking, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, pp.447-454, 2001.
DOI : 10.1109/CVPR.2001.990509

URL : https://hal.archives-ouvertes.fr/inria-00548273

P. Smith, N. da Vitoria Lobo, and M. Shah, TemporalBoost for event recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.733-740, 2005.
DOI : 10.1109/ICCV.2005.234

T. Starner and A. Pentland, Real-time American Sign Language recognition from video using hidden Markov models, International Symposium on Computer Vision, pp.265-270, 1995.

S. Sumi, Upside-down Presentation of the Johansson Moving Light-Spot Pattern, Perception, vol.13, issue.3, pp.283-286, 1984.
DOI : 10.1068/p130283

D. L. Swets and J. Weng, Using discriminant eigenfeatures for image retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.18, issue.8, pp.831-836, 1996.
DOI : 10.1109/34.531802

T. Syeda-Mahmood, M. Vasilescu, and S. Sethi, Recognizing action events from multiple viewpoints, Proceedings IEEE Workshop on Detection and Recognition of Events in Video, pp.64-72, 2001.
DOI : 10.1109/EVENT.2001.938868

J. B. Tenenbaum, V. de Silva, and J. C. Langford, A Global Geometric Framework for Nonlinear Dimensionality Reduction, Science, vol.290, issue.5500, pp.2319-2323, 2000.
DOI : 10.1126/science.290.5500.2319

C. Thurau, Behavior Histograms for Action Recognition and Human Detection, Workshop on Human Motion Understanding, Modeling, Capture and Animation, pp.299-312, 2007.
DOI : 10.1007/978-3-540-75703-0_21

C. Tomasi and T. Kanade, Shape and motion from image streams under orthography: a factorization method, International Journal of Computer Vision, vol.4, issue.1, pp.137-154, 1992.
DOI : 10.1007/BF00129684

K. Toyama and A. Blake, Probabilistic tracking in a metric space, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, pp.50-59, 2001.
DOI : 10.1109/ICCV.2001.937599

V. Vapnik, Statistical Learning Theory, 1998.

M. Vasilescu, Human motion signatures: analysis, synthesis, recognition, Object recognition supported by user interaction for service robots, pp.456-460, 2002.
DOI : 10.1109/ICPR.2002.1047975

A. Veeraraghavan, R. Chellappa, and A. K. Roy-Chowdhury, The Function Space of an Activity, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), pp.959-968, 2006.
DOI : 10.1109/CVPR.2006.304

S. Vitaladevuni, V. Kellokumpu, and L. S. Davis, Action recognition using ballistic dynamics, IEEE Conference on Computer Vision and Pattern Recognition, 2008.

L. Wang and D. Suter, Recognizing human activities from silhouettes: Motion subspace and factorial discriminative graphical model, IEEE Conference on Computer Vision and Pattern Recognition, 2007.

S. B. Wang, A. Quattoni, L. P. Morency, D. Demirdjian, and T. Darrell, Hidden conditional random fields for gesture recognition, IEEE Conference on Computer Vision and Pattern Recognition, pp.1521-1527, 2006.

T. S. Wang, H. Y. Shum, Y. Q. Xu, and N. N. Zheng, Unsupervised Analysis of Human Gestures, IEEE Pacific Rim Conference on Multimedia, pp.174-181, 2001.
DOI : 10.1007/3-540-45453-5_23

Y. Wang, H. Jiang, M. Drew, Z. N. Li, and G. Mori, Unsupervised Discovery of Action Classes, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.1654-1661, 2006.
DOI : 10.1109/CVPR.2006.321

Y. Wang, P. Sabzmeydani, and G. Mori, Semi-Latent Dirichlet Allocation: A Hierarchical Model for Human Action Recognition, Workshop on Human Motion Understanding, Modeling, Capture and Animation, p.44, 2007.
DOI : 10.1007/978-3-540-75703-0_17

A. R. Webb, Statistical Pattern Recognition, p.72, 2002.

D. Weinland and E. Boyer, Action recognition using exemplar-based embedding, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.29, 2008.
DOI : 10.1109/CVPR.2008.4587731

URL : https://hal.archives-ouvertes.fr/inria-00590256

D. Weinland, E. Boyer, and R. Ronfard, Action Recognition from Arbitrary Views using 3D Exemplars, 2007 IEEE 11th International Conference on Computer Vision, p.102, 2007.
DOI : 10.1109/ICCV.2007.4408849

URL : https://hal.archives-ouvertes.fr/inria-00544741

D. Weinland, R. Ronfard, and E. Boyer, Free viewpoint action recognition using motion history volumes, IEEE International Workshop on modeling People and Human Interaction, p.69, 2005.
DOI : 10.1016/j.cviu.2006.07.013

URL : https://hal.archives-ouvertes.fr/inria-00544629

D. Weinland, R. Ronfard, and E. Boyer, Automatic Discovery of Action Taxonomies from Multiple Views, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), p.29, 2006.
DOI : 10.1109/CVPR.2006.65

URL : https://hal.archives-ouvertes.fr/inria-00590216

D. Weinland, R. Ronfard, and E. Boyer, Free viewpoint action recognition using motion history volumes, Computer Vision and Image Understanding, vol.104, issue.2-3, pp.249-257, 2006.
DOI : 10.1016/j.cviu.2006.07.013

URL : https://hal.archives-ouvertes.fr/inria-00544629

A. Wilson and A. Bobick, Learning visual behavior for gesture analysis, Proceedings of International Symposium on Computer Vision, ISCV, pp.229-234, 1995.
DOI : 10.1109/ISCV.1995.477006

A. D. Wilson and A. F. Bobick, Parametric hidden Markov models for gesture recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.21, issue.9, pp.884-900, 1999.
DOI : 10.1109/34.790429

S. F. Wong, T. K. Kim, and R. Cipolla, Learning Motion Categories using both Semantic and Structural Information, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-6, 2007.
DOI : 10.1109/CVPR.2007.383332

Y. Yacoob and M. Black, Parameterized modeling and recognition of activities, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), pp.120-127, 1998.
DOI : 10.1109/ICCV.1998.710709

J. Yamato, J. Ohya, and K. Ishii, Recognizing human action in time-sequential images using hidden Markov model, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.379-385, 1992.
DOI : 10.1109/CVPR.1992.223161

P. Yan, S. M. Khan, and M. Shah, Learning 4D action feature models for arbitrary view action recognition, IEEE Conference on Computer Vision and Pattern Recognition, p.28, 2008.

M. H. Yang and N. Ahuja, Recognizing Hand Gestures Using Motion Trajectories, IEEE Conference on Computer Vision and Pattern Recognition, pp.472-512, 1999.
DOI : 10.1007/978-1-4615-1423-7_3

A. Yilmaz and M. Shah, Actions Sketch: A Novel Action Representation, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.984-989, 2005.
DOI : 10.1109/CVPR.2005.58

A. Yilmaz and M. Shah, Recognizing human actions in videos acquired by uncalibrated moving cameras, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.150-157, 2005.
DOI : 10.1109/ICCV.2005.201

C. T. Zahn and R. Z. Roskies, Fourier Descriptors for Plane Closed Curves, IEEE Transactions on Computers, vol.21, issue.3, pp.269-281, 1972.
DOI : 10.1109/TC.1972.5008949

L. Zelnik-Manor and M. Irani, Event-based analysis of video, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, p.55, 2001.
DOI : 10.1109/CVPR.2001.990935

J. Zhang and Y. Zhuang, View-Independent Human Action Recognition by Action Hypersphere in Nonlinear Subspace, IEEE Pacific Rim Conference on Multimedia, pp.108-117, 2007.
DOI : 10.1007/978-3-540-77255-2_13

L. Zhao and L. Davis, Closely coupled object detection and segmentation, IEEE International Conference on Computer Vision, pp.454-461, 2005.

T. Zhao and R. Nevatia, 3D tracking of human locomotion: a tracking as recognition approach, Object recognition supported by user interaction for service robots, pp.546-551, 2002.
DOI : 10.1109/ICPR.2002.1044790

H. Zhong, J. Shi, and M. Visontai, Detecting unusual activity in video, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., pp.819-826, 2004.
DOI : 10.1109/CVPR.2004.1315249