]. Aggarwal-2011, M. Aggarwal, and . Ryoo, Human activity analysis, ACM Computing Surveys, vol.43, issue.3, pp.1-43, 2011.
DOI : 10.1145/1922649.1922653

R. Massimiliano-albanese, V. Chellappa, A. Moscato, V. Picariello, P. Subrahmanian et al., A Constrained Probabilistic Petri Net Framework for Human Activity Detection in Video, IEEE Transactions on Multimedia, vol.10, issue.6, pp.982-996, 2008.
DOI : 10.1109/TMM.2008.2001369

]. Allen, Maintaining knowledge about temporal intervals, Communications of the ACM, vol.26, issue.11, pp.832-843, 1983.
DOI : 10.1145/182.358434

F. James, G. Allen, and . Ferguson, Actions and Events in Interval Temporal Logic, Journal of Logic and Computation, vol.4, issue.5, pp.531-579, 1994.

]. G. Antonini-2004, J. Antonini, and . Thiran, Trajectories clustering in ICA space: an application to automatic counting of pedestrians in video sequences, ACIVS 2004, EPFL, 2004.

A. Basharat, A. Gritai, and M. Shah, Learning object motion patterns for anomaly detection and improved object detection, 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2008.
DOI : 10.1109/CVPR.2008.4587510

. Blank, . Gorelick, M. Shechtman, R. Irani, and . Basri, Actions as space-time shapes, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1395-1402, 2005.
DOI : 10.1109/ICCV.2005.28

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.100.8218

M. David, . Blei, I. Michael, . Jordan, L. Thomas et al., Hierarchical Topic Models and the Nested Chinese Restaurant Process, Advances in Neural Information Processing Systems, pp.1-8

M. David and . Blei, Introduction to Probabilistic Topic Models, Communications of the ACM, vol.2011, pp.1-16, 2011.

F. Aaron, A. D. Bobick, and . Wilson, A State-Based Approach to the Representation and Recognition of Gesture, IEEE Trans. Pattern Anal. Mach. Intell, vol.19, issue.12, pp.1325-1337, 1997.

]. Bobick and J. W. Davis, The recognition of human movement using temporal templates, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.23, issue.3, pp.257-267, 2001.
DOI : 10.1109/34.910878

A. Borzin, E. Rivlin, and M. Rudzsky, Borzin:kx, International Workshop on Content-Based Multimedia Indexing, pp.33-39, 2007.

. Brdiczka, J. Langet, J. L. Maisonnasse, and . Crowley, Detecting Human Behavior Models From Multimodal Observation in a Smart Home, IEEE Transactions on Automation Science and Engineering, vol.6, issue.4, pp.588-597, 2009.
DOI : 10.1109/TASE.2008.2004965

R. Calderara, A. Cucchiara, and . Prati, Detection of abnormal behaviors using a mixture of Von Mises distributions, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance, 2007.
DOI : 10.1109/AVSS.2007.4425300

]. Campbell and A. Bobick, Recognition of human body motion using phase space constraints, Proceedings of IEEE International Conference on Computer Vision, pp.624-630, 1995.
DOI : 10.1109/ICCV.1995.466880

. Carreira-perpinan, Gaussian Mean-Shift Is an EM Algorithm, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.5, pp.767-776, 2007.
DOI : 10.1109/TPAMI.2007.1057

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.87.1376

P. Comaniciu and . Meer, Mean shift: a robust approach toward feature space analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.5, pp.603-619, 2002.
DOI : 10.1109/34.1000236

]. Ventura, J. Silva, and . Ferreira, Ranking and Extraction of Relevant Single Words in Text, pp.1-20, 2011.
DOI : 10.5772/6035

P. Dai, H. Di, L. Dong, L. Tao, and G. Xu, Group Interaction Analysis in Dynamic Context. Systems, Man, and Cybernetics, Part B IEEE Transactions on, vol.38, issue.1, pp.275-282, 2008.

D. Damen and . Hogg, Recognizing linked events: Searching the space of feasible explanations, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.927-934, 2009.
DOI : 10.1109/CVPR.2009.5206636

. Dollar, G. Rabaud, S. Cottrell, and . Belongie, Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp.65-72, 2005.
DOI : 10.1109/VSPETS.2005.1570899

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.77.5712

. Efros, . Berg, and M. Mori, Recognizing action at a distance, Proceedings Ninth IEEE International Conference on Computer Vision, pp.726-733, 2003.
DOI : 10.1109/ICCV.2003.1238420

]. Emonet, J. Varadarajan, and J. Odobez, EmonetVaradarajanOdobez-CVPR-2011, Computer Vision and Pattern Recognition, vol.25, pp.1-8, 2011.

I. Faisal and . Bashir, Real-time motion trajectory-based indexing and retrieval of video sequences @ARTICLEBashir07real-timemotion, author = Faisal I. Bashir and Student Member and Ashfaq A. Khokhar and Senior Member and Dan Schonfeld and Senior Member, title = Real-time motion trajectorybased indexing and retrieval of video sequences, journal = IEEE Trans. Multimedia , year =, IEEE Trans. Multimedia, vol.9, issue.9, pp.58-65, 2007.

. Fashing, Mean shift is a bound optimization. Pattern Analysis and Machine, 2005.

L. Fukunaga and . Hostetler, The estimation of the gradient of a density function, with applications in pattern recognition, IEEE Transactions on Information Theory, vol.21, issue.1, pp.32-40, 1975.
DOI : 10.1109/TIT.1975.1055330

A. Galata, N. Johnson, and D. Hogg, Learning Variable-Length Markov Models of Behavior, Computer Vision and Image Understanding, vol.81, issue.3, pp.398-413, 2001.
DOI : 10.1006/cviu.2000.0894

]. N. Ghanem, D. Dementhon, D. Doermann, and L. Davis, Representation and Recognition of Events in Surveillance Video Using Petri Nets, 2004 Conference on Computer Vision and Pattern Recognition Workshop, pp.112-112, 2004.
DOI : 10.1109/CVPR.2004.430

. Gupta, J. Srinivasan, L. Shi, and . Davis, Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.2012-2019, 2009.
DOI : 10.1109/CVPR.2009.5206492

. Haag, Incremental recognition of traffic situations from video image sequences, Image and Vision Computing, vol.18, issue.2, pp.137-153, 2000.
DOI : 10.1016/S0262-8856(99)00021-9

W. Hu, X. Xiao, Z. Fu, T. Xie, S. Tan et al., A system for learning statistical motion patterns, IEEE Trans. Pattern Anal. Mach. Intell, vol.28, issue.28, pp.1450-1464, 2006.

]. Ivanov and A. Bobick, Recognition of visual activities and interactions by stochastic parsing, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.8, pp.852-872, 2000.
DOI : 10.1109/34.868686

]. Jain, Data clustering: a review, ACM Computing Surveys, vol.31, issue.3, pp.1-60, 2011.
DOI : 10.1145/331499.331504

]. Jiang, Y. Wu, and A. Katsaggelos, A Dynamic Hierarchical Clustering Method for Trajectory-Based Unusual Video Event Detection, IEEE Transactions on Image Processing, vol.18, issue.4, pp.907-913, 2009.
DOI : 10.1109/TIP.2008.2012070

H. Jonathan, G. Anthony, C. Cohn-david, H. Hogg-jonathan, and . Fernyhough, Generation of Semantic Regions from Image Sequences, 1996.

A. Katz, . Ford, B. Moskowitz, M. Jackson, and . Jaffe, Studies of Illness in the Aged, JAMA, vol.185, issue.12, pp.914-919, 1963.
DOI : 10.1001/jama.1963.03060120024016

]. Kitani, Y. Sato, and A. Sugimoto, Recovering the Basic Structure of Human Activities from a Video-Based Symbol String, 2007 IEEE Workshop on Motion and Video Computing (WMVC'07), pp.9-9, 2007.
DOI : 10.1109/WMVC.2007.34

D. Kuettel, D. Michael, L. Breitenstein, V. Van-gool, and . Ferrari, What's going on? Discovering spatio-temporal dependencies in dynamic scenes, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.1951-1958, 2010.
DOI : 10.1109/CVPR.2010.5539869

I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2008.
DOI : 10.1109/CVPR.2008.4587756

URL : https://hal.archives-ouvertes.fr/inria-00548659

E. Gal-lavee, M. Rivlin, and . Rudzsky, Understanding Video Events: A Survey of Methods for Automatic Interpretation of Semantic Occurrences in Video, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol.39, issue.5, pp.489-504, 2009.
DOI : 10.1109/TSMCC.2009.2023380

]. Li, W. Hu, and W. Hu, A Coarse-to-Fine Strategy for Vehicle Motion Trajectory Clustering, ICPR '06, pp.591-594, 2006.

]. Li, W. Hu, and W. Hu, A Coarse-to-Fine Strategy for Vehicle Motion Trajectory Clustering, ICPR '06: Proceedings of the 18th International Conference on Pattern Recognition, pp.591-594, 2006.

]. Liu, S. Ali, and M. Shah, Recognizing human actions using multiple features, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1-8, 2008.

]. Liu and M. Shah, Learning human actions via information maximization, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1-8, 2008.

]. Lowe-1999 and . Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1150-1157, 1999.
DOI : 10.1109/ICCV.1999.790410

M. Lublinerman, O. Sznaier, and . Camps, Dynamics Based Robust Motion Segmentation, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), pp.1176-1184, 2006.
DOI : 10.1109/CVPR.2006.105

]. D. Makris and T. Ellis, Learning Semantic Scene Models From Observing Activity in Visual Surveillance, IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), vol.35, issue.3, pp.397-408, 2005.
DOI : 10.1109/TSMCB.2005.846652

]. D. Makris and T. Ellis, Learning Semantic Scene Models From Observing Activity in Visual Surveillance, IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), vol.35, issue.3, pp.397-408, 2005.
DOI : 10.1109/TSMCB.2005.846652

]. D. Makris and T. Ellis, Learning Semantic Scene Models From Observing Activity in Visual Surveillance, IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), vol.35, issue.3, pp.397-408, 2005.
DOI : 10.1109/TSMCB.2005.846652

D. Christopher, H. Manning, and . Schutze, Foundations of Statistical Natural Language Processing, pp.1-704, 2005.

]. D. Minnen, I. Essa, and T. Starner, Expectation grammars: leveraging high-level expectations for activity recognition, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., pp.626-632, 2003.
DOI : 10.1109/CVPR.2003.1211525

]. Morris and M. Trivedi, A Survey of Vision-Based Trajectory Learning and Analysis for Surveillance, IEEE Transactions on Circuits and Systems for Video Technology, pp.1114-1127, 2008.
DOI : 10.1109/TCSVT.2008.927109

T. Brendan, . Morris, M. Mohan, and . Trivedi, Learning and Classification of Trajectories in Dynamic Scenes: A General Framework for Live Video Analysis. Advanced Video and Signal Based Surveillance, 2008.

M. Lawton and E. Brody, ASSESSMENT OF OLDER PEOPLE, Nursing Research, vol.19, issue.3, 1969.
DOI : 10.1097/00006199-197005000-00029

A. Nghiem, F. Brémond, and M. Thonnat, Controlling background subtraction algorithms for robust object detection, 3rd International Conference on Imaging for Crime Detection and Prevention (ICDP 2009), 2009.
DOI : 10.1049/ic.2009.0273

URL : https://hal.archives-ouvertes.fr/inria-00502932

N. Nguyen, D. Phung, S. Venkatesh, and H. Bui, Learning and Detecting Activities from Movement Trajectories Using the Hierarchical Hidden Markov Models, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.955-960, 2005.
DOI : 10.1109/CVPR.2005.203

]. , C. Niebles, H. Wang, and L. Fei-fei, Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words, International Journal of Computer Vision, vol.79, issue.3, pp.299-318, 2008.

]. N. Oliver, B. Rosario, and A. P. Pentland, A Bayesian computer vision system for modeling human interactions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.8, pp.831-843, 2000.
DOI : 10.1109/34.868684

P. Sangho and J. Aggarwal, A hierarchical Bayesian network for event recognition of human actions and interactions, Multimedia Systems, vol.10, issue.2, pp.164-179, 2004.

]. L. Patino, H. Benhadda, E. Corvee, M. Bremond, and . Thonnat, Extraction of activity patterns on large video recordings, IET Computer Vision, vol.2, issue.2, pp.108-128, 2008.
DOI : 10.1049/iet-cvi:20070062

URL : https://hal.archives-ouvertes.fr/inria-00502826

]. C. Piciarelli and G. Foresti, On-line trajectory clustering for anomalous events detection, Pattern Recognition Letters, vol.27, issue.15, pp.1835-1842, 2006.
DOI : 10.1016/j.patrec.2006.02.004

]. C. Piciarelli and G. L. Foresti, On-line trajectory clustering for anomalous events detection, Pattern Recognition Letters, vol.27, issue.15, pp.1835-1842, 2006.
DOI : 10.1016/j.patrec.2006.02.004

]. Pinhanez and A. Bobick, Human action detection using PNF propagation of temporal constraints, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231), pp.898-904, 1998.
DOI : 10.1109/CVPR.1998.698711

]. F. Porikli, Learning object trajectory patterns by spectral clustering. Multimedia and Expo, IEEE International Conference on, pp.1171-1174, 2004.
DOI : 10.1109/icme.2004.1394427

L. Pusiol, F. Patino, M. Bremond, and . Thonnant, Optimizing Trajectories Clustering for Activity Recognition. 1st International workshop on machine learning for vision-based motion analysis (MLVMA 08) in association with ECCV, p.82, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00326718

]. Pusiol, F. Brémond, and M. Thonnat, Trajectory Based Activity Discovery, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance, pp.270-277, 2010.
DOI : 10.1109/AVSS.2010.15

URL : https://hal.archives-ouvertes.fr/inria-00504634

Y. Rapantzikos, S. Avrithis, and . Kollias, Dense saliency-based spatiotemporal feature points for action recognition, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.1454-1461, 2009.
DOI : 10.1109/CVPR.2009.5206525

. Romdhane, N. Mulin, A. Zouba, . Derreumeaux, . Piano et al., Automatic Video Monitoring system for assessment of Alzheimer's Disease symptoms . JNHA -The Journal of Nutrition, Health and Aging Ms, vol.131, p.128, 2010.

]. Ross, Managing care through the air [remote health monitoring, IEEE Spectrum, vol.41, issue.12, pp.26-31, 2004.
DOI : 10.1109/MSPEC.2004.1363637

]. Ryoo and J. Aggarwal, Semantic Representation and Recognition of Continued and??Recursive Human Activities, International Journal of Computer Vision, vol.00, issue.1, pp.1-24, 2008.
DOI : 10.1007/s11263-008-0181-1

A. Silvio-savarese, J. C. Delpozo, L. Niebles, and . Fei-fei, Spatial-Temporal correlatons for unsupervised action classification, 2008 IEEE Workshop on Motion and video Computing, pp.1-8, 2008.
DOI : 10.1109/WMVC.2008.4544068

C. Schuldt, I. Laptev, and B. Caputo, Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., pp.32-36, 2004.
DOI : 10.1109/ICPR.2004.1334462

M. Shechtman and . Irani, Space-Time Behavior Based Correlation, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.405-412, 2005.
DOI : 10.1109/CVPR.2005.328

M. Sheikh and M. Shah, Exploring the space of a human action, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.144-149, 2005.
DOI : 10.1109/ICCV.2005.90

]. Shi and C. Tomasi, Good Features to Track, IEEE CVPR'94, pp.593-600, 1994.

Y. Shi, Y. Huang, D. Minnen, A. Bobick, and I. Essa, Propagation networks for recognition of partially ordered sequential action, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.862-869, 2004.

J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio et al., Real-time human pose recognition in parts from single depth images, CVPR 2011, pp.1297-1304, 2011.
DOI : 10.1109/CVPR.2011.5995316

]. J. Siskind, Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic, 2011.

]. Smeulders, . Worring, A. Santini, R. Gupta, and . Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

. Sminchisescu, Z. Kanaujia, D. Li, and . Metaxas, Conditional models for contextual human motion recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05), pp.1808-1815, 2005.

M. Stikic, T. Huynh, K. Van-laerhoven, and B. Schiele, ADL recognition based on the combination of RFID and accelerometer sensing, Second International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth), pp.258-263, 2008.

]. Sun, X. Wu, S. Yan, L. Cheong, T. Chua et al., Hierarchical spatio-temporal context modeling for action recognition, IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), pp.2004-2011, 2009.

]. Teh, I. Michael, . Jordan, J. Matthew, . Beal et al., Sharing Clusters Among Related Groups hierarchical Dirichlet Processes, pp.1-8, 2005.

D. Son, . Tran, S. Larry, and . Davis, Event Modeling and Recognition Using Markov Logic Networks, Lecture Notes in Computer Science, vol.5303, 2008.

. Turaga, V. Chellappa, O. Subrahmanian, and . Udrea, Machine Recognition of Human Activities: A Survey, IEEE Transactions on Circuits and Systems for Video Technology, pp.1473-1488, 2008.
DOI : 10.1109/TCSVT.2008.2005594

A. Veeraraghavan, The Function Space of an Activity, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), pp.959-968, 2006.
DOI : 10.1109/CVPR.2006.304

N. Harini-veeraraghavan, P. Papanikolopoulos, and . Schrater, Learning Dynamic Event Descriptions in Image Sequences, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-6, 2006.
DOI : 10.1109/CVPR.2007.383075

]. Vu, F. Brémond, and M. Thonnat, Automatic Video Interpretation: A Novel Algorithm for Temporal Scenario Recognition, International Joint Conference on Artificial Intelligence (IJCAI), pp.1295-1302, 2003.

W. Liang and D. Suter, Recognizing Human Activities from Silhouettes: Motion Subspace and Factorial Discriminative Graphical Model, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.

]. Wang, X. Ma, and W. Grimson, Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.3, pp.539-555, 2009.
DOI : 10.1109/TPAMI.2008.87

Y. Wang and G. Mori, Human Action Recognition by Semilatent Topic Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.10, pp.1762-1774, 2009.
DOI : 10.1109/TPAMI.2009.43

. Shu-fai, R. Wong, and . Cipolla, Extracting Spatiotemporal Interest Points using Global Information, IEEE 11th International Conference on Computer Vision, pp.1-8, 2007.

J. Yamato, K. Ohya, and . Ishii, Recognizing human action in timesequential images using hidden Markov model, 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.379-385, 1992.

S. Zhu, Learning deformable action templates from cluttered videos, IEEE 12th International Conference on Computer Vision (ICCV), pp.1507-1514, 2009.

M. Yilma and . Shah, Recognizing human actions in videos acquired by uncalibrated moving cameras, Tenth IEEE International Conference on Computer Vision (ICCV'05), pp.150-157, 2005.

Y. and J. Aggarwal, Detection of fence climbing from monocular video, 18th International Conference on Pattern Recognition (ICPR'06), pp.375-378, 2006.

]. Zaidi, On temporal logic programming using Petri nets, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, vol.29, issue.3, pp.245-254, 1999.
DOI : 10.1109/3468.759269

M. Irani, Statistical analysis of dynamic actions, IEEE Trans. Pattern Anal. Mach. Intell, vol.28, issue.9, pp.1530-1535, 2006.

]. Zhang, D. Gatica-perez, S. Bengio, and I. Mccowan, Modeling individual and group actions in meetings with layered HMMs, IEEE Transactions on Multimedia, vol.8, issue.3, pp.509-520, 2006.
DOI : 10.1109/TMM.2006.870735

F. Zouba, M. Bremond, and . Thonnat, Multisensor Fusion for Monitoring Elderly Activities at Home, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009.
DOI : 10.1109/AVSS.2009.27

URL : https://hal.archives-ouvertes.fr/inria-00453036

D. Marcos, F. Zúñiga, M. Brémond, and . Thonnat, Realtime reliability measure-driven multi-hypothesis tracking using 2D and 3D features, EURASIP Journal on Advances in Signal Processing, vol.2011, issue.1, p.142, 2011.