Action Representation and Recognition

Daniel Weinland 1
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : Recognizing human actions is an important and challenging topic in computer vision, withmany important applications including video surveillance, video indexing and understanding of social interaction. From a computational perspective, actions can be defined as four-dimensional patterns, in space and in time. Such patterns can be modeled using several representations which differ from each other with respect to, among others, the visual information used, e.g. shape or appearance, the representation of dynamics, e.g. implicit or explicit, and the amount of invariance that the representation exhibits, e.g. a viewpoint invariance allowing to learn and recognize using different camera configurations. Our goal in this thesis is to develop a set of new techniques for action recognition. In the first part we present "Motion History Volumes", a free-viewpoint representation for human actions based on 3D visual-hull reconstructions computed form multiple calibrated, and backgroundsubtracted, video cameras. Results indicate that this representation can be used to learn and recognize basic human action classes, independently of gender, body size and viewpoint. We then present in the second part an approach based on a 3D exemplar-based HMM, which addresses the problem of recognizing actions from arbitrary views, even from a single camera. We will thus no longer require a 3D reconstruction during the recognition phase, instead we will use learned 3D models to produce 2D image information, which is compared to the observations. In the third and last part, we present a compact and efficient exemplar-based representation, which in particular does not attempt to encode the dynamics of an action through temporal dependencies. In experimental results we demonstrate that such a representation can precisely recognize actions, even with cluttered and non-background-segmented sequences.
Document type :
Theses
Complete list of metadatas

Cited literature [194 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00379318
Contributor : Daniel Weinland <>
Submitted on : Tuesday, April 28, 2009 - 11:35:56 AM
Last modification on : Wednesday, April 11, 2018 - 1:50:56 AM
Long-term archiving on : Thursday, June 10, 2010 - 10:06:22 PM

Identifiers

  • HAL Id : tel-00379318, version 1

Collections

Citation

Daniel Weinland. Action Representation and Recognition. Other [cs.OH]. Institut National Polytechnique de Grenoble - INPG, 2008. English. ⟨tel-00379318⟩

Share

Metrics

Record views

496

Files downloads

648