Modeling and visual recognition of human actions and interactions

Ivan Laptev

Hdr Année : 2013

Modeling and visual recognition of human actions and interactions

(1)

Ivan Laptev

Fonction : Auteur
PersonId : 865349

Models of visual object recognition and scene understanding

Résumé

This work addresses the problem of recognizing actions and interactions in realistic video settings such as movies and consumer videos. The first contribution of this thesis (Chapters 2 and 4) is concerned with new video representations for action recognition. We introduce local space-time descriptors and demonstrate their potential to classify and localize actions in complex settings while circumventing the difficult intermediate steps of person detection, tracking and human pose estimation. The material on bag-of-features action recognition in Chapter 2 is based on publications [L14, L22, L23] and is related to other work by the author [L6, L7, L8, L11, L12, L13, L16, L21]. The work on object and action localization in Chapter 4 is based on [L9, L10, L13, L15] and relates to [L1, L17, L19, L20]. The second contribution of this thesis is concerned with weakly-supervised action learning. Chap- ter 3 introduces methods for automatic annotation of action samples in video using readily-available video scripts. It addresses the ambiguity of action expressions in text and the uncertainty of tem- poral action localization provided by scripts. The material presented in Chapter 3 is based on publications [L4, L14, L18]. Finally Chapter 5 addresses interactions of people with objects and concerns modeling and recognition of object function. We exploit relations between objects and co-occurring human poses and demonstrate object recognition improvements using automatic pose estimation in challenging videos from YouTube. This part of the thesis is based on the publica- tion [L2] and relates to other work by the author [L3, L5].

Mots clés

computer vision action recognition video analysis

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

hdr_Ivan.pdf (14.03 Mo)

Minsu Cho : Connectez-vous pour contacter le contributeur

https://theses.hal.science/tel-01064540

Soumis le : mardi 16 septembre 2014-23:19:56

Dernière modification le : vendredi 19 avril 2024-16:18:57

Archivage à long terme le : mercredi 17 décembre 2014-11:21:29

Dates et versions

tel-01064540 , version 1 (16-09-2014)

Identifiants

HAL Id : tel-01064540 , version 1

Citer

Ivan Laptev. Modeling and visual recognition of human actions and interactions. Computer Vision and Pattern Recognition [cs.CV]. Ecole Normale Supérieure de Paris - ENS Paris, 2013. ⟨tel-01064540⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UNIV-RENNES1 CNRS INRIA IRISA THESES-ENS INRIA2 PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

506 Consultations

378 Téléchargements

Modeling and visual recognition of human actions and interactions

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager