Skip to Main content Skip to Navigation

Segmentation and structuring of video documents for indexing applications

Abstract : Recent advances in telecommunications, collaborated with the development of image and video processing and acquisition devices has lead to a spectacular growth of the amount of the visual content data stored, transmitted and exchanged over Internet. Within this context, elaborating efficient tools to access, browse and retrieve video content has become a crucial challenge. In Chapter 2 we introduce and validate a novel shot boundary detection algorithm able to identify abrupt and gradual transitions. The technique is based on an enhanced graph partition model, combined with a multi-resolution analysis and a non-linear filtering operation. The global computational complexity is reduced by implementing a two-pass approach strategy. In Chapter 3 the video abstraction problem is considered. In our case, we have developed a keyframe representation system that extracts a variable number of images from each detected shot, depending on the visual content variation. The Chapter 4 deals with the issue of high level semantic segmentation into scenes. Here, a novel scene/DVD chapter detection method is introduced and validated. Spatio-temporal coherent shots are clustered into the same scene based on a set of temporal constraints, adaptive thresholds and neutralized shots. Chapter 5 considers the issue of object detection and segmentation. Here we introduce a novel spatio-temporal visual saliency system based on: region contrast, interest points correspondence, geometric transforms, motion classes’ estimation and regions temporal consistency. The proposed technique is extended on 3D videos by representing the stereoscopic perception as a 2D video and its associated depth
Complete list of metadatas
Contributor : Abes Star :  Contact
Submitted on : Thursday, July 11, 2013 - 5:04:00 PM
Last modification on : Sunday, December 22, 2019 - 1:07:46 AM
Document(s) archivé(s) le : Saturday, October 12, 2013 - 8:55:23 AM


Version validated by the jury (STAR)


  • HAL Id : tel-00843596, version 1


Ruxandra Georgina Tapu. Segmentation and structuring of video documents for indexing applications. Economics and Finance. Institut National des Télécommunications, 2012. English. ⟨NNT : 2012TELE0050⟩. ⟨tel-00843596⟩



Record views


Files downloads