Skip to Main content Skip to Navigation

2D/3D knowledge inference for intelligent access to enriched visual content

Abstract : This Ph.D. thesis tackles the issue of sill and video object categorization. The objective is to associate semantic labels to 2D objects present in natural images/videos. The principle of the proposed approach consists of exploiting categorized 3D model repositories in order to identify unknown 2D objects based on 2D/3D matching techniques. We propose here an object recognition framework, designed to work for real time applications. The similarity between classified 3D models and unknown 2D content is evaluated with the help of the 2D/3D description. A voting procedure is further employed in order to determine the most probable categories of the 2D object. A representative viewing angle selection strategy and a new contour based descriptor (so-called AH), are proposed. The experimental evaluation proved that, by employing the intelligent selection of views, the number of projections can be decreased significantly (up to 5 times) while obtaining similar performance. The results have also shown the superiority of AH with respect to other state of the art descriptors. An objective evaluation of the intra and inter class variability of the 3D model repositories involved in this work is also proposed, together with a comparative study of the retained indexing approaches . An interactive, scribble-based segmentation approach is also introduced. The proposed method is specifically designed to overcome compression artefacts such as those introduced by JPEG compression. We finally present an indexing/retrieval/classification Web platform, so-called Diana, which integrates the various methodologies employed in this thesis
Complete list of metadatas

Cited literature [132 references]  Display  Hide  Download
Contributor : Abes Star :  Contact
Submitted on : Thursday, December 12, 2013 - 4:57:09 PM
Last modification on : Wednesday, June 24, 2020 - 4:18:17 PM
Long-term archiving on: : Thursday, March 13, 2014 - 11:10:36 AM


Version validated by the jury (STAR)


  • HAL Id : tel-00917972, version 1


Raluca-Diana Sambra-Petre. 2D/3D knowledge inference for intelligent access to enriched visual content. Other [cs.OH]. Institut National des Télécommunications, 2013. English. ⟨NNT : 2013TELE0012⟩. ⟨tel-00917972⟩



Record views


Files downloads