
Weakly supervised learning of deformable part models and convolutional neural networks for object detection

Yuxing Tang 1
1 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : In this dissertation we address the problem of weakly supervised object detection, in which the goal is to recognize and localize objects in weakly labeled images whose object-level annotations are incomplete during training. To this end, we propose two methods that learn two different models for the objects of interest. In our first method, we enhance weakly supervised Deformable Part-based Models (DPMs) by emphasizing the importance of the location and size of the initial class-specific root filter. We first compute a candidate pool that represents the potential locations of the object and serves as the root filter estimate, using a generic objectness measure (region proposals) to combine the most salient regions with “good” region proposals. We then cast the learning of the latent class label of each candidate window as a binary classification problem, training category-specific classifiers that coarsely classify each candidate window as either the target object or a non-target class. Furthermore, we improve detection by incorporating contextual information from image classification scores. Finally, we design a flexible enlarging-and-shrinking post-processing procedure that adjusts the DPM outputs to better match the approximate object aspect ratios and further improve final accuracy.
In our second method, we investigate how knowledge about object similarities from both the visual and the semantic domains can be transferred to adapt an image classifier into an object detector in a semi-supervised setting on a large-scale database, where only a subset of object categories is annotated with bounding boxes. We propose to transform image-level classifiers based on deep Convolutional Neural Networks (CNNs) into object detectors by modeling the differences between the two on categories that have both image-level and bounding box annotations, and then transferring this information to convert classifiers into detectors for categories without bounding box annotations. We have evaluated both approaches extensively on several challenging detection benchmarks, e.g., PASCAL VOC, ImageNet ILSVRC and Microsoft COCO. Both approaches compare favorably to the state of the art and show significant improvement over several other recent weakly supervised detection methods.
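
The classifier-to-detector transfer described for the second method can be pictured with a small NumPy sketch. This is only an illustration under stated assumptions, not the dissertation's implementation: the function name `adapt_classifiers`, the array layout (categories with bounding boxes come first), the choice of `k` nearest categories, and the normalized-similarity weighting are all hypothetical, and the similarity matrix stands in for whatever visual or semantic similarity is actually used.

```python
import numpy as np

def adapt_classifiers(clf_weights, det_weights, similarity, k=2):
    """Estimate detector weights for categories that lack bounding boxes.

    Hypothetical layout (an assumption for this sketch):
      clf_weights: (n_all, d) classifier weights for every category.
      det_weights: (n_boxed, d) detector weights for the first n_boxed categories.
      similarity:  (n_all, n_all) positive category similarities, higher = more alike.
    """
    n_boxed = det_weights.shape[0]
    # Classifier-to-detector difference, known only for categories with boxes.
    diffs = det_weights - clf_weights[:n_boxed]
    adapted = []
    for j in range(n_boxed, clf_weights.shape[0]):
        sims = similarity[j, :n_boxed]
        # Indices of the k most similar categories that have boxes.
        nearest = np.argsort(sims)[::-1][:k]
        # Normalized similarities act as transfer weights (assumes they are positive).
        w = sims[nearest] / sims[nearest].sum()
        # Shift the classifier by the weighted average difference of its neighbors.
        adapted.append(clf_weights[j] + w @ diffs[nearest])
    return np.array(adapted)

# Toy example: two categories with boxes, one without.
clf = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
det = np.array([[2.0, 0.0], [0.0, 3.0]])
sim = np.array([[1.0, 0.2, 3.0], [0.2, 1.0, 1.0], [3.0, 1.0, 1.0]])
estimated = adapt_classifiers(clf, det, sim, k=2)
```

The point of the sketch is that a category without box annotations borrows the adaptation learned on similar categories, so the more similar a neighbor is, the more its classifier-to-detector difference contributes.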

Cited literature : 60 references
Contributor : Abes Star
Submitted on : Tuesday, June 13, 2017 - 1:21:07 PM
Last modification on : Thursday, November 21, 2019 - 2:13:05 AM
Document(s) archived on : Tuesday, December 12, 2017 - 12:35:35 PM


Version validated by the jury (STAR)


  • HAL Id : tel-01538307, version 1


Yuxing Tang. Weakly supervised learning of deformable part models and convolutional neural networks for object detection. Other. Université de Lyon, 2016. English. ⟨NNT : 2016LYSEC062⟩. ⟨tel-01538307⟩


