Skip to Main content Skip to Navigation
Theses

Weakly supervised learning for visual recognition

Thibaut Durand 1
1 MLIA - Machine Learning and Information Access
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : This thesis studies the problem of classification of images, where the goal is to predict if a semantic category is present in the image, based on its visual content. To analyze complex scenes, it is important to learn localized representations. To limit the cost of annotation during training, we have focused on weakly supervised learning approaches. In this thesis, we propose several models that simultaneously classify and localize objects, using only global labels during training. The weak supervision significantly reduces the cost of full annotation, but it makes learning more challenging. The key issue is how to aggregate local scores - e.g. regions - into global score - e.g. image. The main contribution of this thesis is the design of new pooling functions for weakly supervised learning. In particular, we propose a “max + min” pooling function, which unifies many pooling functions. We describe how to use this pooling in the Latent Structured SVM framework as well as in convolutional networks. To solve the optimization problems, we present several solvers, some of which allow to optimize a ranking metric such as Average Precision. We experimentally show the interest of our models with respect to state-of-the-art methods, on ten standard image classification datasets, including the large-scale dataset ImageNet.
Document type :
Theses
Complete list of metadata

Cited literature [199 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01667325
Contributor : Abes Star :  Contact
Submitted on : Wednesday, November 15, 2017 - 10:15:08 AM
Last modification on : Friday, January 8, 2021 - 5:34:10 PM
Long-term archiving on: : Friday, February 16, 2018 - 12:54:44 PM

File

these_archivage_3274316o.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01667325, version 2

Citation

Thibaut Durand. Weakly supervised learning for visual recognition. Artificial Intelligence [cs.AI]. Université Pierre et Marie Curie - Paris VI, 2017. English. ⟨NNT : 2017PA066142⟩. ⟨tel-01667325v2⟩

Share

Metrics

Record views

1081

Files downloads

553