Sparse coding for machine learning, image processing and computer vision

Abstract : We study in this thesis a particular machine learning approach to representing signals, which consists of modelling data as linear combinations of a few elements from a learned dictionary. It can be viewed as an extension of the classical wavelet framework, whose goal is to design such dictionaries (often orthonormal bases) that are adapted to natural signals. An important success of dictionary learning methods has been their ability to model natural image patches, and the image denoising performance that this has yielded. We address several open questions related to this framework: How can the dictionary be optimized efficiently? How can the model be enriched by adding structure to the dictionary? Can current image processing tools based on this method be further improved? How should one learn the dictionary when it is used for a task other than signal reconstruction? How can it be used for solving computer vision problems? We answer these questions with a multidisciplinary approach, using tools from statistical machine learning, convex and stochastic optimization, image and signal processing, computer vision, and also optimization on graphs.
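For readers unfamiliar with the model, the phrase "linear combinations of a few elements from a learned dictionary" is commonly written as the optimization problem sketched below. This is a standard textbook formulation given for illustration, not a quotation from the thesis. The symbols are assumptions introduced here: training signals x_i in R^m, a dictionary D with p columns d_j, sparse codes alpha_i in R^p, and a regularization weight lambda.

```latex
% Illustrative sketch: a common l1-regularized dictionary learning problem.
% All symbols (x_i, D, d_j, \alpha_i, \lambda, \mathcal{C}) are assumed notation.
\min_{D \in \mathcal{C},\; \alpha_1, \dots, \alpha_n \in \mathbb{R}^p}
  \frac{1}{n} \sum_{i=1}^{n}
  \left( \frac{1}{2} \left\| x_i - D \alpha_i \right\|_2^2
         + \lambda \left\| \alpha_i \right\|_1 \right),
\qquad
\mathcal{C} = \left\{ D \in \mathbb{R}^{m \times p} :
  \left\| d_j \right\|_2 \le 1,\ j = 1, \dots, p \right\}
```

The l1 penalty encourages each code alpha_i to have only a few nonzero entries, and the norm constraint on the columns of D prevents the dictionary from growing arbitrarily large to shrink the codes.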
Document type :
Theses
Mathematics. École normale supérieure de Cachan - ENS Cachan, 2010. English. <NNT : 2010DENS0040>

https://tel.archives-ouvertes.fr/tel-00595312
Contributor : ABES STAR
Submitted on : Tuesday, May 24, 2011 - 2:38:20 PM
Last modification on : Wednesday, February 11, 2015 - 11:08:23 AM

File

Mairal2010.pdf

Identifiers

  • HAL Id : tel-00595312, version 1

Citation

Julien Mairal. Sparse coding for machine learning, image processing and computer vision. Mathematics. École normale supérieure de Cachan - ENS Cachan, 2010. English. <NNT : 2010DENS0040>. <tel-00595312>

Metrics

Record views

1351

Document downloads

431