Skip to Main content Skip to Navigation

Sparse coding for machine learning, image processing and computer vision

Abstract : We study in this thesis a particular machine learning approach to represent signals that that consists of modelling data as linear combinations of a few elements from a learned dictionary. It can be viewed as an extension of the classical wavelet framework, whose goal is to design such dictionaries (often orthonormal basis) that are adapted to natural signals. An important success of dictionary learning methods has been their ability to model natural image patches and the performance of image denoising algorithms that it has yielded. We address several open questions related to this framework: How to efficiently optimize the dictionary? How can the model be enriched by adding a structure to the dictionary? Can current image processing tools based on this method be further improved? How should one learn the dictionary when it is used for a different task than signal reconstruction? How can it be used for solving computer vision problems? We answer these questions with a multidisciplinarity approach, using tools from statistical machine learning, convex and stochastic optimization, image and signal processing, computer vision, but also optimization on graphs.
Document type :
Complete list of metadatas

Cited literature [247 references]  Display  Hide  Download
Contributor : Abes Star :  Contact
Submitted on : Tuesday, May 24, 2011 - 2:38:20 PM
Last modification on : Thursday, October 29, 2020 - 3:01:56 PM
Long-term archiving on: : Thursday, August 25, 2011 - 2:24:28 AM


Version validated by the jury (STAR)


  • HAL Id : tel-00595312, version 1



Julien Mairal. Sparse coding for machine learning, image processing and computer vision. General Mathematics [math.GM]. École normale supérieure de Cachan - ENS Cachan, 2010. English. ⟨NNT : 2010DENS0040⟩. ⟨tel-00595312⟩



Record views


Files downloads