Structuring of image databases for the suggestion of products for online advertising

Abstract : The topic of the thesis is the extraction and segmentation of clothing items from still images using techniques from computer vision, machine learning and image description, in view of suggesting non intrusively to the users similar items from a database of retail products. We firstly propose a dedicated object extractor for dress segmentation by combining local information with a prior learning. A person detector is applied to localize sites in the image that are likely to contain the object. Then, an intra-image two-stage learning process is developed to roughly separate foreground pixels from the background. Finally, the object is finely segmented by employing an active contour algorithm that takes into account the previous segmentation and injects specific knowledge about local curvature in the energy function.We then propose a new framework for extracting general deformable clothing items by using a three stage global-local fitting procedure. A set of template initiates an object extraction process by a global alignment of the model, followed by a local search minimizing a measure of the misfit with respect to the potential boundaries in the neighborhood. The results provided by each template are aggregated, with a global fitting criterion, to obtain the final segmentation.In our latest work, we extend the output of a Fully Convolution Neural Network to infer context from local units(superpixels). To achieve this we optimize an energy function,that combines the large scale structure of the image with the locallow-level visual descriptions of superpixels, over the space of all possiblepixel labellings. In addition, we introduce a novel dataset called RichPicture, consisting of 1000 images for clothing extraction from fashion images.The methods are validated on the public database and compares favorably to the other methods according to all the performance measures considered.
Complete list of metadatas

Cited literature [78 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01683123
Contributor : Abes Star <>
Submitted on : Friday, January 12, 2018 - 6:26:09 PM
Last modification on : Saturday, December 21, 2019 - 3:50:38 AM
Long-term archiving on: Monday, May 7, 2018 - 12:20:49 PM

File

TheseYANG.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01683123, version 1

Collections

Citation

Lixuan Yang. Structuring of image databases for the suggestion of products for online advertising. Graphics [cs.GR]. Conservatoire national des arts et metiers - CNAM, 2017. English. ⟨NNT : 2017CNAM1102⟩. ⟨tel-01683123⟩

Share

Metrics

Record views

342

Files downloads

156