Skip to Main content Skip to Navigation
Theses

Découverte et exploitation d'objets visuels fréquents dans des collections multimédias

Pierre Letessier 1, 2
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Automatically linking multimedia documents that contain one or several instances of the same visual object has many applications including: salient events detection, relevant patterns discovery in scientific data or simply web browsing through hyper-visual links. Whereas efficient methods now exist for searching rigid objects in large collections, discovering them from scratch is still challenging in terms of scalability, particularly when the targeted objects are small compared to the whole image. In this PhD, we first revisited formally the problem of mining or discovering such objects, and then generalized two kinds of existing methods for probing candidate object seeds: weighted adaptive sampling and hashing based methods. We then introduced a new high-dimensional data hashing strategy, that works first at the visual level, and then at the geometric level. We conducted large-scale experiments on millions of images and on a new dedicated evaluation dataset (FlickrBelgaLogos.html) that we shared with the community. We did show that our method outperforms the reference method Geometric Min Hash. Based on this contribution, we then address the problem of suggesting object-based visual queries in a multimedia search engine. State-of-the-art visual search systems are usually based on the query-by-window paradigm: a user selects any image region containing an object of interest and the system returns a ranked list of images that are likely to contain other instances of the query object. User's perception of these tools is however affected by the fact that many submitted queries actually return nothing or only junk results (complex non-rigid objects, higher-level visual concepts, etc.).
Document type :
Theses
Complete list of metadatas

Cited literature [89 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00912992
Contributor : Alexis Joly <>
Submitted on : Monday, January 6, 2014 - 10:16:01 AM
Last modification on : Wednesday, October 9, 2019 - 11:44:04 AM
Document(s) archivé(s) le : Thursday, April 10, 2014 - 4:15:26 PM

Identifiers

  • HAL Id : tel-00912992, version 2

Collections

Citation

Pierre Letessier. Découverte et exploitation d'objets visuels fréquents dans des collections multimédias. Multimédia [cs.MM]. Telecom ParisTech, 2013. Français. ⟨tel-00912992v2⟩

Share

Metrics

Record views

752

Files downloads

564