MODELE DE GRAPHE ET MODELE DE LANGUE POUR LA RECONNAISSANCE DE SCENES VISUELLES

Trong-Ton Pham

Thèse Année : 2010

MODELE DE GRAPHE ET MODELE DE LANGUE POUR LA RECONNAISSANCE DE SCENES VISUELLES

VISUAL GRAPH MODELING AND RETRIEVAL: A LANGUAGE MODEL APPROACH FOR SCENE RECOGNITION

(1)

Trong-Ton Pham

Fonction : Auteur
PersonId : 903618

Modélisation et Recherche d’Information Multimédia [Grenoble]

Résumé

Image retrieval and categorization may need to consider several types of visual features and spatial information between them (e.g., different point of views of an image). This thesis presents a novel approach that exploits an extension of the language modeling approach from information retrieval to the problem of graph-based image retrieval and categorization. Such versatile graph model is needed to represent the multiple points of views of images. A language model is defined on such graphs to handle a fast graph matching. We present the experiments achieved with several instances of the proposed model on two collections of images: one composed of 3,849 touristic images and another composed of 3,633 images captured by a mobile robot. Experimental results show that using visual graph model (VGM) improves the accuracies of the results of the standard language model (LM) and outperforms the Support Vector Machine (SVM) method.

Nous présentons une nouvelle méthode pour exploiter la relation entre différents niveaux de représentation d'image afin de compléter le modèle de graphe visuel. Le modèle de graphe visuel est une extension du modèle de langue classique en recherche d'information. Nous utilisons des régions d'images et des points d'intérêts (associées automatiquement à des concepts visuels), ainsi que des relations entre ces concepts, lors de la construction de la représentation sous forme de graphe. Les résultats obtenus sur catégorisation de la collection RobotVision de la compétition d'ImageCLEF 2009 et la collection STOIC-101 montrent que (a) la procédure de l'induction automatique des concepts d'une image est efficace, et (b) l'utilisation des relations spatiales entre deux niveaux de représentation, en plus de concepts, permet d'améliorer le taux de reconnaissance.

Mots clés

Graph Theory Image Representation Information Retrieval Language Modeling Scene Recognition Robot Localization

Représentation de graphes recherche d'information modèle de langue reconnaissance de scène localisation

Domaines

Informatique [cs] Interface homme-machine [cs.HC]

Fichier principal

thesis_pham.pdf (3.02 Mo)

Trong-Ton Pham : Connectez-vous pour contacter le contributeur

https://theses.hal.science/tel-00599927

Soumis le : vendredi 1 juin 2012-16:16:40

Dernière modification le : jeudi 4 avril 2024-18:17:19

Archivage à long terme le : dimanche 2 septembre 2012-02:47:02

Dates et versions

tel-00599927 , version 1 (11-06-2011)

tel-00599927 , version 2 (01-06-2012)

Identifiants

HAL Id : tel-00599927 , version 2

Citer

Trong-Ton Pham. MODELE DE GRAPHE ET MODELE DE LANGUE POUR LA RECONNAISSANCE DE SCENES VISUELLES. Computer Science [cs]. Université de Grenoble, 2010. English. ⟨NNT : ⟩. ⟨tel-00599927v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS LIG LIG_TDCGE_MRIM LIG_SIDCH

244 Consultations

394 Téléchargements

MODELE DE GRAPHE ET MODELE DE LANGUE POUR LA RECONNAISSANCE DE SCENES VISUELLES

VISUAL GRAPH MODELING AND RETRIEVAL: A LANGUAGE MODEL APPROACH FOR SCENE RECOGNITION

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager