Skip to Main content Skip to Navigation
Theses

Modélisation de documents combinant texte et image : application à la catégorisation et à la recherche d'information multimédia

Abstract : Exploiting multimedia documents leads to representation problems of the textual and visual information within documents. Our goal is to propose a model to represent these both information and to combine them for two tasks: categorization and information retrieval. This model represents documents as bags of words, which requires to define adapted vocabularies. The textual vocabulary, usually very large, corresponds to the words of documents while the visual one is created by extracting low-level features from images. We study the different steps of its creation and the tf.idf weighting of visual words in images usually used for textual words. In the context of the text categorization, we introduce a criterion to select the most discriminative words for categories in order to reduce the vocabulary size without degrading the results of classification. We also present in the multilabel context, a method that lets us to select the number of categories which must be associated with a document. In multimedia information retrieval, we propose an analytical approach based on machine learning techniques to linearly combine the results from textual and visual information which significantly improves research results. Our model has shown its efficiency on different collections of important size and was evaluated in several international competitions such as XML Mining and ImageCLEF
Complete list of metadatas

https://tel.archives-ouvertes.fr/tel-00630438
Contributor : Abes Star :  Contact
Submitted on : Wednesday, May 2, 2012 - 2:02:17 PM
Last modification on : Monday, January 13, 2020 - 5:46:04 PM
Long-term archiving on: : Friday, August 3, 2012 - 2:43:25 AM

File

manuscritMoulin.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00630438, version 2

Citation

Christophe Moulin. Modélisation de documents combinant texte et image : application à la catégorisation et à la recherche d'information multimédia. Modélisation et simulation. Université Jean Monnet - Saint-Etienne, 2011. Français. ⟨NNT : 2011STET4007⟩. ⟨tel-00630438v2⟩

Share

Metrics

Record views

716

Files downloads

249