Skip to Main content Skip to Navigation
Theses

Reformulation sémantique des requêtes pour la recherche d’information ad hoc sur le Web

Abstract : As a query expansion and reformulation solution, we are interested in the different ways the semantic could be used to translate users information need into a query. We define two types of concepts : those which we can identify in a semantic resource like an ontology, and the ones we extract from the collection of documents via pseudo relevance feedback procedure. We propose a semantic and mixed approach to query expansion and reformulation (ASMER) that allows to integrate these two types of concepts in an automatically modified query. Our approach considers many challenges, especially selective terms expansion, named entity treatment and query reformulation.Even though the precision is the evaluation criteria the most adapted to a web context, we also considered evaluating the recall to study the behavior of our model from different aspects. This choice led us to handle a different problem related to evaluating the recall in information retrieval. After realizing that actual measures don't satisfy our constraints, we proposed a new recall oriented measure (MOR) which considers the recall as a priority without ignoring the precision.Among other measures, MOR was considered to evaluate our approach ASMER on four web collection from the standard evaluation campaigns Inex and Trec. Our experiments showed that ASMER improves the precision of the non modified original queries. In most cases, our approach achieved statistically significant enhancements when compared to a state of the art query expansion method. In addition, ASMER retrieves the first relevant document in better ranks than the compared approaches, it also has slightly better recall according to the measure MOR.
Document type :
Theses
Complete list of metadatas

Cited literature [145 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01126932
Contributor : Abes Star :  Contact
Submitted on : Friday, March 6, 2015 - 10:59:16 PM
Last modification on : Wednesday, June 24, 2020 - 4:19:09 PM
Long-term archiving on: : Sunday, June 7, 2015 - 7:40:24 PM

File

2014EMSE0750.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01126932, version 1

Citation

Bissan Audeh. Reformulation sémantique des requêtes pour la recherche d’information ad hoc sur le Web. Autre. Ecole Nationale Supérieure des Mines de Saint-Etienne, 2014. Français. ⟨NNT : 2014EMSE0750⟩. ⟨tel-01126932⟩

Share

Metrics

Record views

1242

Files downloads

3999