Analyse des données évolutives : application aux données d'usage du Web

Alzennyr Gomes da Silva 1
1 AxIS - Usage-centered design, analysis and improvement of information systems
CRISAM - Inria Sophia Antipolis - Méditerranée , Inria Paris-Rocquencourt
Abstract : Nowadays, more and more organizations are becoming reliant on the Internet. The Web has become one of the most widespread platforms for information change and retrieval. The growing number of traces left behind user transactions (e.g. : customer purchases, user sessions, etc.) automatically increases the importance of usage data analysis. Indeed, the way in which a web site is visited can change over time. These changes can be related to some temporal factors (day of the week, seasonality, periods of special offer, etc.). By consequence, the usage models must be continuously updated in order to reflect the current behaviour of the visitors. Such a task remains difficult when the temporal dimension is ignored or simply introduced into the data description as a numeric attribute. It is precisely on this challenge that the present thesis is focused. In order to deal with the problem of acquisition of real usage data, we propose a methodology for the automatic generation of artificial usage data over which one can control the occurrence of changes and thus, analyse the efficiency of a change detection system. Guided by tracks born of some exploratory analyzes, we propose a tilted window approach for detecting and following-up changes on evolving usage data. In order measure the level of changes, this approach applies two external evaluation indices based on the clustering extension. The proposed approach also characterizes the changes undergone by the usage groups (e.g. appearance, disappearance, fusion and split) at each timestamp. Moreover, the refereed approach is totally independent of the clustering method used and is able to manage different kinds of data other than usage data. The effectiveness of this approach is evaluated on artificial data sets of different degrees of complexity and also on real data sets from different domains (academic, tourism and marketing).
Document type :
Theses
Complete list of metadatas

https://tel.archives-ouvertes.fr/tel-00445501
Contributor : Alzennyr da Silva <>
Submitted on : Friday, January 8, 2010 - 5:03:09 PM
Last modification on : Friday, May 25, 2018 - 12:02:04 PM
Long-term archiving on: Thursday, June 17, 2010 - 10:32:06 PM

Identifiers

  • HAL Id : tel-00445501, version 1

Collections

Citation

Alzennyr Gomes da Silva. Analyse des données évolutives : application aux données d'usage du Web. Informatique [cs]. Université Paris Dauphine - Paris IX, 2009. Français. ⟨tel-00445501⟩

Share

Metrics

Record views

1200

Files downloads

6826