Skip to Main content Skip to Navigation
Theses

Classification sur données médicales à l'aide de méthodes d'optimisation et de datamining, appliquée au pré-screening dans les essais cliniques

Julie Jacques 1, 2, 3 
Abstract : Medical data suffer from uncertainty and a lack of uniformisation, making them hard to use in medical software, especially for patient screening in clinical trials. In this PhD work, we propose to deal with these problems using supervised classification methods. We will focus on 3 properties of these data : imbalance, uncertainty and volumetry. We propose the MOCA-I algorithm to cope with this partial classification combinatorial problem, that uses a multi-objective local search algorithm. After having confirmed the benefits of multiobjectivization in this context, we calibrate MOCA-I and compare it to the best algorithms of the literature, on both real data sets and imbalanced data sets from literature. MOCA-I generates rule sets that are statistically better than models obtained by the best algorithmes of the literature. Moreover, the models generated by MOCA-I are between 2 to 6 times shorter. Regarding balanced data, we propose the MOCA algorithm, statistically equivalent to best algorithms of literature. Then, we analyze both theoretically and experimentally the behaviors of MOCA and MOCA-I depending on imbalance. In order to help the decision maker to choose a solution and reduce over-fitting, we propose and evaluate different methods to handle all the Pareto solutions generated by MOCA-I. Finally, we show how this work can be integrated into a software application.
Complete list of metadata

https://tel.archives-ouvertes.fr/tel-00919876
Contributor : Julie Jacques Connect in order to contact the contributor
Submitted on : Tuesday, December 17, 2013 - 2:16:50 PM
Last modification on : Thursday, January 20, 2022 - 5:27:51 PM
Long-term archiving on: : Monday, March 17, 2014 - 11:16:23 PM

Identifiers

  • HAL Id : tel-00919876, version 1

Citation

Julie Jacques. Classification sur données médicales à l'aide de méthodes d'optimisation et de datamining, appliquée au pré-screening dans les essais cliniques. Apprentissage [cs.LG]. Université des Sciences et Technologie de Lille - Lille I, 2013. Français. ⟨tel-00919876⟩

Share

Metrics

Record views

558

Files downloads

199