Skip to Main content Skip to Navigation
Theses

Développement d’un indice de séparabilité adapté aux données de génomique en analyse de survie

Abstract : In oncogenomics research, one of the main objectives is to identify new genomic markers so as to construct predictive rules in order to classify patients according to time-to-event outcomes (death or tumor relapse). Most of the studies dealing with such high throughput data usually rely on a selection process in order to identify, among the candidates, the markers having a prognostic impact. A common problem among biologists is the choice of the selection rule. In survival analysis, classical procedures consist in ranking genetic markers according to either the estimated hazards ratio or quantities derived from a test statistic (p-value, q-value). However, these methods are not suitable for gene selection across multiple genomic datasets with different sample sizes.Using an index taking into account the magnitude of the prognostic impact of factors without being highly dependent on the sample size allows to address this issue. In this work, we propose a novel index of predictive ability for selecting genomic markers having a potential impact on timeto-event outcomes. This index extends the notion of "pseudo-R2" in the ramework of survival analysis. It possesses an original and straightforward interpretation in terms of "separability". The index is first derived in the framework of the Cox model and then extended to more complex non-proportional hazards models. Simulations show that our index is not substantially affected by the sample size of the study and the censoring. They also show that its separability performance is higher than indices from the literature. The interest of the index is illustrated in two examples. The first one aims at identifying genomic markers with common effects across different cancertypes. The second shows, in the framework of a lung cancer study, the interest of the index for selecting genomic factor with crossing hazards functions, which could be explained by some "modulating" effects between markers. The proposed index is a promising tool, which can help researchers to select a list of features of interest for further biological investigations.
Document type :
Theses
Complete list of metadatas

Cited literature [105 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00718743
Contributor : Abes Star :  Contact
Submitted on : Wednesday, July 18, 2012 - 10:02:18 AM
Last modification on : Wednesday, October 14, 2020 - 4:00:11 AM
Long-term archiving on: : Friday, October 19, 2012 - 2:35:09 AM

File

VA2_ROUAM_SIGRID_30_03_2011.pd...
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00718743, version 1

Collections

STAR | CESP | UVSQ

Citation

Sigrid Laure Rouam. Développement d’un indice de séparabilité adapté aux données de génomique en analyse de survie. Santé publique et épidémiologie. Université Paris Sud - Paris XI, 2011. Français. ⟨NNT : 2011PA11T006⟩. ⟨tel-00718743⟩

Share

Metrics

Record views

643

Files downloads

591