Skip to Main content Skip to Navigation
Theses

Semi-parametric mixture models and applications to multiple testing

Abstract : In a multiple testing context, we consider a semiparametric mixture model with two components. One component is assumed to be known and corresponds to the distribution of p-values under the null hypothesis with prior probability p. The other component f is nonparametric and stands for the distribution under the alternative hypothesis. The problem of estimating the parameters p and f of the model appears from the false discovery rate control procedures. In the first part of this dissertation, we study the estimation of the proportion p. We discuss asymptotic efficiency results and establish that two different cases occur whether f vanishes on a non-empty interval or not. In the first case, we exhibit estimators converging at parametric rate, compute the optimal asymptotic variance and conjecture that no estimator is asymptotically efficient (i.e. attains the optimal asymptotic variance). In the second case, we prove that the quadratic risk of any estimator does not converge at parametric rate. In the second part of the dissertation, we focus on the estimation of the nonparametric unknown component f in the mixture, relying on a preliminary estimator of p. We propose and study the asymptotic properties of two different estimators for this unknown component. The first estimator is a randomly weighted kernel estimator. We establish an upper bound for its pointwise quadratic risk, exhibiting the classical nonparametric rate of convergence over a class of Holder densities. The second estimator is a maximum smoothed likelihood estimator. It is computed through an iterative algorithm, for which we establish a descent property. In addition, these estimators are used in a multiple testing procedure in order to estimate the local false discovery rate.
Complete list of metadatas

Cited literature [19 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-00987035
Contributor : Abes Star :  Contact
Submitted on : Monday, May 12, 2014 - 12:39:10 PM
Last modification on : Monday, December 23, 2019 - 3:50:10 PM
Document(s) archivé(s) le : Tuesday, August 12, 2014 - 10:50:11 AM

Identifiers

  • HAL Id : tel-00987035, version 1

Collections

Citation

van Hanh Nguyen. Semi-parametric mixture models and applications to multiple testing. General Mathematics [math.GM]. Université Paris Sud - Paris XI, 2013. English. ⟨NNT : 2013PA112196⟩. ⟨tel-00987035⟩

Share

Metrics

Record views

321

Files downloads

550