Feature selection for spatial point processes

Abstract : Recent applications such as forestry datasets involve the observations of spatial point pattern data combined with the observation of many spatial covariates. We consider in this thesis the problem of estimating a parametric form of the intensity function in such a context. This thesis develops feature selection procedures and gives some guarantees on their validity. In particular, we propose two different feature selection approaches: the lasso-type methods and the Dantzig selector-type procedures. For the methods considering lasso-type techniques, we derive asymptotic properties of the estimates obtained from estimating functions derived from Poisson and logistic regression likelihoods penalized by a large class of penalties. We prove that the estimates obtained from such procedures satisfy consistency, sparsity, and asymptotic normality. For the Dantzig selector part, we develop a modified version of the Dantzig selector, which we call the adaptive linearized Dantzig selector (ALDS), to obtain the intensity estimates. More precisely, the ALDS estimates are defined as the solution to an optimization problem which minimizes the sum of coefficients of the estimates subject to linear approximation of the score vector as a constraint. We find that the estimates obtained from such methods have asymptotic properties similar to the ones proposed previously using an adaptive lasso regularization term. We investigate the computational aspects of the methods developped using either lasso-type procedures or the Dantzig selector-type approaches. We make links between spatial point processes intensity estimation and generalized linear models (GLMs), so we only have to deal with feature selection procedures for GLMs. Thus, easier computational procedures are implemented and computationally fast algorithm are proposed. Simulation experiments are conducted to highlight the finite sample performances of the estimates from each of two proposed approaches. Finally, our methods are applied to model the spatial locations a species of tree in the forest observed with a large number of environmental factors.
Document type :
Theses
Complete list of metadatas

Cited literature [85 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01690838
Contributor : Abes Star <>
Submitted on : Tuesday, January 23, 2018 - 2:35:13 PM
Last modification on : Wednesday, June 20, 2018 - 8:09:35 PM
Long-term archiving on: Thursday, May 24, 2018 - 12:09:56 PM

File

CHOIRUDDIN_2017_archivage.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01690838, version 1

Collections

Citation

Achmad Choiruddin. Feature selection for spatial point processes. Complex Variables [math.CV]. Université Grenoble Alpes, 2017. English. ⟨NNT : 2017GREAM045⟩. ⟨tel-01690838⟩

Share

Metrics

Record views

269

Files downloads

307