Sélection de variables dans le modèle linéaire Report for the midterm review of project ClasSel References [Akaike 1970] H. Akaike. Statistical predictor identification, Annals of the Institute of Statistical Mathematics, vol.22, issue.1, pp.203-217, 1970. ,
Information theory and an extension of the maximum likelihood principle, Second International Symposium on Information Theory, pp.267-281 ,
A new look at the statistical model identification, IEEE Transactions on Automatic Control, vol.19, issue.6, pp.716-723, 1974. ,
DOI : 10.1109/TAC.1974.1100705
The Relationship Between Variable Selection and Data Agumentation and a Method for Prediction, Technometrics, vol.23, issue.3, pp.125-127, 1974. ,
DOI : 10.2307/1267352
The DC (Difference of Convex Functions) Programming and DCA Revisited with DC Models of Real World Nonconvex Optimization Problems, Annals of Operations Research, vol.6, issue.1, pp.23-46, 2005. ,
DOI : 10.1007/s10479-004-5022-1
Scale mixtures of normal distributions, Journal of the Royal Statistical Society. Series B (Methodological), vol.36, pp.99-102, 1974. ,
Data-driven calibration of linear estimators with minimal penalties, Advances in Neural Information Processing Systems 22, pp.46-54, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00414774
A survey of cross-validation procedures for model selection, Statistics Surveys, vol.4, issue.0, pp.40-79, 2010. ,
DOI : 10.1214/09-SS054
URL : https://hal.archives-ouvertes.fr/hal-00407906
Data-driven calibration of penalties for leastsquares regression, Journal of Machine Learning Research, vol.10, pp.245-279, 2009. ,
URL : https://hal.archives-ouvertes.fr/inria-00287631
Optimization for Machine Learning, Chapter Convex optimization with sparsity-inducing norms, pp.19-54, 2011. ,
Gaussian model selection with an unknown variance, The Annals of Statistics, vol.37, issue.2, pp.630-672, 2009. ,
DOI : 10.1214/07-AOS573
URL : https://hal.archives-ouvertes.fr/hal-00756074
Universal approximation bounds for superpositions of a sigmoidal function, IEEE Transactions on Information Theory, vol.39, issue.3, pp.291-319, 1993. ,
DOI : 10.1109/18.256500
Approximation and estimation bounds for artificial neural networks, Machine Learning, pp.115-133, 1994. ,
Model Selection and Error Estimation, Machine Learning, pp.85-113, 2002. ,
DOI : 10.2139/ssrn.248567
Comparison of stopping rules in forward" stepwise" regression, Journal of the American Statistical Association, vol.72, pp.46-53, 1977. ,
Model selection via bilevel optimization, Neural Networks, 2006. IJCNN'06. International Joint Conference on, pp.1922-1929, 2006. ,
Bilevel Optimization and Machine Learning, Computational Intelligence: Research Frontiers, pp.25-47, 2008. ,
DOI : 10.1007/978-3-540-68860-0_2
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.224.7503
Statistical Decision Theory and Bayesian Analysis, pp.15-18, 1985. ,
DOI : 10.1007/978-1-4757-4286-2
Necessary Conditions for the CAPM, Journal of Economic Theory, vol.73, issue.1, pp.245-257, 1997. ,
DOI : 10.1006/jeth.1996.2218
Convex Analysis and Optimization . Athena Scientific optimization and computation series, Athena Scientific, 2003. ,
Choix de modèles en classification, 1997. ,
Gaussian model selection, Journal of the European Mathematical Society, vol.3, issue.3, pp.203-268, 2001. ,
DOI : 10.1007/s100970100031
Minimal penalties for Gaussian model selection . Probability Theory and Related Fields, pp.33-73, 2007. ,
Theory of Classification: a Survey of Some Recent Advances, ESAIM: Probability and Statistics, vol.9, pp.323-375, 2005. ,
DOI : 10.1051/ps:2005018
URL : https://hal.archives-ouvertes.fr/hal-00017923
Convex Optimization, 2004. ,
Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions, Psychometrika, vol.10, issue.2, pp.345-370, 1987. ,
DOI : 10.1007/BF02294361
Mixture-Model Cluster Analysis Using Model Selection Criteria and a New Informational Measure of Complexity, Proceedings of the First US/Japan Conference on the Frontiers of Statistical Modeling, pp.69-113, 1994. ,
DOI : 10.1007/978-94-011-0800-3_3
Akaike's Information Criterion and Recent Developments in Information Complexity, Journal of Mathematical Psychology, vol.44, issue.1, pp.62-91, 2000. ,
DOI : 10.1006/jmps.1999.1277
Generalizations of James-Stein Estimators Under Spherical Symmetry, The Annals of Statistics, vol.19, issue.3, pp.1639-1650, 1991. ,
DOI : 10.1214/aos/1176348267
Coordinate descent algorithms for nonconvex penalized regression, with applications to biological feature selection, The Annals of Applied Statistics, vol.5, issue.1, pp.232-2011, 2011. ,
DOI : 10.1214/10-AOAS388
Estimating Optimal Transformations for Multiple Regression and Correlation, Journal of the American Statistical Association, vol.41, issue.391, pp.580-598, 1985. ,
DOI : 10.1080/01621459.1985.10478157
Better Subset Regression Using the Nonnegative Garrote, Technometrics, vol.37, issue.4, pp.373-384, 1995. ,
DOI : 10.1080/01621459.1980.10477428
Heuristics of instability and stabilization in model selection, The Annals of Statistics, vol.24, issue.6, pp.2350-2383, 1996. ,
DOI : 10.1214/aos/1032181158
Fundamentals of Statistical Exponential Families: With Applications in Statistical Decision Theory. Lecture notes-monograph series, 1986. ,
Understanding WaveShrink: variance and bias estimation, Biometrika, vol.83, issue.4, pp.727-745, 1996. ,
DOI : 10.1093/biomet/83.4.727
Two-stage model selection procedures in partially linear regression, Canadian Journal of Statistics, vol.32, issue.2, pp.105-118, 2004. ,
Model Selection and Multimodel Inference: a Practical Information-Theoretic Approach, pp.29-37, 2002. ,
DOI : 10.1007/b97636
Model Selection for Simplicial Approximation, Foundations of Computational Mathematics, vol.33, issue.2, 2009. ,
DOI : 10.1007/s10208-011-9103-7
URL : https://hal.archives-ouvertes.fr/inria-00402091
Modern statistical estimation via oracle inequalities, Acta Numerica, vol.15, pp.257-326, 2006. ,
DOI : 10.1017/S0962492906230010
Monte-Carlo Tree Search: A new framework for game AI, Proceedings of the Fourth Artificial Intelligence and Interactive Digital Entertainment Conference, pp.216-217, 2008. ,
Comparison of Model Selection for Regression, Neural Computation, vol.15, issue.7, pp.1691-1714, 2003. ,
DOI : 10.1162/neco.1994.6.5.851
Learning from data: Concepts, Theory, and Methods, pp.12-14, 1998. ,
DOI : 10.1002/9780470140529
Model complexity control for regression using VC generalization bounds, IEEE Transactions on Neural Networks, vol.10, issue.5, pp.1075-1089, 1999. ,
DOI : 10.1109/72.788648
Elliptically Symmetric Distributions: A Review and Bibliography, International Statistical Review / Revue Internationale de Statistique, vol.49, issue.1, pp.67-74, 1981. ,
DOI : 10.2307/1403038
Model Selection and Model Averaging. Cambridge Series on Statistical and Probabilistic Mathematics, 2008. ,
Optimization and nonsmooth analysis, Classics In Applied Mathematics. Society for Industrial and Applied Mathematics, vol.5, 1990. ,
DOI : 10.1137/1.9781611971309
Ideal spatial adaptation by wavelet shrinkage, Biometrika, vol.81, issue.3, p.425, 1994. ,
DOI : 10.1093/biomet/81.3.425
Toward brain-computer interfacing, 2007. ,
Monte Carlo feature selection and interdependency discovery in supervised classification, Advances in Machine Learning II, pp.371-385, 2010. ,
Spherically Invariant Vector Random Fields in Space and Time, IEEE Transactions on Signal Processing, vol.59, issue.12, pp.5921-5929, 2011. ,
DOI : 10.1109/TSP.2011.2166391
Least angle regression (with discussions and authors reply), Annals of Statistics, vol.32, issue.42, pp.407-451, 2004. ,
How Biased is the Apparent Error Rate of a Prediction Rule?, Journal of the American Statistical Association, vol.39, issue.394, pp.461-470, 1986. ,
DOI : 10.1080/01621459.1986.10478291
Inadmissibility of sample mean and regression coefficients for elliptically contoured distributions, Northeastern Mathematical Journal, vol.1, pp.68-81, 1985. ,
Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties, Journal of the American Statistical Association, vol.96, issue.456, pp.1348-1360, 2001. ,
DOI : 10.1198/016214501753382273
Tuning parameter selection in high dimensional penalized likelihood, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.36, issue.3, 2012. ,
DOI : 10.1111/rssb.12001
Symmetric Multivariate and Related Distributions, of Monographs on Statistics and Applied Probability. Chapman & Hall/CRC, pp.72-77, 1989. ,
DOI : 10.1007/978-1-4899-2937-2
An Introduction to Probability Theory and its Applications, Series in Probability and Mathematical Statistics, 1966. ,
Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009. ,
DOI : 10.1016/j.sigpro.2007.01.024
Apprentissage statistique pour le signal: applications aux interfaces cerveau-machine, 2011. ,
URL : https://hal.archives-ouvertes.fr/tel-00687501
The Risk Inflation Criterion for Multiple Regression, The Annals of Statistics, vol.22, issue.4, pp.1947-1975, 1994. ,
DOI : 10.1214/aos/1176325766
On Bayes and unbiased estimators of loss, Annals of the Institute of Statistical Mathematics, vol.9, issue.2, pp.803-816, 2003. ,
DOI : 10.1007/BF02523394
Generalized Bayes minimax estimators of location vectors for spherically symmetric distributions, Journal of Multivariate Analysis, vol.99, issue.4, pp.735-750, 2008. ,
DOI : 10.1016/j.jmva.2007.03.007
Robust generalized Bayes minimax estimators of location vectors for spherically symmetric distribution with unknown scale. Borrowing Strength: Theory Powering Applications-A Festschrift for, pp.249-262, 2010. ,
Comparaisons de procédures de sélection d'un modèle de régression: une approche décisionnelle. Comptes rendus de l'Académie des sciences, Série 1, Mathématique, pp.865-870, 1994. ,
Estimation of a Loss Function for Spherically Symmetric Distributions in the General Linear Model, The Annals of Statistics, vol.23, issue.2, pp.571-592, 1995. ,
DOI : 10.1214/aos/1176324536
Loss Estimation for Spherically Symmetrical Distributions, Journal of Multivariate Analysis, vol.53, issue.2, pp.311-331, 1995. ,
DOI : 10.1006/jmva.1995.1039
On Improved Loss Estimation for Shrinkage Estimators, Statistical Science, vol.27, issue.1, pp.61-81, 2012. ,
DOI : 10.1214/11-STS380
Shrinkage estimation , 2012, pp.58-118 ,
From Statistics to Neural Networks Theory and Pattern Recognition Applications, Chapter An overview of predictive learning and function approximation, pp.1-61, 1994. ,
Recovering Sparse Signals With a Certain Family of Nonconvex Penalties and DC Programming, IEEE Transactions on Signal Processing, vol.57, issue.12, pp.4686-4698, 2009. ,
DOI : 10.1109/TSP.2009.2026004
URL : https://hal.archives-ouvertes.fr/hal-00439453
Feature selection as a one-player game, International Conference on Machine Learning, pp.359-366, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00484049
The Predictive Sample Reuse Method with Applications, Journal of the American Statistical Association, vol.36, issue.2, pp.320-328, 1975. ,
DOI : 10.1080/01621459.1975.10479865
The correlation structure of Matheron's classical variogram estimator under elliptically contoured distributions, Mathematical Geology, vol.32, issue.1, pp.127-137, 2000. ,
DOI : 10.1023/A:1007511019496
Calibration and empirical Bayes variable selection, Biometrika, vol.87, issue.4, pp.731-747, 2000. ,
DOI : 10.1093/biomet/87.4.731
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.18.3731
The Variable Selection Problem, Journal of the American Statistical Association, vol.7, issue.2, pp.1304-1308, 2000. ,
DOI : 10.1214/aos/1176349027
Statistical Methods for Spatio- Temporal Systems Chapter Geostatistical space-time models, stationarity, separability, and full symmetry, of Monographs on Statistics and Applied Probability, pp.151-175, 2007. ,
Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter, Technometrics, vol.5, issue.2, pp.215-223, 1979. ,
DOI : 10.1080/03610927508827223
Latent Block Model for Contingency Table, Communications in Statistics - Theory and Methods, vol.24, issue.3, pp.416-425, 2010. ,
DOI : 10.1016/j.csda.2007.09.007
URL : https://hal.archives-ouvertes.fr/hal-00447792
Elliptically Contoured Models in Statistics, 1993. ,
DOI : 10.1007/978-94-011-1646-6
Model selection: Beyond the bayesian/frequentist divide, Journal of Machine Learning Research, vol.11, pp.61-87, 2010. ,
Machine learning summer school, Chapter A practical guide to model selection, 2009. ,
SEMIPARAMETRIC MULTIVARIATE VOLATILITY MODELS, Econometric Theory, vol.52, issue.02, pp.251-280, 2007. ,
DOI : 10.1016/j.jimonfin.2006.04.006
Updating the Inverse of a Matrix, SIAM Review, vol.31, issue.2, pp.221-239, 1989. ,
DOI : 10.1137/1031049
The determination of the order of an autoregression, Journal of the Royal Statistical Society. Series B (Methodological), vol.41, issue.2, pp.190-195, 1979. ,
Computing proximal points of nonconvex functions, Mathematical Programming, pp.221-258, 2009. ,
DOI : 10.1007/s10107-007-0124-6
Generalized Additive Models, Chapman & Hall/CRC, 1990. ,
The Elements of Statistical Learning: Data mining, Inference and Prediction, 2005. ,
The Elements of Statistical Learning: Data Mining, Inference and Prediction, pp.14-18, 2008. ,
A Biometrics Invited Paper. The Analysis and Selection of Variables in Linear Regression, Biometrics, vol.32, issue.1, pp.1-49, 1976. ,
DOI : 10.2307/2529336
Ridge Regression: Applications to Nonorthogonal Problems, Technometrics, vol.3, issue.1, pp.69-82, 1970. ,
DOI : 10.2307/1266192
Robust Estimation of a Location Parameter, The Annals of Mathematical Statistics, vol.35, issue.1, pp.73-101, 1964. ,
DOI : 10.1214/aoms/1177703732
A Survey of Statistical Design and Linear Models, Chapter Robustness and designs, pp.287-303, 1975. ,
Robust Statistics, volume 67 of Wiley series in probability and mathematical statistics, 1981. ,
Regression and time series model selection in small samples, Biometrika, vol.76, issue.2, pp.297-307, 1989. ,
DOI : 10.1093/biomet/76.2.297
Bias of the corrected AIC criterion for underfitted regression and time series models, Biometrika, vol.78, issue.3, pp.499-509, 1991. ,
DOI : 10.1093/biomet/78.3.499
A CORRECTED AKAIKE INFORMATION CRITERION FOR VECTOR AUTOREGRESSIVE MODEL SELECTION, Journal of Time Series Analysis, vol.6, issue.3, pp.271-279, 1993. ,
DOI : 10.1214/aos/1176344897
Improved estimators of Kullback?Leibler information for autoregressive model selection in small samples, Biometrika, vol.77, issue.4, pp.709-719, 1990. ,
Estimation with Quadratic Loss, Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability: Held at the Statistical Laboratory, p.361, 1960. ,
DOI : 10.1007/978-1-4612-0919-5_30
On inadmissibility of some unbiased estimates of loss. Statistical Decision Theory and Related Topics, pp.361-379, 1988. ,
Robustness of Statistical Tests, pp.72-73, 1989. ,
Multivariate t Distributions and their Applications, 2004. ,
The Laplace Distribution and Generalizations: A Revisit with Applications to Communications, Economics, Engineering , and Finance, issue.183, 2001. ,
DOI : 10.1007/978-1-4612-0173-1
Une Introduction au Critère BIC: Fondements Théoriques et Interprétation, Journal de la Société française de statistique, pp.39-57, 2006. ,
MODEL SELECTION AND INFERENCE: FACTS AND FICTION, Econometric Theory, vol.307, issue.01, pp.21-59, 2005. ,
DOI : 10.1017/S0266466603191050__S0266466603191050
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.216.7997
Admissibility Results in Loss Estimation, The Annals of Statistics, vol.21, issue.1, pp.378-390, 1993. ,
DOI : 10.1214/aos/1176349031
A note on the lasso and related procedures in model selection, Statistica Sinica, vol.16, issue.132, pp.1273-1316, 2006. ,
From Stein's unbiased risk estimates to the method of generalized cross validation, Annals of Statistics, vol.13, issue.4, pp.1352-1377, 1985. ,
MODELING PHARMACOKINETIC DATA USING HEAVY-TAILED MULTIVARIATE DISTRIBUTIONS, Journal of Biopharmaceutical Statistics, vol.46, issue.3, pp.369-381, 2000. ,
DOI : 10.1007/BF01113502
Estimation of Normal Means: Frequentist Estimation of Loss, The Annals of Statistics, vol.17, issue.2, pp.890-906, 1989. ,
DOI : 10.1214/aos/1176347149
Complexity analysis of the lasso regularization path, Proceedings of the 29th International Conference on Machine Learning, p.2012 ,
Fully Bayes Factors with a Generalized g-prior. The Annals of Statistics, pp.2740-2765, 2011. ,
Concentration Inequalities and Model Selection: École d'Été de Probabilités de Saint-Flour XXXIII-2003. No. 1896, pp.14-33, 2007. ,
Data-driven penalty calibration: A case study for Gaussian mixture model selection, ESAIM: Probability and Statistics, pp.320-339, 2011. ,
DOI : 10.1051/ps/2010002
URL : https://hal.archives-ouvertes.fr/hal-00666813
Illustrations of the dynamical theory of gases.? Part I. On the motions and collisions of perfectly elastic spheres, The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, vol.19, issue.124, pp.19-32, 1860. ,
Regression and Time Series Model Selection, World Scientific, 1998. ,
DOI : 10.1142/3573
On the degrees of freedom in shaperestricted regression, Annals of Statistics, vol.28, issue.4, pp.1083-1104, 2000. ,
Clustering for binary data and mixture models ? choice of the models Applied stochastic models and data analysis, pp.269-278, 1998. ,
On the Relationship between Generalization Error, Hypothesis Complexity, and Sample Complexity for Radial Basis Functions, Neural Computation, vol.4, issue.4, pp.819-842, 1996. ,
DOI : 10.1137/1116025
Numerical optimization, 1999. ,
DOI : 10.1007/b98874
Improved estimation in a non-Gaussian parametric regression, Statistical Inference for Stochastic Processes, vol.62, issue.6, 2011. ,
DOI : 10.1007/s11203-013-9075-0
URL : https://hal.archives-ouvertes.fr/hal-00627557
Optimizing benefits from wind power participation in electricity market using advanced tools for wind power forecasting and uncertainty assessment, Proceedings of the 2004 European Wind Energy Conference, 2004. ,
URL : https://hal.archives-ouvertes.fr/hal-00529464
PM10 forecasting using clusterwise regression, Atmospheric Environment, vol.45, issue.38, pp.7005-7014, 2011. ,
DOI : 10.1016/j.atmosenv.2011.09.016
URL : https://hal.archives-ouvertes.fr/hal-00942963
Applied Regression Analysis: A Research Tool, 1998. ,
DOI : 10.1007/b98890
Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization, SIAM Review, vol.52, issue.3, pp.471-501 ,
Improved Estimation in Lognormal Models, Journal of the American Statistical Association, vol.66, issue.396, pp.1046-1049, 1986. ,
DOI : 10.1080/01621459.1986.10478371
Estimated Loss and Admissible Loss Estimators. Statistical Decision Theory and Related Topics IV, p.409, 1988. ,
Loss Functions for Loss Estimation, The Annals of Statistics, vol.16, issue.3, pp.1262-1269, 1988. ,
DOI : 10.1214/aos/1176350960
Ancillary Statistics and Estimation of the Loss in Estimation Problems, Annals of Mathematical Statistics, vol.39, issue.5, pp.1756-1758, 1968. ,
An asymptotic theory for linear model selection, Statistica Sinica, vol.7, pp.221-242, 1997. ,
Asymptotic mean efficiency of a selection of regression variables, Annals of the Institute of Statistical Mathematics, vol.68, issue.1, pp.415-423, 1983. ,
DOI : 10.1007/BF02480998
Inadmissibility of the usual estimator for the mean of a multivariate normal distribution, Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, pp.197-206, 1955. ,
Estimation of the Mean of a Multivariate Normal Distribution, The Annals of Statistics, vol.9, issue.6, pp.1135-1151, 1981. ,
DOI : 10.1214/aos/1176345632
How to Compare Different Loss Functions and Their Risks, Constructive Approximation, vol.26, issue.2, pp.225-287, 2007. ,
DOI : 10.1007/s00365-006-0662-3
Distribution of informational statistics and a criterion of model fitting, Suri-Kagaku (Mathematical Sciences), vol.153, pp.12-18, 1976. ,
Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society. Series B (Methodological), vol.58, issue.1, pp.267-288, 1996. ,
On uniform convergence of the frequencies of events to their probabilities. Teoriya veroyatnostei i ee primeneniya, pp.264-279, 1971. ,
Statistical Learning Theory, pp.24-85, 1998. ,
Contributions to the Theory of Statistical Estimation and Testing Hypotheses, The Annals of Mathematical Statistics, vol.10, issue.4, pp.299-326, 1939. ,
DOI : 10.1214/aoms/1177732144
Nearly unbiased variable selection under minimax concave penalty, The Annals of Statistics, vol.38, issue.2, pp.894-942, 2010. ,
DOI : 10.1214/09-AOS729
URL : http://arxiv.org/abs/1002.4734
On model selection consistency of Lasso, Journal of Machine Learning Research, vol.7, issue.2, p.2541, 2007. ,
Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.5, issue.2, pp.301-320, 2005. ,
DOI : 10.1073/pnas.201162998
On the adaptive elastic-net with a diverging number of parameters, The Annals of Statistics, vol.37, issue.4, p.1733, 2009. ,
DOI : 10.1214/08-AOS625
On the ???degrees of freedom??? of the lasso, The Annals of Statistics, vol.35, issue.5, pp.2173-2192, 2007. ,
DOI : 10.1214/009053607000000127
The Adaptive Lasso and Its Oracle Properties, Journal of the American Statistical Association, vol.101, issue.476, pp.1418-1429, 2006. ,
DOI : 10.1198/016214506000000735