A. Boisbunon, S. Canu, and D. Fourdrinier, Sélection de variables dans le modèle linéaire, Report for the midterm review of project ClasSel.

References

H. Akaike, Statistical predictor identification, Annals of the Institute of Statistical Mathematics, vol.22, issue.1, pp.203-217, 1970.

H. Akaike, Information theory and an extension of the maximum likelihood principle, Second International Symposium on Information Theory, pp.267-281, 1973.

H. Akaike, A new look at the statistical model identification, IEEE Transactions on Automatic Control, vol.19, issue.6, pp.716-723, 1974.
DOI : 10.1109/TAC.1974.1100705

D. Allen, The Relationship Between Variable Selection and Data Augmentation and a Method for Prediction, Technometrics, vol.23, issue.3, pp.125-127, 1974.
DOI : 10.2307/1267352

L. An and P. D. Tao, The DC (Difference of Convex Functions) Programming and DCA Revisited with DC Models of Real World Nonconvex Optimization Problems, Annals of Operations Research, vol.6, issue.1, pp.23-46, 2005.
DOI : 10.1007/s10479-004-5022-1

D. F. Andrews and C. L. Mallows, Scale mixtures of normal distributions, Journal of the Royal Statistical Society. Series B (Methodological), vol.36, pp.99-102, 1974.

S. Arlot and F. Bach, Data-driven calibration of linear estimators with minimal penalties, Advances in Neural Information Processing Systems 22, pp.46-54, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00414774

S. Arlot and A. Celisse, A survey of cross-validation procedures for model selection, Statistics Surveys, vol.4, issue.0, pp.40-79, 2010.
DOI : 10.1214/09-SS054
URL : https://hal.archives-ouvertes.fr/hal-00407906

S. Arlot and P. Massart, Data-driven calibration of penalties for least-squares regression, Journal of Machine Learning Research, vol.10, pp.245-279, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00287631

Bach, Optimization for Machine Learning, Chapter Convex optimization with sparsity-inducing norms, pp.19-54, 2011.

Y. Baraud, C. Giraud, and S. Huet, Gaussian model selection with an unknown variance, The Annals of Statistics, vol.37, issue.2, pp.630-672, 2009.
DOI : 10.1214/07-AOS573
URL : https://hal.archives-ouvertes.fr/hal-00756074

A. Barron, Universal approximation bounds for superpositions of a sigmoidal function, IEEE Transactions on Information Theory, vol.39, issue.3, pp.291-319, 1993.
DOI : 10.1109/18.256500

A. Barron, Approximation and estimation bounds for artificial neural networks, Machine Learning, pp.115-133, 1994.

Bartlett, Model Selection and Error Estimation, Machine Learning, pp.85-113, 2002.
DOI : 10.2139/ssrn.248567

R. Bendel and A. A. Afifi, Comparison of stopping rules in forward "stepwise" regression, Journal of the American Statistical Association, vol.72, pp.46-53, 1977.

Bennett, Model selection via bilevel optimization, International Joint Conference on Neural Networks (IJCNN'06), pp.1922-1929, 2006.

Bennett, Bilevel Optimization and Machine Learning, Computational Intelligence: Research Frontiers, pp.25-47, 2008.
DOI : 10.1007/978-3-540-68860-0_2
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.224.7503

J. Berger, Statistical Decision Theory and Bayesian Analysis, pp.15-18, 1985.
DOI : 10.1007/978-1-4757-4286-2

J. Berk, Necessary Conditions for the CAPM, Journal of Economic Theory, vol.73, issue.1, pp.245-257, 1997.
DOI : 10.1006/jeth.1996.2218

Bertsekas, Convex Analysis and Optimization, Athena Scientific optimization and computation series, Athena Scientific, 2003.

C. Biernacki, Choix de modèles en classification, 1997.

L. Birgé and P. Massart, Gaussian model selection, Journal of the European Mathematical Society, vol.3, issue.3, pp.203-268, 2001.
DOI : 10.1007/s100970100031

L. Birgé and P. Massart, Minimal penalties for Gaussian model selection, Probability Theory and Related Fields, pp.33-73, 2007.

Boucheron, Theory of Classification: a Survey of Some Recent Advances, ESAIM: Probability and Statistics, vol.9, pp.323-375, 2005.
DOI : 10.1051/ps:2005018
URL : https://hal.archives-ouvertes.fr/hal-00017923

S. Boyd and L. Vandenberghe, Convex Optimization, 2004.

H. Bozdogan, Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions, Psychometrika, vol.10, issue.2, pp.345-370, 1987.
DOI : 10.1007/BF02294361

H. Bozdogan, Mixture-Model Cluster Analysis Using Model Selection Criteria and a New Informational Measure of Complexity, Proceedings of the First US/Japan Conference on the Frontiers of Statistical Modeling, pp.69-113, 1994.
DOI : 10.1007/978-94-011-0800-3_3

H. Bozdogan, Akaike's Information Criterion and Recent Developments in Information Complexity, Journal of Mathematical Psychology, vol.44, issue.1, pp.62-91, 2000.
DOI : 10.1006/jmps.1999.1277

A. Brandwein and W. E. Strawderman, Generalizations of James-Stein Estimators Under Spherical Symmetry, The Annals of Statistics, vol.19, issue.3, pp.1639-1650, 1991.
DOI : 10.1214/aos/1176348267

P. Breheny and J. Huang, Coordinate descent algorithms for nonconvex penalized regression, with applications to biological feature selection, The Annals of Applied Statistics, vol.5, issue.1, p.232, 2011.
DOI : 10.1214/10-AOAS388

L. Breiman and J. H. Friedman, Estimating Optimal Transformations for Multiple Regression and Correlation, Journal of the American Statistical Association, vol.41, issue.391, pp.580-598, 1985.
DOI : 10.1080/01621459.1985.10478157

L. Breiman, Better Subset Regression Using the Nonnegative Garrote, Technometrics, vol.37, issue.4, pp.373-384, 1995.
DOI : 10.1080/01621459.1980.10477428

L. Breiman, Heuristics of instability and stabilization in model selection, The Annals of Statistics, vol.24, issue.6, pp.2350-2383, 1996.
DOI : 10.1214/aos/1032181158

L. Brown, Fundamentals of Statistical Exponential Families: With Applications in Statistical Decision Theory, Lecture notes-monograph series, 1986.

A. Bruce and H. Y. Gao, Understanding WaveShrink: variance and bias estimation, Biometrika, vol.83, issue.4, pp.727-745, 1996.
DOI : 10.1093/biomet/83.4.727

M. H. Wegkamp, Two-stage model selection procedures in partially linear regression, Canadian Journal of Statistics, vol.32, issue.2, pp.105-118, 2004.

K. Burnham and D. R. Anderson, Model Selection and Multimodel Inference: a Practical Information-Theoretic Approach, pp.29-37, 2002.
DOI : 10.1007/b97636

C. Caillerie and B. Michel, Model Selection for Simplicial Approximation, Foundations of Computational Mathematics, vol.33, issue.2, 2009.
DOI : 10.1007/s10208-011-9103-7
URL : https://hal.archives-ouvertes.fr/inria-00402091

E. Candès, Modern statistical estimation via oracle inequalities, Acta Numerica, vol.15, pp.257-326, 2006.
DOI : 10.1017/S0962492906230010

Chaslot, Monte-Carlo Tree Search: A new framework for game AI, Proceedings of the Fourth Artificial Intelligence and Interactive Digital Entertainment Conference, pp.216-217, 2008.

V. Cherkassky and Y. Ma, Comparison of Model Selection for Regression, Neural Computation, vol.15, issue.7, pp.1691-1714, 2003.
DOI : 10.1162/neco.1994.6.5.851

V. Cherkassky and F. Mulier, Learning from data: Concepts, Theory, and Methods, pp.12-14, 1998.
DOI : 10.1002/9780470140529

Cherkassky, Model complexity control for regression using VC generalization bounds, IEEE Transactions on Neural Networks, vol.10, issue.5, pp.1075-1089, 1999.
DOI : 10.1109/72.788648

M. Chmielewski, Elliptically Symmetric Distributions: A Review and Bibliography, International Statistical Review / Revue Internationale de Statistique, vol.49, issue.1, pp.67-74, 1981.
DOI : 10.2307/1403038

G. Claeskens and N. L. Hjort, Model Selection and Model Averaging, Cambridge Series on Statistical and Probabilistic Mathematics, 2008.

F. Clarke, Optimization and nonsmooth analysis, Classics in Applied Mathematics, Society for Industrial and Applied Mathematics, vol.5, 1990.
DOI : 10.1137/1.9781611971309

D. Donoho and I. M. Johnstone, Ideal spatial adaptation by wavelet shrinkage, Biometrika, vol.81, issue.3, p.425, 1994.
DOI : 10.1093/biomet/81.3.425

Dornhege, Toward brain-computer interfacing, 2007.

M. Kierczak, J. Koronacki, and J. Komorowski, Monte Carlo feature selection and interdependency discovery in supervised classification, Advances in Machine Learning II, pp.371-385, 2010.

J. Du and C. Ma, Spherically Invariant Vector Random Fields in Space and Time, IEEE Transactions on Signal Processing, vol.59, issue.12, pp.5921-5929, 2011.
DOI : 10.1109/TSP.2011.2166391

B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani, Least angle regression (with discussions and authors' reply), Annals of Statistics, vol.32, issue.42, pp.407-451, 2004.

B. Efron, How Biased is the Apparent Error Rate of a Prediction Rule?, Journal of the American Statistical Association, vol.39, issue.394, pp.461-470, 1986.
DOI : 10.1080/01621459.1986.10478291

J. Fan and K. Fang, Inadmissibility of sample mean and regression coefficients for elliptically contoured distributions, Northeastern Mathematical Journal, vol.1, pp.68-81, 1985.

J. Fan and R. Li, Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties, Journal of the American Statistical Association, vol.96, issue.456, pp.1348-1360, 2001.
DOI : 10.1198/016214501753382273

Y. Fan and C. Y. Tang, Tuning parameter selection in high dimensional penalized likelihood, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.36, issue.3, 2012.
DOI : 10.1111/rssb.12001

Fang, Symmetric Multivariate and Related Distributions, Monographs on Statistics and Applied Probability, Chapman & Hall/CRC, pp.72-77, 1989.
DOI : 10.1007/978-1-4899-2937-2

W. Feller, An Introduction to Probability Theory and its Applications, Series in Probability and Mathematical Statistics, 1966.

Févotte, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009.
DOI : 10.1016/j.sigpro.2007.01.024

R. Flamary, Apprentissage statistique pour le signal: applications aux interfaces cerveau-machine, 2011.
URL : https://hal.archives-ouvertes.fr/tel-00687501

D. Foster and E. I. George, The Risk Inflation Criterion for Multiple Regression, The Annals of Statistics, vol.22, issue.4, pp.1947-1975, 1994.
DOI : 10.1214/aos/1176325766

D. Fourdrinier and W. E. Strawderman, On Bayes and unbiased estimators of loss, Annals of the Institute of Statistical Mathematics, vol.9, issue.2, pp.803-816, 2003.
DOI : 10.1007/BF02523394

D. Fourdrinier and W. E. Strawderman, Generalized Bayes minimax estimators of location vectors for spherically symmetric distributions, Journal of Multivariate Analysis, vol.99, issue.4, pp.735-750, 2008.
DOI : 10.1016/j.jmva.2007.03.007

D. Fourdrinier and W. E. Strawderman, Robust generalized Bayes minimax estimators of location vectors for spherically symmetric distribution with unknown scale, Borrowing Strength: Theory Powering Applications-A Festschrift for, pp.249-262, 2010.

D. Fourdrinier and M. Wells, Comparaisons de procédures de sélection d'un modèle de régression: une approche décisionnelle, Comptes rendus de l'Académie des sciences, Série 1, Mathématique, pp.865-870, 1994.

D. Fourdrinier and M. T. Wells, Estimation of a Loss Function for Spherically Symmetric Distributions in the General Linear Model, The Annals of Statistics, vol.23, issue.2, pp.571-592, 1995.
DOI : 10.1214/aos/1176324536

D. Fourdrinier and M. T. Wells, Loss Estimation for Spherically Symmetrical Distributions, Journal of Multivariate Analysis, vol.53, issue.2, pp.311-331, 1995.
DOI : 10.1006/jmva.1995.1039

D. Fourdrinier and M. T. Wells, On Improved Loss Estimation for Shrinkage Estimators, Statistical Science, vol.27, issue.1, pp.61-81, 2012.
DOI : 10.1214/11-STS380

Fourdrinier, Shrinkage estimation, pp.58-118, 2012.

J. Friedman, From Statistics to Neural Networks: Theory and Pattern Recognition Applications, Chapter An overview of predictive learning and function approximation, pp.1-61, 1994.

G. Gasso, A. Rakotomamonjy, and S. Canu, Recovering Sparse Signals With a Certain Family of Nonconvex Penalties and DC Programming, IEEE Transactions on Signal Processing, vol.57, issue.12, pp.4686-4698, 2009.
DOI : 10.1109/TSP.2009.2026004
URL : https://hal.archives-ouvertes.fr/hal-00439453

R. Gaudel and M. Sebag, Feature selection as a one-player game, International Conference on Machine Learning, pp.359-366, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00484049

S. Geisser, The Predictive Sample Reuse Method with Applications, Journal of the American Statistical Association, vol.36, issue.2, pp.320-328, 1975.
DOI : 10.1080/01621459.1975.10479865

M. Genton, The correlation structure of Matheron's classical variogram estimator under elliptically contoured distributions, Mathematical Geology, vol.32, issue.1, pp.127-137, 2000.
DOI : 10.1023/A:1007511019496

E. George and D. P. Foster, Calibration and empirical Bayes variable selection, Biometrika, vol.87, issue.4, pp.731-747, 2000.
DOI : 10.1093/biomet/87.4.731
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.18.3731

E. George, The Variable Selection Problem, Journal of the American Statistical Association, vol.7, issue.2, pp.1304-1308, 2000.
DOI : 10.1214/aos/1176349027

Gneiting, Statistical Methods for Spatio-Temporal Systems, Chapter Geostatistical space-time models, stationarity, separability, and full symmetry, Monographs on Statistics and Applied Probability, pp.151-175, 2007.

Golub, Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter, Technometrics, vol.5, issue.2, pp.215-223, 1979.
DOI : 10.1080/03610927508827223

G. Govaert and M. Nadif, Latent Block Model for Contingency Table, Communications in Statistics - Theory and Methods, vol.24, issue.3, pp.416-425, 2010.
DOI : 10.1016/j.csda.2007.09.007
URL : https://hal.archives-ouvertes.fr/hal-00447792

A. Gupta and T. Varga, Elliptically Contoured Models in Statistics, 1993.
DOI : 10.1007/978-94-011-1646-6

Guyon, Model selection: Beyond the Bayesian/frequentist divide, Journal of Machine Learning Research, vol.11, pp.61-87, 2010.

I. Guyon, Machine learning summer school, Chapter A practical guide to model selection, 2009.

C. Hafner and J. V. Rombouts, Semiparametric Multivariate Volatility Models, Econometric Theory, vol.52, issue.02, pp.251-280, 2007.
DOI : 10.1016/j.jimonfin.2006.04.006

W. Hager, Updating the Inverse of a Matrix, SIAM Review, vol.31, issue.2, pp.221-239, 1989.
DOI : 10.1137/1031049

E. Hannan and B. G. Quinn, The determination of the order of an autoregression, Journal of the Royal Statistical Society. Series B (Methodological), vol.41, issue.2, pp.190-195, 1979.

W. Hare and C. Sagastizábal, Computing proximal points of nonconvex functions, Mathematical Programming, pp.221-258, 2009.
DOI : 10.1007/s10107-007-0124-6

T. Hastie and R. J. Tibshirani, Generalized Additive Models, Chapman & Hall/CRC, 1990.

Hastie, The Elements of Statistical Learning: Data Mining, Inference and Prediction, 2005.

Hastie, The Elements of Statistical Learning: Data Mining, Inference and Prediction, pp.14-18, 2008.

R. Hocking, A Biometrics Invited Paper. The Analysis and Selection of Variables in Linear Regression, Biometrics, vol.32, issue.1, pp.1-49, 1976.
DOI : 10.2307/2529336

A. Hoerl and R. W. Kennard, Ridge Regression: Applications to Nonorthogonal Problems, Technometrics, vol.3, issue.1, pp.69-82, 1970.
DOI : 10.2307/1266192

P. Huber, Robust Estimation of a Location Parameter, The Annals of Mathematical Statistics, vol.35, issue.1, pp.73-101, 1964.
DOI : 10.1214/aoms/1177703732

P. Huber, A Survey of Statistical Design and Linear Models, Chapter Robustness and designs, pp.287-303, 1975.

P. Huber, Robust Statistics, volume 67 of Wiley series in probability and mathematical statistics, 1981.

C. M. Hurvich and C. L. Tsai, Regression and time series model selection in small samples, Biometrika, vol.76, issue.2, pp.297-307, 1989.
DOI : 10.1093/biomet/76.2.297

C. M. Hurvich and C. L. Tsai, Bias of the corrected AIC criterion for underfitted regression and time series models, Biometrika, vol.78, issue.3, pp.499-509, 1991.
DOI : 10.1093/biomet/78.3.499

C. M. Hurvich and C. L. Tsai, A Corrected Akaike Information Criterion for Vector Autoregressive Model Selection, Journal of Time Series Analysis, vol.6, issue.3, pp.271-279, 1993.
DOI : 10.1214/aos/1176344897

Hurvich, Improved estimators of Kullback-Leibler information for autoregressive model selection in small samples, Biometrika, vol.77, issue.4, pp.709-719, 1990.

W. James and C. Stein, Estimation with Quadratic Loss, Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability: Held at the Statistical Laboratory, p.361, 1960.
DOI : 10.1007/978-1-4612-0919-5_30

I. Johnstone, On inadmissibility of some unbiased estimates of loss, Statistical Decision Theory and Related Topics, pp.361-379, 1988.

T. Kariya and B. K. Sinha, Robustness of Statistical Tests, pp.72-73, 1989.

S. Kotz and S. Nadarajah, Multivariate t Distributions and their Applications, 2004.

Kotz, The Laplace Distribution and Generalizations: A Revisit with Applications to Communications, Economics, Engineering, and Finance, issue.183, 2001.
DOI : 10.1007/978-1-4612-0173-1

E. Lebarbier and T. Mary-Huard, Une Introduction au Critère BIC: Fondements Théoriques et Interprétation, Journal de la Société française de statistique, pp.39-57, 2006.

H. Leeb and B. M. Pötscher, Model Selection and Inference: Facts and Fiction, Econometric Theory, vol.307, issue.01, pp.21-59, 2005.
DOI : 10.1017/S0266466603191050
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.216.7997

C. Lele, Admissibility Results in Loss Estimation, The Annals of Statistics, vol.21, issue.1, pp.378-390, 1993.
DOI : 10.1214/aos/1176349031

C. Leng, Y. Lin, and G. Wahba, A note on the lasso and related procedures in model selection, Statistica Sinica, vol.16, issue.132, pp.1273-1316, 2006.

K. Li, From Stein's unbiased risk estimates to the method of generalized cross validation, Annals of Statistics, vol.13, issue.4, pp.1352-1377, 1985.

J. Lindsey and B. Jones, Modeling Pharmacokinetic Data Using Heavy-Tailed Multivariate Distributions, Journal of Biopharmaceutical Statistics, vol.46, issue.3, pp.369-381, 2000.
DOI : 10.1007/BF01113502

K. L. Lu and J. O. Berger, Estimation of Normal Means: Frequentist Estimation of Loss, The Annals of Statistics, vol.17, issue.2, pp.890-906, 1989.
DOI : 10.1214/aos/1176347149

J. Mairal and B. Yu, Complexity analysis of the lasso regularization path, Proceedings of the 29th International Conference on Machine Learning, 2012.

Y. Maruyama and E. I. George, Fully Bayes Factors with a Generalized g-prior, The Annals of Statistics, pp.2740-2765, 2011.

P. Massart, Concentration Inequalities and Model Selection: École d'Été de Probabilités de Saint-Flour XXXIII-2003, No. 1896, pp.14-33, 2007.

C. Maugis and B. Michel, Data-driven penalty calibration: A case study for Gaussian mixture model selection, ESAIM: Probability and Statistics, pp.320-339, 2011.
DOI : 10.1051/ps/2010002
URL : https://hal.archives-ouvertes.fr/hal-00666813

J. Maxwell, Illustrations of the dynamical theory of gases. Part I. On the motions and collisions of perfectly elastic spheres, The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, vol.19, issue.124, pp.19-32, 1860.

A. McQuarrie and C. L. Tsai, Regression and Time Series Model Selection, World Scientific, 1998.
DOI : 10.1142/3573

M. Meyer and M. Woodroofe, On the degrees of freedom in shape-restricted regression, Annals of Statistics, vol.28, issue.4, pp.1083-1104, 2000.

M. Nadif and G. Govaert, Clustering for binary data and mixture models: choice of the models, Applied stochastic models and data analysis, pp.269-278, 1998.

P. Niyogi and F. Girosi, On the Relationship between Generalization Error, Hypothesis Complexity, and Sample Complexity for Radial Basis Functions, Neural Computation, vol.4, issue.4, pp.819-842, 1996.
DOI : 10.1137/1116025

J. Nocedal and S. J. Wright, Numerical optimization, 1999.
DOI : 10.1007/b98874

E. Pchelintsev, Improved estimation in a non-Gaussian parametric regression, Statistical Inference for Stochastic Processes, vol.62, issue.6, 2011.
DOI : 10.1007/s11203-013-9075-0
URL : https://hal.archives-ouvertes.fr/hal-00627557

Pinson, Optimizing benefits from wind power participation in electricity market using advanced tools for wind power forecasting and uncertainty assessment, Proceedings of the 2004 European Wind Energy Conference, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00529464

J. M. Poggi and B. Portier, PM10 forecasting using clusterwise regression, Atmospheric Environment, vol.45, issue.38, pp.7005-7014, 2011.
DOI : 10.1016/j.atmosenv.2011.09.016
URL : https://hal.archives-ouvertes.fr/hal-00942963

Rawlings, Applied Regression Analysis: A Research Tool, 1998.
DOI : 10.1007/b98890

M. Fazel and P. A. Parrilo, Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization, SIAM Review, vol.52, issue.3, pp.471-501

A. Rukhin, Improved Estimation in Lognormal Models, Journal of the American Statistical Association, vol.66, issue.396, pp.1046-1049, 1986.
DOI : 10.1080/01621459.1986.10478371

A. Rukhin, Estimated Loss and Admissible Loss Estimators, Statistical Decision Theory and Related Topics IV, p.409, 1988.

A. Rukhin, Loss Functions for Loss Estimation, The Annals of Statistics, vol.16, issue.3, pp.1262-1269, 1988.
DOI : 10.1214/aos/1176350960

E. Sandved, Ancillary Statistics and Estimation of the Loss in Estimation Problems, Annals of Mathematical Statistics, vol.39, issue.5, pp.1756-1758, 1968.

J. Shao, An asymptotic theory for linear model selection, Statistica Sinica, vol.7, pp.221-242, 1997.

R. Shibata, Asymptotic mean efficiency of a selection of regression variables, Annals of the Institute of Statistical Mathematics, vol.68, issue.1, pp.415-423, 1983.
DOI : 10.1007/BF02480998

C. Stein, Inadmissibility of the usual estimator for the mean of a multivariate normal distribution, Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, pp.197-206, 1955.

C. Stein, Estimation of the Mean of a Multivariate Normal Distribution, The Annals of Statistics, vol.9, issue.6, pp.1135-1151, 1981.
DOI : 10.1214/aos/1176345632

I. Steinwart, How to Compare Different Loss Functions and Their Risks, Constructive Approximation, vol.26, issue.2, pp.225-287, 2007.
DOI : 10.1007/s00365-006-0662-3

K. Takeuchi, Distribution of informational statistics and a criterion of model fitting, Suri-Kagaku (Mathematical Sciences), vol.153, pp.12-18, 1976.

R. Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society. Series B (Methodological), vol.58, issue.1, pp.267-288, 1996.

V. Vapnik and A. Y. Chervonenkis, On uniform convergence of the frequencies of events to their probabilities, Teoriya veroyatnostei i ee primeneniya, pp.264-279, 1971.

V. Vapnik, Statistical Learning Theory, pp.24-85, 1998.

A. Wald, Contributions to the Theory of Statistical Estimation and Testing Hypotheses, The Annals of Mathematical Statistics, vol.10, issue.4, pp.299-326, 1939.
DOI : 10.1214/aoms/1177732144

C. Zhang, Nearly unbiased variable selection under minimax concave penalty, The Annals of Statistics, vol.38, issue.2, pp.894-942, 2010.
DOI : 10.1214/09-AOS729
URL : http://arxiv.org/abs/1002.4734

P. Zhao and B. Yu, On model selection consistency of Lasso, Journal of Machine Learning Research, vol.7, issue.2, p.2541, 2007.

H. Zou and T. J. Hastie, Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.5, issue.2, pp.301-320, 2005.
DOI : 10.1073/pnas.201162998

H. Zou and H. H. Zhang, On the adaptive elastic-net with a diverging number of parameters, The Annals of Statistics, vol.37, issue.4, p.1733, 2009.
DOI : 10.1214/08-AOS625

Zou, On the "degrees of freedom" of the lasso, The Annals of Statistics, vol.35, issue.5, pp.2173-2192, 2007.
DOI : 10.1214/009053607000000127

H. Zou, The Adaptive Lasso and Its Oracle Properties, Journal of the American Statistical Association, vol.101, issue.476, pp.1418-1429, 2006.
DOI : 10.1198/016214506000000735