Gaussian model selection

  • Pascal Massart

    Université Paris Sud, Orsay, France
  • Lucien Birgé

    Université Paris VI, France

Abstract

Our purpose in this paper is to provide a general approach to model selection via penalization for Gaussian regression and to develop our point of view about this subject. The advantage and importance of model selection come from the fact that it provides a suitable approach to many different types of problems, starting from model selection per se (among a family of parametric models, which one is more suitable for the data at hand), which includes for instance variable selection in regression models, to nonparametric estimation, for which it provides a very powerful tool that allows adaptation under quite general circumstances. Our approach to model selection also provides a natural connection between the parametric and nonparametric points of view and copes naturally with the fact that a model is not necessarily true. The method is based on the penalization of a least squares criterion which can be viewed as a generalization of Mallows' Cp. A large part of our efforts will be put on choosing properly the list of models and the penalty function for various estimation problems like classical variable selection or adaptive estimation for various types of lp-bodies.

Cite this article

Pascal Massart, Lucien Birgé, Gaussian model selection. J. Eur. Math. Soc. 3 (2001), no. 3, pp. 203–268

DOI 10.1007/S100970100031