We consider the problem of estimating the conditional mean of a real Gaussian variable Y=∑i=1pθiXi+ɛ where the vector of the covariates (Xi)1≤i≤p follows a joint Gaussian distribution. This issue often occurs when one aims at estimating the graph or the distribution of a Gaussian graphical model. We introduce a general model selection procedure which is based on the minimization of a penalized least squares type criterion. It handles a variety of problems such as ordered and complete variable selection, allows to incorporate some prior knowledge on the model and applies when the number of covariates p is larger than the number of observations n. Moreover, it is shown to achieve a non-asymptotic oracle inequality independently of the correlation structure of the covariates. We also exhibit various minimax rates of estimation in the considered framework and hence derive adaptivity properties of our procedure.
Publié le : 2010-05-15
Classification:
Model selection,
Linear regression,
Oracle inequalities,
Gaussian graphical models,
Minimax rates of estimation,
62J05,
62G08
@article{1273584132,
author = {Verzelen, Nicolas},
title = {High-dimensional Gaussian model selection on a Gaussian design},
journal = {Ann. Inst. H. Poincar\'e Probab. Statist.},
volume = {46},
number = {1},
year = {2010},
pages = { 480-524},
language = {en},
url = {http://dml.mathdoc.fr/item/1273584132}
}
Verzelen, Nicolas. High-dimensional Gaussian model selection on a Gaussian design. Ann. Inst. H. Poincaré Probab. Statist., Tome 46 (2010) no. 1, pp. 480-524. http://gdmltest.u-ga.fr/item/1273584132/