This paper considers estimating a covariance matrix of p variables from n observations by either banding or tapering the sample covariance matrix, or estimating a banded version of the inverse of the covariance. We show that these estimates are consistent in the operator norm as long as (log p)/n→0, and obtain explicit rates. The results are uniform over some fairly natural well-conditioned families of covariance matrices. We also introduce an analogue of the Gaussian white noise model and show that if the population covariance is embeddable in that model and well-conditioned, then the banded approximations produce consistent estimates of the eigenvalues and associated eigenvectors of the covariance matrix. The results can be extended to smooth versions of banding and to non-Gaussian distributions with sufficiently short tails. A resampling approach is proposed for choosing the banding parameter in practice. This approach is illustrated numerically on both simulated and real data.
@article{1201877299,
author = {Bickel, Peter J. and Levina, Elizaveta},
title = {Regularized estimation of large covariance matrices},
journal = {Ann. Statist.},
volume = {36},
number = {1},
year = {2008},
pages = { 199-227},
language = {en},
url = {http://dml.mathdoc.fr/item/1201877299}
}
Bickel, Peter J.; Levina, Elizaveta. Regularized estimation of large covariance matrices. Ann. Statist., Tome 36 (2008) no. 1, pp. 199-227. http://gdmltest.u-ga.fr/item/1201877299/