Operator norm consistent estimation of large-dimensional sparse covariance matrices
El Karoui, Noureddine
Ann. Statist., Tome 36 (2008) no. 1, p. 2717-2756 / Harvested from Project Euclid
Estimating covariance matrices is a problem of fundamental importance in multivariate statistics. In practice it is increasingly frequent to work with data matrices X of dimension n×p, where p and n are both large. Results from random matrix theory show very clearly that in this setting, standard estimators like the sample covariance matrix perform in general very poorly. ¶ In this “large n, large p” setting, it is sometimes the case that practitioners are willing to assume that many elements of the population covariance matrix are equal to 0, and hence this matrix is sparse. We develop an estimator to handle this situation. The estimator is shown to be consistent in operator norm, when, for instance, we have p≈n as n→∞. In other words the largest singular value of the difference between the estimator and the population covariance matrix goes to zero. This implies consistency of all the eigenvalues and consistency of eigenspaces associated to isolated eigenvalues. ¶ We also propose a notion of sparsity for matrices, that is, “compatible” with spectral analysis and is independent of the ordering of the variables.
Publié le : 2008-12-15
Classification:  Covariance matrices,  correlation matrices,  adjacency matrices,  eigenvalues of covariance matrices,  multivariate statistical analysis,  high-dimensional inference,  random matrix theory,  sparsity,  β-sparsity,  62H12
@article{1231165183,
     author = {El Karoui, Noureddine},
     title = {Operator norm consistent estimation of large-dimensional sparse covariance matrices},
     journal = {Ann. Statist.},
     volume = {36},
     number = {1},
     year = {2008},
     pages = { 2717-2756},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1231165183}
}
El Karoui, Noureddine. Operator norm consistent estimation of large-dimensional sparse covariance matrices. Ann. Statist., Tome 36 (2008) no. 1, pp.  2717-2756. http://gdmltest.u-ga.fr/item/1231165183/