A two-sample test for high-dimensional data with applications to gene-set testing
Chen, Song Xi ; Qin, Ying-Li
Ann. Statist., Tome 38 (2010) no. 1, p. 808-835 / Harvested from Project Euclid
We propose a two-sample test for the means of high-dimensional data when the data dimension is much larger than the sample size. Hotelling’s classical T2 test does not work for this “large p, small n” situation. The proposed test does not require explicit conditions in the relationship between the data dimension and sample size. This offers much flexibility in analyzing high-dimensional data. An application of the proposed test is in testing significance for sets of genes which we demonstrate in an empirical study on a leukemia data set.
Publié le : 2010-04-15
Classification:  High dimension,  gene-set testing,  large p small n,  martingale central limit theorem,  multiple comparison,  62H15,  60K35,  62G10
@article{1266586615,
     author = {Chen, Song Xi and Qin, Ying-Li},
     title = {A two-sample test for high-dimensional data with applications to gene-set testing},
     journal = {Ann. Statist.},
     volume = {38},
     number = {1},
     year = {2010},
     pages = { 808-835},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1266586615}
}
Chen, Song Xi; Qin, Ying-Li. A two-sample test for high-dimensional data with applications to gene-set testing. Ann. Statist., Tome 38 (2010) no. 1, pp.  808-835. http://gdmltest.u-ga.fr/item/1266586615/