A nested unsupervised approach to identifying novel molecular subtypes
Garrett, Elizabeth S. ; Parmigiani, Giovanni
Bernoulli, Volume 10 (2004) no. 2, pp. 951-969 / Harvested from Project Euclid
In classification problems arising in genomics research it is common to study populations for which a broad class assignment is known (say, normal versus diseased) and one seeks undiscovered subclasses within one or both of the known classes. Formally, this problem can be thought of as an unsupervised analysis nested within a supervised one. Here we take the view that the nested unsupervised analysis can successfully utilize information from the entire data set for constructing and/or selecting useful predictors. Specifically, we propose a mixture model approach to the nested unsupervised problem, where the supervised information is used to develop latent classes which are in turn used for data mining and robust unsupervised analysis. Our solution is illustrated using data on molecular classification of lung adenocarcinoma.
Published: 2004-12-14
Classification: Bayesian model, class discovery, gene expression, lung cancer
@article{1106314845,
     author = {Garrett, Elizabeth S. and Parmigiani, Giovanni},
     title = {A nested unsupervised approach to identifying novel molecular subtypes},
     journal = {Bernoulli},
     volume = {10},
     number = {2},
     year = {2004},
     pages = {951--969},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1106314845}
}
Garrett, Elizabeth S.; Parmigiani, Giovanni. A nested unsupervised approach to identifying novel molecular subtypes. Bernoulli, Volume 10 (2004) no. 2, pp. 951-969. http://gdmltest.u-ga.fr/item/1106314845/