Estimating high-dimensional intervention effects from observational data
Maathuis, Marloes H. ; Kalisch, Markus ; Bühlmann, Peter
Ann. Statist., Tome 37 (2009) no. 1, p. 3133-3164 / Harvested from Project Euclid
We assume that we have observational data generated from an unknown underlying directed acyclic graph (DAG) model. A DAG is typically not identifiable from observational data, but it is possible to consistently estimate the equivalence class of a DAG. Moreover, for any given DAG, causal effects can be estimated using intervention calculus. In this paper, we combine these two parts. For each DAG in the estimated equivalence class, we use intervention calculus to estimate the causal effects of the covariates on the response. This yields a collection of estimated causal effects for each covariate. We show that the distinct values in this set can be consistently estimated by an algorithm that uses only local information of the graph. This local approach is computationally fast and feasible in high-dimensional problems. We propose to use summary measures of the set of possible causal effects to determine variable importance. In particular, we use the minimum absolute value of this set, since that is a lower bound on the size of the causal effect. We demonstrate the merits of our methods in a simulation study and on a data set about riboflavin production.
Publié le : 2009-12-15
Classification:  Causal analysis,  directed acyclic graph (DAG),  graphical modeling,  intervention calculus,  PC-algorithm,  sparsity,  62-09,  62H99
@article{1250515382,
     author = {Maathuis, Marloes H. and Kalisch, Markus and B\"uhlmann, Peter},
     title = {Estimating high-dimensional intervention effects from observational data},
     journal = {Ann. Statist.},
     volume = {37},
     number = {1},
     year = {2009},
     pages = { 3133-3164},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1250515382}
}
Maathuis, Marloes H.; Kalisch, Markus; Bühlmann, Peter. Estimating high-dimensional intervention effects from observational data. Ann. Statist., Tome 37 (2009) no. 1, pp.  3133-3164. http://gdmltest.u-ga.fr/item/1250515382/