Assessing linearity in high dimensions

Owen, Art B.

Owen, Art B.

Ann. Statist., Tome 28 (2000) no. 3, p. 1-19 / Harvested from Project Euclid

Résumé

Some standard numerical problems become intractable in high dimensions. Yet successes are often achieved in practice. This may be explained in terms of the underlying target function being somehow simpler than assumed in the intractability arguments. A prototypical form of simplicity is approximate linearity. In moderate dimensions, linearity can be investigated by linear regression. In very high dimensions, this becomes computationally inefficient and eventually infeasible, as the cost of regression for $n$ observations in $d$ dimensions grows as $nd^2$. This paper presents a quasi-regression method for determining the degree of linearity in a function, where the cost grows only as $nd$. A bias-corrected version of quasi-regression is able to estimate the degree of linearity with a sample size of order $d^{2 /3}$. An example is given of a function on $[0,1]^ {1,000,000}$, for which the amount of linear variability is accurately estimated from only 100,000 observations.

Publié le : 2000-02-14
Classification: Computer experiment, function mining, Monte Carlo, quasi- interpolation, quasi-regression, tractability., 65C05, 65D15, 41A58, 41A63.

@article{1016120362,
     author = {Owen, Art B.},
     title = {Assessing linearity in high dimensions},
     journal = {Ann. Statist.},
     volume = {28},
     number = {3},
     year = {2000},
     pages = { 1-19},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1016120362}
}

Owen, Art B. Assessing linearity in high dimensions. Ann. Statist., Tome 28 (2000) no. 3, pp.  1-19. http://gdmltest.u-ga.fr/item/1016120362/