The Dantzig selector and sparsity oracle inequalities

Koltchinskii, Vladimir

Bernoulli, Tome 15 (2009) no. 1, p. 799-828 / Harvested from Project Euclid

Résumé

Let ¶ Y_j=f_*(X_j)+ξ_j, j=1, …, n, ¶ where X, X₁, …, X_n are i.i.d. random variables in a measurable space $(S,\mathcal{A})$ with distribution Π and ξ, ξ₁, …, ξ_n are i.i.d. random variables with ${\mathbb{E}}\xi=0$ independent of (X₁, …, X_n). Given a dictionary h₁, …, h_N: S↦ℝ, let f_λ:=∑_j=1^Nλ_jh_j, λ=(λ₁, …, λ_N)∈ℝ^N. Given ɛ>0, define ¶ ̂Λ_ɛ:={λ∈ℝ^N: max_1≤k≤N|n⁻¹∑_j=1ⁿ(f_λ(X_j)−Y_j)h_k(X_j)|≤ɛ} ¶ and ¶ ̂λ:=̂λ^ɛ∈Argmin_{λ∈̂Λ_ɛ}‖λ‖_ℓ₁. ¶ In the case where f_*:=f_λ^*, λ^*∈ℝ^N, Candes and Tao [Ann. Statist. 35 (2007) 2313–2351] suggested using ̂λ as an estimator of λ^*. They called this estimator “the Dantzig selector”. We study the properties of f_̂λ as an estimator of f_* for regression models with random design, extending some of the results of Candes and Tao (and providing alternative proofs of these results).

Publié le : 2009-08-15
Classification: Dantzig selector, oracle inequalities, regression, sparsity

@article{1251463282,
     author = {Koltchinskii, Vladimir},
     title = {The Dantzig selector and sparsity oracle inequalities},
     journal = {Bernoulli},
     volume = {15},
     number = {1},
     year = {2009},
     pages = { 799-828},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1251463282}
}

Koltchinskii, Vladimir. The Dantzig selector and sparsity oracle inequalities. Bernoulli, Tome 15 (2009) no. 1, pp.  799-828. http://gdmltest.u-ga.fr/item/1251463282/