Model Selection for CART Regression Trees
Gey, Servane ; Nédélec, Elodie
HAL, hal-00326549 / Harvested from HAL
This paper studies the performance of the Classification And Regression Trees (CART) pruning algorithm, followed by the final discrete selection by test sample, as a functional estimation procedure. The primary interest is the validation of the pruning procedure in Gaussian and bounded regression. The paper shows, on the one hand, that the complexity penalty used in the pruning algorithm is valid in both settings and, on the other hand, that, conditionally on the construction of the maximal tree, the final selection does not dramatically alter the estimation accuracy of the regression function. In both settings, the risk bounds, which are obtained through penalized model selection, validate the CART algorithm, which is used in many applications such as meteorology, biology, medicine, pollution, and image coding.
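The two steps named in the abstract, pruning with a complexity penalty and final selection by test sample, can be illustrated with a minimal sketch. It is not the authors' procedure or code: it assumes a one-dimensional regression problem, the classical weakest-link pruning rule with a penalty proportional to the number of leaves, and a single held-out sample for the final selection. All names (`grow`, `prune_and_select`, `min_leaf`, the simulated step-function data) are hypothetical and chosen only for illustration.

```python
import random

def sse(ys):
    """Sum of squared errors around the mean (0 for an empty list)."""
    if not ys:
        return 0.0
    m = sum(ys) / len(ys)
    return sum((y - m) ** 2 for y in ys)

def grow(pts, min_leaf=5):
    """Grow a maximal tree on (x, y) pairs by greedy SSE-minimising splits."""
    ys = [y for _, y in pts]
    node = {"mean": sum(ys) / len(ys), "sse": sse(ys), "left": None, "right": None}
    pts = sorted(pts)
    best = None
    for i in range(min_leaf, len(pts) - min_leaf + 1):
        if pts[i - 1][0] == pts[i][0]:
            continue  # cannot cut between identical x values
        cost = sse([y for _, y in pts[:i]]) + sse([y for _, y in pts[i:]])
        if best is None or cost < best[0]:
            best = (cost, i)
    if best is not None and best[0] < node["sse"]:
        i = best[1]
        node["cut"] = (pts[i - 1][0] + pts[i][0]) / 2
        node["left"] = grow(pts[:i], min_leaf)
        node["right"] = grow(pts[i:], min_leaf)
    return node

def is_leaf(n):
    return n["left"] is None or n.get("pruned", False)

def leaves_and_sse(n):
    """Number of leaves and training SSE of the (possibly pruned) subtree at n."""
    if is_leaf(n):
        return 1, n["sse"]
    kl, sl = leaves_and_sse(n["left"])
    kr, sr = leaves_and_sse(n["right"])
    return kl + kr, sl + sr

def internal_nodes(n):
    if is_leaf(n):
        return []
    return [n] + internal_nodes(n["left"]) + internal_nodes(n["right"])

def predict(n, x):
    while not is_leaf(n):
        n = n["left"] if x <= n["cut"] else n["right"]
    return n["mean"]

def prune_and_select(root, test):
    """Walk a weakest-link pruning sequence; record (leaves, test MSE) per subtree.

    Simplification: one node is collapsed per step, whereas classical CART
    collapses all nodes tied at the minimum together.
    """
    def test_mse():
        return sum((y - predict(root, x)) ** 2 for x, y in test) / len(test)

    def gain_per_leaf(n):
        k, s = leaves_and_sse(n)
        return (n["sse"] - s) / (k - 1)  # error increase per removed leaf

    results = [(leaves_and_sse(root)[0], test_mse())]
    while not is_leaf(root):
        weakest = min(internal_nodes(root), key=gain_per_leaf)
        weakest["pruned"] = True
        results.append((leaves_and_sse(root)[0], test_mse()))
    return results

# Simulated data (an assumption): a step function plus Gaussian noise.
rng = random.Random(0)

def sample(n):
    out = []
    for _ in range(n):
        x = rng.random()
        out.append((x, (1.0 if x > 0.5 else 0.0) + rng.gauss(0, 0.3)))
    return out

train, test = sample(150), sample(150)
root = grow(train)
seq = prune_and_select(root, test)
best_leaves, best_mse = min(seq, key=lambda r: r[1])
print("selected number of leaves:", best_leaves, "test MSE:", best_mse)
```

The sequence `seq` runs from the maximal tree down to the root, and the final selection simply keeps the subtree with the smallest held-out error. The sketch grows and prunes on the same sample; other ways of splitting the data between the growing, pruning, and selection steps are possible.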
Published: 2005-02-01
Classification: Gaussian Regression, Bounded Regression, CART, Pruning, Model Selection, 62G08; 62J02; 62-07, [MATH.MATH-ST] Mathematics [math]/Statistics [math.ST]
@article{hal-00326549,
     author = {Gey, Servane and N\'ed\'elec, Elodie},
     title = {Model Selection for CART Regression Trees},
     journal = {HAL},
     volume = {2005},
     number = {0},
     year = {2005},
     language = {en},
     url = {http://dml.mathdoc.fr/item/hal-00326549}
}
Gey, Servane; Nédélec, Elodie. Model Selection for CART Regression Trees. HAL, Volume 2005 (2005), no. 0. http://gdmltest.u-ga.fr/item/hal-00326549/