A class of procedures based on "impartial trimming" (self-determined by the data) is introduced with the aim of robustifying k-means, hence the associated clustering analysis. We include a detailed study of optimal regions, showing that only nonpathological regions can arise from impartial trimming procedures. The asymptotic results provided in the paper focus on strong consistency of the suggested methods under widely general
conditions. A section is devoted to exploring the performance of the procedure to detect anomalous data in simulated data sets.
@article{1031833664,
author = {Cuesta-Albertos, J. A. and Gordaliza, A. and Matr\'an, C.},
title = {Trimmed $k$-means: an attempt to robustify quantizers},
journal = {Ann. Statist.},
volume = {25},
number = {6},
year = {1997},
pages = { 553-576},
language = {en},
url = {http://dml.mathdoc.fr/item/1031833664}
}
Cuesta-Albertos, J. A.; Gordaliza, A.; Matrán, C. Trimmed $k$-means: an attempt to robustify quantizers. Ann. Statist., Tome 25 (1997) no. 6, pp. 553-576. http://gdmltest.u-ga.fr/item/1031833664/