Estimating the population mean when some responses are missing
Lu, Jingsong ; Stanek III, Edward J. ; Puleo, Elaine
Braz. J. Probab. Stat., Tome 25 (2011) no. 1, p. 171-182 / Harvested from Project Euclid
We develop a design-based prediction approach to estimate the finite population mean in a simple setting where some responses are missing. The approach is based on indicator sampling random variables that operate on labeled units (subjects). We define missing data mechanisms that may depend on a subject, or on a selection (such as when the study design assigns groups of selected subjects to different interviewers). Using an approach usually reserved for model-based inference, we develop a predictor that equals the sample total divided by the expected sample size. The methods are based on best linear unbiased prediction in finite population mixed models. When the probability of missing is estimated from the sample, the empirical estimator simplifies to the mean of the realized nonmissing responses. The different missing data mechanisms are revealed by the notation that accounts for the labels and sample selections. The mean squared error (MSE) of the empirical estimator, counterintuitively, is smaller than the MSE if the probability of missing is known.
Publié le : 2011-07-15
Classification:  Simple random sampling,  missing data,  MCAR,  finite population,  best linear unbiased estimator (BLUE),  prediction
@article{1301577152,
     author = {Lu, Jingsong and Stanek III, Edward J. and Puleo, Elaine},
     title = {Estimating the population mean when some responses are missing},
     journal = {Braz. J. Probab. Stat.},
     volume = {25},
     number = {1},
     year = {2011},
     pages = { 171-182},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1301577152}
}
Lu, Jingsong; Stanek III, Edward J.; Puleo, Elaine. Estimating the population mean when some responses are missing. Braz. J. Probab. Stat., Tome 25 (2011) no. 1, pp.  171-182. http://gdmltest.u-ga.fr/item/1301577152/