The Bayesian Information Criterion (BIC) estimates the order of a
Markov chain (with finite alphabet $A$) from observation of a sample path $x_1,
x_2,\dots, x_n$, as the value $k = \hat{k}$ that minimizes the sum of the
negative logarithm of the $k$th order maximum likelihood and the penalty term
$\frac{|A|^k(|A|-1)}{2}\log n$. We show that $\hat{k}$ equals the correct order
of the chain, eventually almost surely as $n \rightarrow \infty$, thereby
strengthening earlier consistency results that assumed an a priori bound on the
order. A key tool is a strong ratio-typicality result for Markov sample
paths. We also show that the Bayesian estimator, or minimum description length
estimator, of which the BIC estimator is regarded as an approximation, fails to
be consistent for the uniformly distributed i.i.d. process.
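For concreteness, the following is a minimal Python sketch of the criterion
defined above: for each candidate order $k$ it computes the negative logarithm
of the $k$th order maximum likelihood of the sample path (conditioned on the
first $k$ symbols) plus the penalty $\frac{|A|^k(|A|-1)}{2}\log n$, and returns
the minimizing $k$. The function name \texttt{bic\_order} and the finite scan
range \texttt{k\_max} are illustrative choices, not part of the paper; indeed,
the result stated above is precisely that consistency holds without an a priori
bound on the order, so the cap here is only a practical device.

\begin{verbatim}
import math
from collections import Counter

def bic_order(x, alphabet_size, k_max):
    """BIC order estimate for a finite-alphabet sample path x."""
    n = len(x)
    A = alphabet_size
    best_k, best_score = 0, float("inf")
    for k in range(k_max + 1):
        trans = Counter()   # counts N(a, b) of (length-k context, next symbol)
        ctx = Counter()     # counts N(a) of each length-k context
        for i in range(k, n):
            c = tuple(x[i - k:i])
            trans[(c, x[i])] += 1
            ctx[c] += 1
        # Negative log of the k-th order maximum likelihood:
        # -sum over (a, b) of N(a, b) * log(N(a, b) / N(a)).
        neg_log_ml = -sum(cnt * math.log(cnt / ctx[c])
                          for (c, b), cnt in trans.items())
        # Penalty term |A|^k (|A| - 1) / 2 * log n.
        penalty = (A ** k) * (A - 1) / 2 * math.log(n)
        score = neg_log_ml + penalty
        if score < best_score:
            best_k, best_score = k, score
    return best_k
\end{verbatim}

As a usage illustration (again hypothetical, with arbitrarily chosen transition
probabilities), simulating a first-order binary chain and applying the
estimator typically recovers order $1$ for moderate sample sizes:

\begin{verbatim}
import random
random.seed(0)
# First-order binary chain with P(1|0) = 0.1 and P(1|1) = 0.6.
x = [0]
for _ in range(10000):
    p = 0.1 if x[-1] == 0 else 0.6
    x.append(1 if random.random() < p else 0)
print(bic_order(x, alphabet_size=2, k_max=5))  # typically prints 1
\end{verbatim}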