We provide a short and elementary proof of the Gittins index theorem for the multi-armed bandit problem, for the case where each bandit is modeled as a finite-state semi-Markov process. We also indicate how this proof can be extended to the branching bandits and Klimov problems.
@article{1177005207,
author = {Tsitsiklis, John N.},
title = {A Short Proof of the Gittins Index Theorem},
journal = {Ann. Appl. Probab.},
volume = {4},
number = {4},
year = {1994},
pages = { 194-199},
language = {en},
url = {http://dml.mathdoc.fr/item/1177005207}
}
Tsitsiklis, John N. A Short Proof of the Gittins Index Theorem. Ann. Appl. Probab., Tome 4 (1994) no. 4, pp. 194-199. http://gdmltest.u-ga.fr/item/1177005207/