We analyze Gittins' Markovian model, as generalized by Varaiya,
Walrand and Buyukkoc, in discrete and continuous time. The approach resembles
Weber's modification of Whittle's, within the framework of both
multi-parameter processes and excursion theory. It is shown that index-priority
strategies are optimal, in concert with all the special cases that have been
treated previously.
@article{1028903380,
author = {Kaspi, Haya and Mandelbaum, Avishai},
title = {Multi-armed bandits in discrete and continuous time},
journal = {Ann. Appl. Probab.},
volume = {8},
number = {1},
year = {1998},
pages = { 1270-1290},
language = {en},
url = {http://dml.mathdoc.fr/item/1028903380}
}
Kaspi, Haya; Mandelbaum, Avishai. Multi-armed bandits in discrete and continuous time. Ann. Appl. Probab., Tome 8 (1998) no. 1, pp. 1270-1290. http://gdmltest.u-ga.fr/item/1028903380/