Continuous-time controlled Markov chains
Guo, Xianping ; Hernández-Lerma, Onésimo
Ann. Appl. Probab., Tome 13 (2003) no. 1, p. 363-388 / Harvested from Project Euclid
This paper concerns studies on continuous-time controlled Markov chains, that is, continuous-time Markov decision processes with a denumerable state space, with respect to the discounted cost criterion. The cost and transition rates are allowed to be unbounded and the action set is a Borel space. We first study control problems in the class of deterministic stationary policies and give very weak conditions under which the existence of $\varepsilon$-optimal ($\varepsilon\geq 0)$ policies is proved using the construction of a minimum Q-process. Then we further consider control problems in the class of randomized Markov policies for (1) regular and (2) nonregular Q-processes. To study case (1), first we present a new necessary and sufficient condition for a nonhomogeneous Q-process to be regular. This regularity condition, together with the extended generatorof a nonhomogeneous Markov process, is used to prove the existence of $\varepsilon$-optimal stationary policies. Our results for case (1) are illustrated by a Schlögl model with a controlled diffusion. For case (2), we obtain a similar result using Kolmogorov's forward equation for the minimum Q-process and we also present an example in which our assumptions are satisfied, but those used in the previous literature fail to hold.
Publié le : 2003-01-14
Classification:  Nonhomogeneous continuous-time Markov chains,  controlled Q-processes,  unbounded cost and transition rates,  discounted criterion,  optimal stationary policies,  93E20,  60J27,  90C40
@article{1042765671,
     author = {Guo, Xianping and Hern\'andez-Lerma, On\'esimo},
     title = {Continuous-time controlled Markov chains},
     journal = {Ann. Appl. Probab.},
     volume = {13},
     number = {1},
     year = {2003},
     pages = { 363-388},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1042765671}
}
Guo, Xianping; Hernández-Lerma, Onésimo. Continuous-time controlled Markov chains. Ann. Appl. Probab., Tome 13 (2003) no. 1, pp.  363-388. http://gdmltest.u-ga.fr/item/1042765671/