Continuous-time controlled Markov chains

Guo, Xianping; Hernández-Lerma, Onésimo

Guo, Xianping ; Hernández-Lerma, Onésimo

Ann. Appl. Probab., Tome 13 (2003) no. 1, p. 363-388 / Harvested from Project Euclid

Résumé

This paper concerns studies on continuous-time controlled Markov chains, that is, continuous-time Markov decision processes with a denumerable state space, with respect to the discounted cost criterion. The cost and transition rates are allowed to be unbounded and the action set is a Borel space. We first study control problems in the class of deterministic stationary policies and give very weak conditions under which the existence of $\varepsilon$-optimal ($\varepsilon\geq 0)$ policies is proved using the construction of a minimum Q-process. Then we further consider control problems in the class of randomized Markov policies for (1) regular and (2) nonregular Q-processes. To study case (1), first we present a new necessary and sufficient condition for a nonhomogeneous Q-process to be regular. This regularity condition, together with the extended generatorof a nonhomogeneous Markov process, is used to prove the existence of $\varepsilon$-optimal stationary policies. Our results for case (1) are illustrated by a Schlögl model with a controlled diffusion. For case (2), we obtain a similar result using Kolmogorov's forward equation for the minimum Q-process and we also present an example in which our assumptions are satisfied, but those used in the previous literature fail to hold.

Publié le : 2003-01-14
Classification: Nonhomogeneous continuous-time Markov chains, controlled Q-processes, unbounded cost and transition rates, discounted criterion, optimal stationary policies, 93E20, 60J27, 90C40

@article{1042765671,
     author = {Guo, Xianping and Hern\'andez-Lerma, On\'esimo},
     title = {Continuous-time controlled Markov chains},
     journal = {Ann. Appl. Probab.},
     volume = {13},
     number = {1},
     year = {2003},
     pages = { 363-388},
     language = {en},
     url = {http://dml.mathdoc.fr/item/1042765671}
}

Guo, Xianping; Hernández-Lerma, Onésimo. Continuous-time controlled Markov chains. Ann. Appl. Probab., Tome 13 (2003) no. 1, pp.  363-388. http://gdmltest.u-ga.fr/item/1042765671/