Deterministic optimal policies for Markov control processes with pathwise constraints

Armando F. Mendoza-Pérez; Onésimo Hernández-Lerma

Applicationes Mathematicae, Volume 39 (2012), pp. 185-209 / Harvested from The Polish Digital Mathematics Library

Abstract

This paper deals with discrete-time Markov control processes in Borel spaces with unbounded rewards. Under suitable hypotheses, we show that a randomized stationary policy is optimal for a certain expected constrained problem (ECP) if and only if it is optimal for the corresponding pathwise constrained problem (pathwise CP). Moreover, we show that a certain parametric family of unconstrained optimality equations yields convergence properties that lead to an approximation scheme which allows us to obtain constrained optimal policies as the limit of unconstrained deterministic optimal policies. In addition, we give sufficient conditions for the existence of deterministic policies that solve these constrained problems.

Published: 2012-01-01

Zbl 1242.93145

EUDML-ID : urn:eudml:doc:280023

@article{bwmeta1.element.bwnjournal-article-doi-10_4064-am39-2-6,
     author = {Armando F. Mendoza-P\'erez and On\'esimo Hern\'andez-Lerma},
     title = {Deterministic optimal policies for Markov control processes with pathwise constraints},
     journal = {Applicationes Mathematicae},
     volume = {39},
     year = {2012},
     pages = {185-209},
     zbl = {1242.93145},
     language = {en},
     url = {http://dml.mathdoc.fr/item/bwmeta1.element.bwnjournal-article-doi-10_4064-am39-2-6}
}

Armando F. Mendoza-Pérez; Onésimo Hernández-Lerma. Deterministic optimal policies for Markov control processes with pathwise constraints. Applicationes Mathematicae, Volume 39 (2012), pp. 185-209. http://gdmltest.u-ga.fr/item/bwmeta1.element.bwnjournal-article-doi-10_4064-am39-2-6/