Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion

J. Minjárez-Sosa

doi:10.4064/am-26-3-267-280


Volume 26 / 1999

J. Minjárez-Sosa, Applicationes Mathematicae 26 (1999), 267-280. DOI: 10.4064/am-26-3-267-280

Abstract

We introduce average cost optimal adaptive policies in a class of discrete-time Markov control processes with Borel state and action spaces, allowing unbounded costs. The processes evolve according to the system equations $x_{t+1}=F(x_t,a_t,\xi_t)$, $t=1,2,\ldots$, with i.i.d. $\mathbb{R}^k$-valued random vectors $\xi_t$, which are observable but whose density $\varrho$ is unknown.
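
As a rough illustration of the setting described in the abstract, the following Python sketch simulates a scalar system of the form $x_{t+1}=F(x_t,a_t,\xi_t)$ in which the controller observes the disturbances and builds a nonparametric estimate of their density. Everything specific here is an assumption made for illustration and is not taken from the paper: the linear map `F`, the placeholder feedback rule, the standard normal disturbances, and the Gaussian kernel estimator `kde` with bandwidth $n^{-1/5}$. The paper's own estimator and policy construction may differ.

```python
import math
import random


def F(x, a, xi):
    # Hypothetical scalar dynamics, chosen only for illustration.
    return 0.5 * x + a + xi


def kde(samples, bandwidth):
    """Return a Gaussian kernel density estimate built from the samples."""
    n = len(samples)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def rho_hat(s):
        return norm * sum(
            math.exp(-0.5 * ((s - xi) / bandwidth) ** 2) for xi in samples
        )

    return rho_hat


def simulate(T, seed=0):
    """Run the system for T steps and return the final state and observed noise."""
    rng = random.Random(seed)
    x = 0.0
    observed = []
    for _ in range(T):
        a = -0.25 * x              # placeholder policy, not an optimal one
        xi = rng.gauss(0.0, 1.0)   # true density, unknown to the controller
        observed.append(xi)        # the disturbance is observable
        x = F(x, a, xi)
    return x, observed


x_final, noise = simulate(2000)
rho_hat = kde(noise, bandwidth=len(noise) ** -0.2)
```

In an adaptive scheme of this kind, an estimate such as `rho_hat` would be refreshed as new disturbances are observed and then substituted for the unknown density when choosing actions; the sketch stops at the estimation step.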

Authors

J. Minjárez-Sosa

Free download under CC-BY license
