On adaptive control of a partially observed Markov chain

Giovanni Di Masi; Łukasz Stettner

doi:10.4064/am-22-2-165-180

Instytut Matematyczny Polskiej Akademii Nauk / Institute of Mathematics / Publishing house / Journals and Serials / Applicationes Mathematicae / All issues

Search for IMPAN publications

On adaptive control of a partially observed Markov chain

Volume 22 / 1994

Giovanni Di Masi, Łukasz Stettner Applicationes Mathematicae 22 (1994), 165-180 DOI: 10.4064/am-22-2-165-180

Abstract

A control problem for a partially observable Markov chain depending on a parameter with long run average cost is studied. Using uniform ergodicity arguments it is shown that, for values of the parameter varying in a compact set, it is possible to consider only a finite number of nearly optimal controls based on the values of actually computable approximate filters. This leads to an algorithm that guarantees nearly selfoptimizing properties without identifiability conditions. The algorithm is based on probing control, whose cost is additionally assumed to be periodically observable.

Authors

Giovanni Di Masi
Łukasz Stettner

Free download under CC-BY license

Search for IMPAN publications

Instytut Matematyczny Polskiej Akademii Nauk / Institute of Mathematics / Publishing house / Journals and Serials / Applicationes Mathematicae / All issues

Applicationes Mathematicae

On adaptive control of a partially observed Markov chain

Volume 22 / 1994

Abstract

Authors

Search for IMPAN publications

Rewrite code from the image