On adaptive control of a partially observed Markov chain

Tom 22 / 1994

Giovanni Di Masi, Łukasz Stettner Applicationes Mathematicae 22 (1994), 165-180 DOI: 10.4064/am-22-2-165-180

Streszczenie

A control problem for a partially observable Markov chain depending on a parameter with long run average cost is studied. Using uniform ergodicity arguments it is shown that, for values of the parameter varying in a compact set, it is possible to consider only a finite number of nearly optimal controls based on the values of actually computable approximate filters. This leads to an algorithm that guarantees nearly selfoptimizing properties without identifiability conditions. The algorithm is based on probing control, whose cost is additionally assumed to be periodically observable.

Autorzy

  • Giovanni Di Masi
  • Łukasz Stettner

Przeszukaj wydawnictwa IMPAN

Zbyt krótkie zapytanie. Wpisz co najmniej 4 znaki.

Przepisz kod z obrazka

Odśwież obrazek

Odśwież obrazek