On the relative value iteration with a risk-sensitive criterion

Ari Arapostathis; Vivek S. Borkar

doi:10.4064/bc122-1

Instytut Matematyczny Polskiej Akademii Nauk / Institute of Mathematics / Publishing house / Banach Center Publications / All volumes

On the relative value iteration with a risk-sensitive criterion

Volume 122 / 2020

Ari Arapostathis, Vivek S. Borkar Banach Center Publications 122 (2020), 9-24 MSC: Primary: 90C40, 93E20, 49K40; Secondary: 60J25, 60J60. DOI: 10.4064/bc122-1

Abstract

A multiplicative relative value iteration algorithm for solving the dynamic programming equation for the risk-sensitive control problem is studied for discrete time controlled Markov chains with a compact Polish state space, and controlled diffusions in the whole Euclidean space. The main result is a proof of convergence to the desired limit in each case.

Authors

Ari ArapostathisDepartment of Electrical and Computer Engineering
The University of Texas at Austin
2501 Speedway, EER 7.824
Austin, TX 78712, USA
e-mail
Vivek S. BorkarDepartment of Electrical Engineering
Indian Institute of Technology
Powai, Mumbai 400076, India
e-mail

Free download under CC-BY license

Search for IMPAN publications

Instytut Matematyczny Polskiej Akademii Nauk / Institute of Mathematics / Publishing house / Banach Center Publications / All volumes

Banach Center Publications

On the relative value iteration with a risk-sensitive criterion

Volume 122 / 2020

Abstract

Authors

Search for IMPAN publications

Rewrite code from the image