|
|
||||||||
Department of Mathematics, Technion, Technion City, Haifa 32000, Israel
This paper concerns discrete-time, finite state multichain MDPs with compact action sets. The optimality criterion is long-run average cost. Simple examples illustrate that optimal stationary Markov policies do not always exist. We establish the existence of
la{at}techunix.technion.ac.il
-optimal policies that are stationary Markovian, and develop an algorithm that computes these approximate optimal policies. We establish a necessary and sufficient condition for the existence of an optimal policy that is stationary Markovian, and in case that such an optimal policy exists the algorithm computes it.
History: Received: September 6, 1998;
revision received: August 10, 1999;revision received: July 13, 2000;revision received: September 26, 2001;revision received: May 19, 2002;revision received: January 28, 2003;
This article has been cited by other articles:
![]() |
A. Leizarowitz and A. J. Zaslavski Uniqueness and Stability of Optimal Policies of Finite State Markov Decision Processes Mathematics of Operations Research, February 1, 2007; 32(1): 156 - 167. [Abstract] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |