Mathematics of Operations Research
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


MATHEMATICS OF OPERATIONS RESEARCH
Vol. 28, No. 3, August 2003, pp. 553-586
DOI: 10.1287/moor.28.3.553.16388
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Leizarowitz, A.
Right arrow Search for Related Content

An Algorithm to Identify and Compute Average Optimal Policies in Multichain Markov Decision Processes

Arie Leizarowitz

Department of Mathematics, Technion, Technion City, Haifa 32000, Israel
la{at}techunix.technion.ac.il

This paper concerns discrete-time, finite state multichain MDPs with compact action sets. The optimality criterion is long-run average cost. Simple examples illustrate that optimal stationary Markov policies do not always exist. We establish the existence of {varepsilon}-optimal policies that are stationary Markovian, and develop an algorithm that computes these approximate optimal policies. We establish a necessary and sufficient condition for the existence of an optimal policy that is stationary Markovian, and in case that such an optimal policy exists the algorithm computes it.

Key Words: Markov decision process; long-run average cost; multichain MDP; communication classes; finite state space; computation algorithm; approximate optimal policies; optimal policies; compact action sets
History: Received: September 6, 1998; revision received: August 10, 1999;revision received: July 13, 2000;revision received: September 26, 2001;revision received: May 19, 2002;revision received: January 28, 2003;


This article has been cited by other articles:


Home page
Mathematics of Operations ResearchHome page
A. Leizarowitz and A. J. Zaslavski
Uniqueness and Stability of Optimal Policies of Finite State Markov Decision Processes
Mathematics of Operations Research, February 1, 2007; 32(1): 156 - 167.
[Abstract] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2003 by INFORMS.