Mathematics of Operations Research
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


MATHEMATICS OF OPERATIONS RESEARCH
Vol. 29, No. 2, May 2004, pp. 339-352
DOI: 10.1287/moor.1030.0077
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Jianyong, L.
Right arrow Articles by Xiaobo, Z.
Right arrow Search for Related Content

On Average Reward Semi-Markov Decision Processes with a General Multichain Structure

L. Jianyong, Z. Xiaobo

Institute of Applied Mathematics, Academia Sinica, Beijing 100080, China
Department of Industrial Engineering, Tsinghua University, Beijing 100084, China

liujy{at}mail.amss.ac.cn
xbzhao{at}mail.tsinghua.edu.cn

In this paper we investigate average reward semi-Markov decision processes with a general multichain structure using a data-transformation method. By solving the transformed discrete-time average Markov decision processes, we can obtain significant and interesting information on the original average semi-Markov decision processes. If the original semi-Markov decision processes satisfy some appropriate conditions, then stationary optimal policies in the transformed discrete-time models are also optimal in the original semi-Markov decision processes.

Key Words: semi-Markov decision processes; average reward criterion; multichain structure; data-transformation method; optimal policy
History: Received: April 29, 2002; revision received: June 2, 2003;revision received: June 7, 2003;





HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2004 by INFORMS.