|
|
||||||||
Department of Management Science and Engineering, Terman 316, Stanford University, Stanford, California 94305
We present a new complexity result on solving the Markov decision problem (MDP) with n states and a number of actions for each state, a special class of real-number linear programs with the Leontief matrix structure. We prove that when the discount factor
yinyu-ye{at}stanford.edu, www.stanford.edu/~yyye
is strictly less than 1, the problem can be solved in at most O(n1.5(log 1/(1
) + log n)) classical interior-point method iterations and O(n4(log 1/(1
) + log n)) arithmetic operations. Our method is a combinatorial interior-point method related to the work of Ye (1990. A "build-down" scheme for linear programming. Math. Programming 46 6172) and Vavasis and Ye (1996. A primal-dual interior-point method whose running time depends only on the constraint matrix. Math. Programming 74 79120). To our knowledge, this is the first strongly polynomial-time algorithm for solving the MDP when the discount factor is a constant less than 1.
History: Received: November 4, 2003;
revision received: October 3, 2004;revision received: December 27, 2004;
This article has been cited by other articles:
![]() |
A. Zadorojniy, G. Even, and A. Shwartz A Strongly Polynomial Algorithm for Controlled Queues Mathematics of Operations Research, November 1, 2009; 34(4): 992 - 1007. [Abstract] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |