|
|
||||||||
Departamento de Computación, Universidad Simón Bolívar, Caracas 89000, Venezuela
We establish a bound on the convergence time of the value iteration algorithm on stochastic shortest-path problems. The bound, which applies for admissible initial vectors as, for example, J\equiv 0, implies a polynomial-time convergence of value iteration for all problems with polynomially bounded \Vert{J^*}\Vert/\underline{g}. This result gives a partial answer to the open problem of bounding the convergence time of value iteration on arbitrary initial vectors. The proof is obtained by analyzing a stochastic process associated with the shortest-path problem.
bonet{at}ldc.usb.ve, http://www.ldc.usb.ve/
bonet
History: Received: October 3, 2005;
revision received: August 28, 2006;
This article has been cited by other articles:
![]() |
A. Vladimirsky Label-Setting Methods for Multimode Stochastic Shortest Path Problems on Graphs Mathematics of Operations Research, November 1, 2008; 33(4): 821 - 838. [Abstract] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |