SMARTech   Library Home
 

Georgia Tech's Institutional Repository >
College of Computing (CoC) >
School of Interactive Computing (SIC) >
School of Interactive Computing Technical Reports  >

Title: Horizon-based Value Iteration
Authors: Zang, Peng
Irani, Arya
Isbell, Charles
Subjects : Convergence
Reinforcement learning
Reverse value iteration (RVI)
Value iteration
Issue Date: 2007
Publisher: Georgia Institute of Technology
Series/Report no.: SIC Technical Reports; GIT-IC-07-07
Abstract: We present a horizon-based value iteration algorithm called Reverse Value Iteration (RVI). Empirical results on a variety of domains, both synthetic and real, show RVI often yields speedups of several orders of magnitude. RVI does this by ordering backups by horizons, with preference given to closer horizons, thereby avoiding many unnecessary and incorrect backups. We also compare to related work, including prioritized and partitioned value iteration approaches, and show that our technique performs favorably. The techniques presented in RVI are complementary and can be used in conjunction with previous techniques. We prove that RVI converges and often has better (but never worse) complexity than standard value iteration. To the authors’ knowledge, this is the first comprehensive theoretical and empirical treatment of such an approach to value iteration.
URI: http://hdl.handle.net/1853/19893
Appears in Collections:School of Interactive Computing Technical Reports

Files in This Item:

File Description SizeFormat
GIT-IC-07-07.pdf166.88 kBAdobe PDFView/Open

Items in SMARTech are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2007 MIT and Hewlett-Packard - Feedback