|
Georgia Tech's Institutional Repository >
College of Computing (CoC) >
School of Interactive Computing (SIC) >
School of Interactive Computing Technical Reports >
| Title: | Horizon-based Value Iteration |
| Authors: | Zang, Peng Irani, Arya Isbell, Charles |
| Subjects : | Convergence Reinforcement learning Reverse value iteration (RVI) Value iteration |
| Issue Date: | 2007 |
| Publisher: | Georgia Institute of Technology |
| Series/Report no.: | SIC Technical Reports; GIT-IC-07-07 |
| Abstract: | We present a horizon-based value iteration algorithm called Reverse
Value Iteration (RVI). Empirical results on a variety of domains,
both synthetic and real, show RVI often yields speedups of
several orders of magnitude. RVI does this by ordering backups by
horizons, with preference given to closer horizons, thereby avoiding
many unnecessary and incorrect backups. We also compare
to related work, including prioritized and partitioned value iteration
approaches, and show that our technique performs favorably.
The techniques presented in RVI are complementary and can be
used in conjunction with previous techniques. We prove that RVI
converges and often has better (but never worse) complexity than
standard value iteration. To the authors’ knowledge, this is the first
comprehensive theoretical and empirical treatment of such an approach
to value iteration. |
| URI: | http://hdl.handle.net/1853/19893 |
| Appears in Collections: | School of Interactive Computing Technical Reports
|
Items in SMARTech are protected by copyright, with all rights reserved, unless otherwise indicated.
|