|
Georgia Tech's Institutional Repository >
College of Computing (CoC) >
School of Interactive Computing (SIC) >
School of Interactive Computing Technical Reports >
| Title: | Anticipatory Robot Control for a Partially Observable Environment Using Episodic Memories |
| Authors: | Endo, Yoichiro |
| Subjects : | Episodic memory Instance-based learning Partially observable Markov decision process (POMDP) Recursive Bayesian filtering Robots Temporal difference learning |
| Issue Date: | 2007 |
| Publisher: | Georgia Institute of Technology |
| Series/Report no.: | SIC Technical Reports; GIT-IC-07-03 |
| Abstract: | This paper explains an episodic-memory based
approach for computing anticipatory robot behavior in a
partially observable environment. Inspired by biological
findings on the mammalian hippocampus, here, the episodic
memories retain a sequence of experienced observation,
behavior, and reward. Incorporating multiple machine learning
methods, this approach attempts to help reducing the
computational burden of the partially observable Markov
decision process (POMDP). In particular, the proposed
computational reduction techniques include: 1) abstraction of
the state space via temporal difference learning; 2) abstraction
of the action space by utilizing motor schemata; 3) narrowing
down the state space in terms of the goals by employing
instance-based learning; 4) elimination of the value-iteration by
assuming a unidirectional-linear-chaining formation of the state
space; 5) reduction of the state-estimate computation by
exploiting the property of the Poisson distribution; and 6)
trimming the history length by imposing the cap on the number
of episodes that are computed. Furthermore, claims 5) and 6)
were empirically verified, and it was confirmed that the state
estimation can be in fact computed in an O(n) time (where n is
the number of the states), more efficient than a conventional
Kalman-filter based approach of O(n2). |
| URI: | http://hdl.handle.net/1853/19887 |
| Appears in Collections: | School of Interactive Computing Technical Reports
|
Items in SMARTech are protected by copyright, with all rights reserved, unless otherwise indicated.
|