Utilizing negative policy information to accelerate reinforcement learning
(Georgia Institute of Technology, 2015-04-08)
A pilot study by Subramanian et al. on how humans decompose Markov decision problem tasks revealed that participants break tasks down into both short-term subgoals with a defined end condition (such as "go to food") and ...