* Computer Science and Engineering, University of Michigan
+ Mathematics, University of Michigan
^ Robotics, University of Michigan
"Learning is by nature, curiosity" - Plato
Reinforcement learning has made great strides at exploring virtual environments, especially those of video games because of their well structured reward systems. However, it struggles in sparse reward scenarios. Curiosity driven exploration, where the agent is intrinsically motivated to explore the environment, has been proposed as a solution to this issue. Curiosity is a type of intrinsic reward function which uses prediction error as reward signal. However, current implementations of curiosity based learning only predict one time step into the future. As a result, if a catastrophic event were to happen two time steps away, the agent would be none the wiser.
In this project,
Yuri Burda, Harri Edwards, Deepak Pathak, Amos Storkey, Trevor Darrell, Alexei A. Efros. Large-Scale Study of Curiosity-Driven Learning. In ICLR 2019.
We would like to thank Ruben Villegas for proposing this project and helping us understand the different concepts involved. We would also like to extend our thanks to Prof. Honglak Lee for all his help.