Mar 1, 1010
We formulate homeostasis as an intrinsic motivation objective and show interesting emergent behavior from minimizing Bayesian surprise with RL across many environments.
Jan 30, 30300
Jan 1, 1010
Task-agnostic visual exploration policies may be trained through a proxy "observation completion" task that requires an agent to "paint" unobserved views given a small set of observed views.
Jan 1, 1010
Jan 1, 1010