We learn reward functions in unsupervised object keypoint space, to allow us to follow third-person demonstrations with model-based RL.
Oct 15, 15150
We demonstrate visual control within 20 seconds on a robot with unknown morphology, from a single uncalibrated RGBD camera.
May 17, 17170