Dinesh Jayaraman
Dinesh Jayaraman
Home
Research Group
Publications
Teaching
Safety
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Yecheng Jason Ma
,
Andrew Shen
,
Osbert Bastani
,
Dinesh Jayaraman
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
,
Dinesh Jayaraman
,
Osbert Bastani
Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings
How to train RL agents safely? We propose to pretrain a model-based agent in a mix of sandbox environments, then plan pessimistically when finetuning in the target environment.
Jesse Zhang
,
Brian Cheung
,
Chelsea Finn
,
Sergey Levine
,
Dinesh Jayaraman
Cite
×