Official release of the code used in the paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlight)
24 stars, 6 forks
Add contrastive preference learning; BC; PVPES #11
Closed by pengzhenghao 6 months ago