metadriverse / pvp

Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlight)
https://metadriverse.github.io/pvp/
24 stars 6 forks source link

Add contrastive preference learning; BC; PVPES #11

Closed pengzhenghao closed 6 months ago