chufanchen / read-paper-and-code

0 stars 0 forks source link

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL #38

Open chufanchen opened 7 months ago

chufanchen commented 7 months ago

https://arxiv.org/abs/2403.03950