ai4co / rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)
https://rl4.co
MIT License
455 stars, 84 forks

Critic baseline for Policy Gradients #43

Closed. fedebotu closed this issue 1 year ago

fedebotu commented 1 year ago

The critic baseline is not implemented yet. I will work on it alongside fixing the Rollout baseline problem.
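
For context, here is a rough sketch of what a critic baseline does in a policy-gradient (REINFORCE-style) update: a learned value estimate is subtracted from the reward to reduce gradient variance, and the critic itself is regressed onto the observed rewards. This is not rl4co's API. `LinearCritic` and `reinforce_loss` are hypothetical names, the critic is assumed to be a plain linear model, and the code uses only the standard library so it stays framework-agnostic. A real implementation would use an autograd framework and stop gradients from flowing through the baseline.

```python
from dataclasses import dataclass, field


@dataclass
class LinearCritic:
    """Hypothetical critic: predicts the expected reward as w . x + b."""

    n_features: int
    lr: float = 0.1
    w: list = field(default_factory=list)
    b: float = 0.0

    def __post_init__(self):
        # Start from an all-zero weight vector unless weights were supplied.
        if not self.w:
            self.w = [0.0] * self.n_features

    def predict(self, x):
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b

    def update(self, batch_x, rewards):
        """One gradient-descent step on the mean squared error to the rewards.

        Returns the MSE measured before the step.
        """
        n = len(batch_x)
        errors = [self.predict(x) - r for x, r in zip(batch_x, rewards)]
        for j in range(self.n_features):
            grad_j = 2.0 * sum(e * x[j] for e, x in zip(errors, batch_x)) / n
            self.w[j] -= self.lr * grad_j
        self.b -= self.lr * 2.0 * sum(errors) / n
        return sum(e * e for e in errors) / n


def reinforce_loss(log_probs, rewards, baseline_values):
    """Surrogate loss: -mean((reward - baseline) * log_prob).

    The baseline values are treated as constants here; with autograd
    they would be detached so only the policy receives this gradient.
    """
    advantages = [r - v for r, v in zip(rewards, baseline_values)]
    return -sum(a * lp for a, lp in zip(advantages, log_probs)) / len(log_probs)
```

In a training loop under these assumptions, each batch would call `critic.predict` on the instance features to get baseline values, compute `reinforce_loss` for the policy step, and then call `critic.update` with the same features and rewards.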