Critic baseline for Policy Gradients

ai4co / rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

https://rl4.co

MIT License

455 stars 84 forks source link

Closed fedebotu closed 1 year ago

fedebotu commented 1 year ago

At the moment, the critic baseline is still not implemented - will be working on this alongside solving the Rollout baseline problem