MrSyee / pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
MIT License
848 stars 119 forks

Fix bugs in loss calculation #1

Curt-Park closed 5 years ago