will-maclean / sb3-burn

Implementation of stable-baselines3 in rust with burn
MIT License
11 stars 0 forks source link

DQN Training issues #38

Closed will-maclean closed 2 weeks ago

will-maclean commented 2 weeks ago

Not working well, not sure why

will-maclean commented 2 weeks ago

Fixed a buffer bug killing probe env 2 and higher, probe env 1-4 now work

will-maclean commented 2 weeks ago

Have discovered, if using relu and hidden size = 1, training can be impossible

will-maclean commented 2 weeks ago

Note: in smaller networks, esp if using the advantage DQN, sigmoid seems more stable than ReLU