PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments
https://pufferai.github.io/
MIT License
1.23k stars 58 forks source link

Logits #67

Open thatguy11325 opened 9 months ago

thatguy11325 commented 9 months ago

Should hopefully be faster. Based on my comparison of different categorical distribution sampling methods. Wandb tests