probml / pml2-book

Probabilistic Machine Learning: Advanced Topics
MIT License
1.39k stars 119 forks source link

small update to sec 35.5.3 on deadly triad #330

Closed murphyk closed 1 month ago

murphyk commented 1 month ago

Added brief discussion of gradient TD and target networks to stabilize off-policy learning.