Small update to Sec. 35.5.3 on the deadly triad

probml / pml2-book

Probabilistic Machine Learning: Advanced Topics

Closed by murphyk 1 month ago

murphyk commented 1 month ago

Added a brief discussion of gradient TD methods and target networks as ways to stabilize off-policy learning.
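
For readers unfamiliar with the target-network idea, here is a minimal sketch; it is not taken from the book. It assumes a linear value function v(s) = w . phi(s) and semi-gradient TD(0), and all names (`td_target_network_step`, `train`, `sync_every`) are hypothetical. The only point illustrated is that the bootstrap target is computed from a lagged copy of the weights, which is refreshed only occasionally. Gradient TD is a different technique and is not shown here.

```python
def dot(a, b):
    """Inner product of two equal-length lists of floats."""
    return sum(x * y for x, y in zip(a, b))


def td_target_network_step(w, w_target, phi, reward, phi_next,
                           alpha=0.1, gamma=0.9):
    """One semi-gradient TD(0) step for a linear value function.

    The bootstrap value of the next state uses the frozen weights
    w_target rather than the weights w being updated.
    """
    v = dot(w, phi)                    # current estimate, online weights
    v_next = dot(w_target, phi_next)   # bootstrap estimate, frozen weights
    td_error = reward + gamma * v_next - v
    return [wi + alpha * td_error * xi for wi, xi in zip(w, phi)]


def train(transitions, n_features, sync_every=10, alpha=0.1, gamma=0.9):
    """Run TD(0) over (phi, reward, phi_next) tuples.

    The target weights are copied from the online weights every
    sync_every steps (an illustrative schedule, not a recommendation).
    """
    w = [0.0] * n_features
    w_target = list(w)
    for step, (phi, reward, phi_next) in enumerate(transitions, start=1):
        w = td_target_network_step(w, w_target, phi, reward, phi_next,
                                   alpha, gamma)
        if step % sync_every == 0:
            w_target = list(w)  # refresh the lagged copy
    return w
```

Between syncs the target does not move with `w`, so each update regresses toward a fixed quantity; this is the intuition usually given for why target networks can reduce instability when function approximation, bootstrapping, and off-policy data are combined.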