Closed nasimrahaman closed 6 years ago
This is indeed intentional and based on personal correspondence - I've added a comment there and put a note in the repo wiki to reflect this. I don't actually know how much impact this has though, and I don't have the resources to test the differences.
In this line, the priorities are updated with the importance sampled weights (see this line). This does not appear to be consistent with algorithm 1 of Schaul et al. 2016 - is this intentional?
P.S. great work!