Damcy / prioritized-experience-replay

implement of prioritized experience replay
MIT License
156 stars 39 forks source link

Execution time #4

Closed raymondchua closed 6 years ago

raymondchua commented 7 years ago

Hi, I am curious about the execution time when using prioritized experience replay. I have written the rank-bassed prioritization and now it takes a longer time to complete one epoch as instead of using mini-batch, we compute a single sample as time when training the DQN. Any thoughts?

Damcy commented 7 years ago

I never compare the performance of the two method. But I think the execution time is depends on the size of replay memory. The operations of the rank-based prioritization will affect the execution time.

faaanbingbing commented 7 years ago

你好,如果使用比较大的replay_memory_size,比如10万,100万,是会出现各种索引错误的,请问可以修正吗?

Damcy commented 6 years ago

@faaanbingbing 是这份代码有问题吗,可能是分区的部分写得有问题,但是思路应该是ok的,看可以fork过去加工一下,我这边用的没发现明显的bug。