Improve a2c - Githubissues

MotoShin / deep-reinforcement-learning-list

0 stars 0 forks source link

Closed MotoShin closed 3 years ago

MotoShin commented 3 years ago

a2cを改善していく

MotoShin commented 3 years ago

worker agent のメモリの格納方法が間違っている気がする論文だと下記になっている

until terminal s_t or t - t_start == t_max