Improve Ape-X DQN Implementation

facebookresearch / rlmeta

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

MIT License

284 stars 28 forks source link

Closed xiaomengy closed 1 year ago

xiaomengy commented 1 year ago

This PR introduced several improvements for Ape-X DQN listed below.

Use pre-computed q-value to compute priority. This is mentioned in the original Ape-X paper https://arxiv.org/abs/1803.00933
Add functional regularization from https://arxiv.org/abs/2106.02613
Add SpetralNorm from https://arxiv.org/abs/2105.05246
Modify default configs.