Closed hlsafin closed 7 months ago
Hi! We're sorry that this work primarily concentrates on continuous control, so it has not been tested on tasks with discrete action space. However, the three key mechanisms we propose for DrM could well be adapted for discrete action tasks on DQN/Efficient Rainbow algorithms with some adjustments.
Has this been tested on Montezuma's revenge or Pitfall!, visual hard spare problems with discrete action space?