Open miilue opened 1 year ago
I am confused about Distributional DQN. Why 'next_dists' multiplied by support in the function of ’projection_distribution‘?My model got bad learning after using it. I would appreciate it if you could give me an answer in your spare time!
I am confused about the u.
I am confused about Distributional DQN. Why 'next_dists' multiplied by support in the function of ’projection_distribution‘?My model got bad learning after using it. I would appreciate it if you could give me an answer in your spare time!