Closed harshraj22 closed 2 years ago
https://github.com/harshraj22/rl_lab/blob/a6fa93d6459a4e75fd7086af352cbd91f5e2d7ce/lab2/models/epsilon_greedy.py#L61
Verify if Epsilon Greedy uses a decaying Tempreature or the factor is just the time step.
Using decaying temperature is more effective
https://github.com/harshraj22/rl_lab/blob/a6fa93d6459a4e75fd7086af352cbd91f5e2d7ce/lab2/models/epsilon_greedy.py#L61
Verify if Epsilon Greedy uses a decaying Tempreature or the factor is just the time step.