Open Bahador-Bakhshi opened 3 years ago
In the DeepMind paper, the authors state that they also explore (using an ε-greedy policy with a very small ε) to avoid overfitting.
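For reference, here is a minimal sketch of what ε-greedy action selection with a small ε could look like. The function name `epsilon_greedy`, the `q_values` list, and the default `epsilon=0.05` are my own assumptions for illustration; they are not taken from this repository or quoted from the paper.

```python
import random


def epsilon_greedy(q_values, epsilon=0.05, rng=random):
    """Return an action index chosen by an epsilon-greedy rule.

    q_values: sequence of estimated action values (hypothetical input).
    epsilon:  exploration probability; 0.05 is an assumed small value.
    rng:      any object exposing random() and randrange(), e.g. random.Random(seed).
    """
    # With probability epsilon, explore: pick an action uniformly at random.
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    # Otherwise exploit: pick the action with the highest estimated value.
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

With `epsilon=0.0` this reduces to a purely greedy policy, so keeping ε small but non-zero is what leaves a little randomness in the agent's behavior.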