Open CarterFendley opened 2 years ago
The update logic in QTable here doesn't seem to take the difference between the observed Q-value and the most recently stored one.
Check the math.
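
For reference, here is a minimal sketch of the textbook tabular Q-learning update, where the stored value moves toward the observed target by a fraction of their difference. This is not the QTable code from this repo: the function name `q_update`, the dict-based table, and the `alpha`/`gamma` parameters are all assumptions made for illustration.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Apply one textbook Q-learning update in place (hypothetical helper)."""
    # Observed target: immediate reward plus the discounted best
    # currently stored estimate for the next state.
    target = reward + gamma * max(q[(next_state, a)] for a in actions)
    # Difference between the observed target and the stored value.
    td_error = target - q[(state, action)]
    # Move the stored value a fraction alpha toward the target.
    q[(state, action)] += alpha * td_error
    return q[(state, action)]


# Hypothetical table keyed by (state, action); missing entries default to 0.0.
q = defaultdict(float)
actions = ["left", "right"]
q[("s0", "right")] = 1.0
q[("s1", "right")] = 2.0

new_value = q_update(
    q, "s0", "right", reward=1.0, next_state="s1",
    actions=actions, alpha=0.5, gamma=0.5,
)
print(new_value)
```

If the current update overwrites the entry with the target, or adds `alpha * target` without subtracting the stored value, that would match the behavior described above.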