Open Almujtaba-Yaseen opened 2 years ago
The exercise asked to implement the Monte Carlo Prediction to estimate state-action values... I implemented state values instead...
Will do the changes soon...
Just discoverd that inside the exercise file, it asks to implement a function that returns the state value function V.
But it will be nice if you implemented a function that returns Q(s, a) as well...
The exercise asked to implement the Monte Carlo Prediction to estimate state-action values... I implemented state values instead...
Will do the changes soon...