FrancisLeon / Reinforement-Learning-

MDP of tutorial cards #4

Open FrancisLeon opened 7 years ago

FrancisLeon commented 7 years ago

First, let's recall what an MDP is: a reinforcement learning task that satisfies the Markov property is called a Markov decision process (MDP). The Markov property means that the next state and reward depend only on the current state and action, not on the earlier history. If the state and action spaces are finite, the task is called a finite Markov decision process (finite MDP).
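To make the definition concrete, here is a minimal sketch of a finite MDP in Python. The states, actions, probabilities, and rewards are hypothetical and chosen only for illustration; they are not taken from the tutorial. The point is that `step` looks at nothing but the current state and action, which is what the Markov property requires.

```python
import itertools
import random

# Hypothetical two-state, two-action finite MDP (illustrative values only).
STATES = ["low", "high"]
ACTIONS = ["wait", "search"]

# TRANSITIONS[(state, action)] -> list of (probability, next_state, reward)
TRANSITIONS = {
    ("low", "wait"): [(1.0, "low", 0.0)],
    ("low", "search"): [(0.6, "low", 1.0), (0.4, "high", 1.0)],
    ("high", "wait"): [(1.0, "high", 0.5)],
    ("high", "search"): [(0.7, "high", 2.0), (0.3, "low", 2.0)],
}


def step(state, action, rng=random):
    """Sample (next_state, reward) from the current state and action only."""
    outcomes = TRANSITIONS[(state, action)]
    weights = [p for p, _, _ in outcomes]
    _, next_state, reward = rng.choices(outcomes, weights=weights, k=1)[0]
    return next_state, reward


def is_valid_mdp():
    """Check that every (state, action) pair has a proper distribution."""
    for state, action in itertools.product(STATES, ACTIONS):
        outcomes = TRANSITIONS.get((state, action))
        if not outcomes:
            return False
        if abs(sum(p for p, _, _ in outcomes) - 1.0) > 1e-9:
            return False
        if any(s not in STATES for _, s, _ in outcomes):
            return False
    return True
```

Because both `STATES` and `ACTIONS` are finite lists, the whole dynamics fit in one lookup table, which is the practical meaning of "finite MDP" here. Calling `step("low", "search")` repeatedly generates a trajectory without ever consulting earlier states.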