blakeelias / pandemic_RL

Reinforcement learning for economically optimal pandemic response.
GNU General Public License v3.0

Faster sampling in evaluation #59

Closed: blakeelias closed this issue 3 years ago

blakeelias commented 3 years ago

During evaluation, rather than computing the full transition distribution over all next states and then sampling from it, sample the next state directly from its distribution.
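To illustrate the idea, here is a minimal sketch, not the repository's actual code. It assumes, purely for illustration, that the next state is a case count drawn from a Poisson distribution with a known mean and capped at `num_states - 1`. The function names and the Poisson model are hypothetical. The slow path builds a probability for every state before sampling; the fast path draws a single value directly.

```python
import numpy as np


def sample_next_state_full(rng, mean, num_states):
    """Slow path: build the whole next-state probability vector, then sample.

    Assumes (hypothetically) a Poisson next-state model with mean > 0.
    """
    k = np.arange(num_states)
    # log(k!) for k = 0..num_states-1, computed as a running sum of logs.
    log_fact = np.concatenate(([0.0], np.cumsum(np.log(np.arange(1, num_states)))))
    probs = np.exp(k * np.log(mean) - mean - log_fact)
    # Lump the tail mass beyond the last state into the last state.
    probs[-1] += max(0.0, 1.0 - probs.sum())
    probs /= probs.sum()
    return int(rng.choice(num_states, p=probs))


def sample_next_state_direct(rng, mean, num_states):
    """Fast path: draw one Poisson sample and cap it at the last state."""
    return int(min(rng.poisson(mean), num_states - 1))


rng = np.random.default_rng(0)
print(sample_next_state_full(rng, 20.0, 1000))
print(sample_next_state_direct(rng, 20.0, 1000))
```

Under this assumed model both functions sample from the same distribution, because capping a direct draw is equivalent to assigning the tail mass to the last state. The direct version avoids the O(`num_states`) work per step, which matters when evaluation simulates many steps.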