Faster sampling in evaluation

blakeelias / pandemic_RL

Reinforcement learning for economically optimal pandemic response.

GNU General Public License v3.0

2 stars 1 forks source link

Closed blakeelias closed 3 years ago

blakeelias commented 3 years ago

Rather than computing full transition distribution and sampling from that, just sample directly from the new state distribution.