Open divilian opened 5 years ago
It took me a while to think about this approach. This is very decision theoretic: construct all possible alternatives in a tree over time or actions; calculate utility and regret. I did not realize I was doing it but there you have it.
Make all random number draws the same between proto and non-proto version.