ndphillips / EE-Goals

0 stars 0 forks source link

Simulate divergence between RSF predictions and maximizing #6

Closed ndphillips closed 7 years ago

ndphillips commented 7 years ago

Questions

1) Can we show via simulation how a risk sensitive foraging strategy will outperform a straight EV maximising strategy? 2) In which statistical environments do the two strategies have different performance? 3) What aggregate descriptive differences do we see in the strategies (e.g.; option switching, proposition of high variance choices).

Create a function that takes the following arguments:

The function should return statistics such as the probability of reaching the goal, mean points earned, option switch rate (etc.)

Strategies

EV maximising: Soft-max rule taking only means into account RSF: Soft-max taking into account likelihood of reaching the goal.

Notes

mdsteiner commented 7 years ago

It seems only if the high variance is huge (I used 15 with high EV of 5 and low EV of 2 with var 1) we see the shift in preference for risky options for above vs. under the goal for "ev" and "rsf" strategy.

Above Goal grafik

Under Goal grafik

ndphillips commented 7 years ago

Got it. But the difference in EVs in this example is quite large (2 vs 5). What if you changed the EVs to say 2 and 3? I'm guessing that you could then decrease the magnitude of the variances and still get the effect.

mdsteiner commented 7 years ago

Ok I ran some more simulations.

So I think we should make sure to have a sufficiently high goal and maybe a variance of 7 or 8 to have an effect. I didn't play with the learning and exploration parameters though.

ndphillips commented 7 years ago

Very good! Which selection model did you use? egreedy or softmax? If you haven't already, I would make sure that the results hold for egreedy. As we discussed before, the problem with softmax is that the temperature parameter is sensitive to the scale of the expectations (which differ between EV and RSF). If we can get the results with egreedy (whose parameter is independent of the expectation scale), I think that should be sufficient for now.

But later, we'll probably need to do a more complete simulation using different softmax parameters

mdsteiner commented 7 years ago

I used egreedy to prevent the problems we had last time when we ran the simulations with the softmax.