Closed ndphillips closed 7 years ago
It seems only if the high variance is huge (I used 15 with high EV of 5 and low EV of 2 with var 1) we see the shift in preference for risky options for above vs. under the goal for "ev" and "rsf" strategy.
Above Goal
Under Goal
Got it. But the difference in EVs in this example is quite large (2 vs 5). What if you changed the EVs to say 2 and 3? I'm guessing that you could then decrease the magnitude of the variances and still get the effect.
Ok I ran some more simulations.
So I think we should make sure to have a sufficiently high goal and maybe a variance of 7 or 8 to have an effect. I didn't play with the learning and exploration parameters though.
Very good! Which selection model did you use? egreedy or softmax? If you haven't already, I would make sure that the results hold for egreedy. As we discussed before, the problem with softmax is that the temperature parameter is sensitive to the scale of the expectations (which differ between EV and RSF). If we can get the results with egreedy (whose parameter is independent of the expectation scale), I think that should be sufficient for now.
But later, we'll probably need to do a more complete simulation using different softmax parameters
I used egreedy to prevent the problems we had last time when we ran the simulations with the softmax.
Questions
1) Can we show via simulation how a risk sensitive foraging strategy will outperform a straight EV maximising strategy? 2) In which statistical environments do the two strategies have different performance? 3) What aggregate descriptive differences do we see in the strategies (e.g.; option switching, proposition of high variance choices).
Create a function that takes the following arguments:
mean.v
a vector of option meanssd.v
a vector of option standard deviationstrials
number of trials in the gamegoal
the magnitude of the goalnsim
number of simulationsstrategy
Which strategy does the agent use? RSF or EV maximisingThe function should return statistics such as the probability of reaching the goal, mean points earned, option switch rate (etc.)
Strategies
EV maximising: Soft-max rule taking only means into account RSF: Soft-max taking into account likelihood of reaching the goal.
Notes