Metaculus / metaculus

https://www.metaculus.com
BSD 2-Clause "Simplified" License
Beta calibration curve looks different from www #480

Open SylvainChevalier opened 2 months ago

SylvainChevalier commented 2 months ago

beta: image

www: image

Let's check the maths, @lsabor.

lsabor commented 2 months ago

Somewhat fixed: the Beta curve shown above is for the Metaculus Prediction. Here is the recency-weighted Community Prediction version: image

Here is the updated rewrite version: image

lsabor commented 2 months ago

I've moved this to medium priority. @SylvainChevalier feel free to move it back to high priority.

SylvainChevalier commented 2 months ago

As discussed yesterday, the top bin being bad and different is concerning.

George3d6 commented 2 months ago

@SylvainChevalier are the buckets the same here? It looks like the top one is different in the new version (100 vs 95).
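To illustrate why the bucket question matters, here is a minimal sketch of a calibration curve computed over the same forecasts with two different sets of bin edges. The function name `calibration_curve`, the edge values, and the sample forecasts are all hypothetical and are not taken from either codebase; the only point is that moving the lower edge of the top bin changes which forecasts land in it.

```python
def calibration_curve(predictions, outcomes, edges):
    """Return one entry per bin: (mean prediction, observed frequency, count).

    Bins are half-open [lo, hi), except the last one, which also includes hi.
    Empty bins are reported as None.
    """
    n_bins = len(edges) - 1
    bins = [[] for _ in range(n_bins)]
    for p, o in zip(predictions, outcomes):
        for i in range(n_bins):
            is_last = i == n_bins - 1
            if edges[i] <= p < edges[i + 1] or (is_last and p == edges[i + 1]):
                bins[i].append((p, o))
                break
    curve = []
    for b in bins:
        if not b:
            curve.append(None)
            continue
        n = len(b)
        mean_pred = sum(p for p, _ in b) / n
        observed = sum(o for _, o in b) / n
        curve.append((mean_pred, observed, n))
    return curve


# Hypothetical forecasts near the top of the range and their resolutions.
preds = [0.92, 0.96, 0.97, 0.99]
outcomes = [1, 1, 0, 1]

# Two hypothetical bucketings that differ only in where the top bin starts.
edges_a = [0.0, 0.5, 0.9, 1.0]
edges_b = [0.0, 0.5, 0.95, 1.0]

top_a = calibration_curve(preds, outcomes, edges_a)[-1]
top_b = calibration_curve(preds, outcomes, edges_b)[-1]
print("top bin, edges_a:", top_a)
print("top bin, edges_b:", top_b)
```

With these made-up numbers the top bin holds four forecasts under `edges_a` and three under `edges_b`, so its observed frequency shifts even though the underlying data is identical. If the two sites only label the same bin differently (for example by upper edge versus midpoint), this effect would not apply.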

lsabor commented 2 months ago

Update: Recency Weighted main site: image

Recency Weighted rewrite: image

MP main site: image

MP rewrite: image

lsabor commented 2 months ago

Here's what happens when I set the forecast horizon to run from open to the actual close time rather than the scheduled close time: image

What's going on here is that "coverage" on the main site for ComboPrediction is 100% if a forecast starts at open, even if the question resolves early. Looks like that was our culprit...
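A minimal sketch of the effect described above, assuming coverage is the fraction of the forecast horizon during which a forecast was standing. The `coverage` function and the dates are hypothetical; neither site's actual implementation is shown here.

```python
from datetime import datetime


def coverage(forecast_start, forecast_end, open_time, horizon_end):
    """Fraction of [open_time, horizon_end] overlapped by the forecast interval."""
    horizon = (horizon_end - open_time).total_seconds()
    if horizon <= 0:
        return 0.0
    overlap_start = max(forecast_start, open_time)
    overlap_end = min(forecast_end, horizon_end)
    overlap = max((overlap_end - overlap_start).total_seconds(), 0.0)
    return overlap / horizon


# Hypothetical question: scheduled to run 30 days, but it resolves after 10.
open_time = datetime(2024, 1, 1)
scheduled_close = datetime(2024, 1, 31)
actual_close = datetime(2024, 1, 11)

# A forecast made at open and standing until the question actually closed.
cov_scheduled = coverage(open_time, actual_close, open_time, scheduled_close)
cov_actual = coverage(open_time, actual_close, open_time, actual_close)

print(f"horizon ends at scheduled close: {cov_scheduled:.3f}")
print(f"horizon ends at actual close:    {cov_actual:.3f}")
```

Under these assumptions the same forecast gets a coverage of one third against the scheduled horizon and full coverage against the actual one. If coverage feeds into how forecasts are weighted in the calibration curve, that difference alone could move bins on early-resolving questions.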

lsabor commented 2 months ago

Well, that's definitely not the whole difference as my personal track record is still off: image

image

Note the 4th and 6th bins are much closer in the main site than the rewrite.

lsabor commented 2 months ago

Merged the above changes with notes. Still more to do, though.

SylvainChevalier commented 3 weeks ago

@lsabor what was left to do?

lsabor commented 3 weeks ago

I haven't looked at this since Sep 17. The rewrite version is still different from the old one, and I never got to the bottom of why. I think eventually this should be looked at seriously, and the math checked over...