davidreke / chess-ratings-backend


Remove obvious outliers #9

Open jxu opened 1 year ago

jxu commented 1 year ago

On https://www.chessratingcomparison.com/Graphs, some data points are clearly bogus, like 17 on lichess vs 1620 on chess.com, or 3100 on lichess vs 1658 on chess.com. These should be removed to get a nicer-looking plot.
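One way such a cleanup could look, as a rough sketch and not anything taken from this repo: drop pairs whose lichess-minus-chess.com gap sits far from the median gap, measured in scaled median absolute deviations (MAD). The function name `drop_outliers`, the cutoff `k=5.0`, and every sample pair except the two quoted above are made up for illustration.

```python
from statistics import median

def drop_outliers(pairs, k=5.0):
    """Keep (lichess, chesscom) pairs whose rating gap lies within
    k scaled MADs of the median gap. Hypothetical helper."""
    gaps = [lichess - chesscom for lichess, chesscom in pairs]
    med = median(gaps)
    mad = median(abs(g - med) for g in gaps)
    scale = 1.4826 * mad  # makes the MAD comparable to a standard deviation
    if scale == 0:
        return list(pairs)  # no spread to measure against, keep everything
    return [p for p, g in zip(pairs, gaps) if abs(g - med) <= k * scale]

# Illustrative data: six invented pairs plus the two bogus ones from the issue.
sample = [
    (1500, 1300), (1800, 1550), (2000, 1800),
    (1200, 950), (1650, 1400), (2200, 2050),
    (17, 1620), (3100, 1658),
]
cleaned = drop_outliers(sample)
print(cleaned)
```

The cutoff is a judgment call: a smaller `k` trims more aggressively, and a rule like this assumes the typical gap is roughly constant across the rating range.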

CUBICinfinity commented 10 months ago

Rather than creating a new issue, I want to suggest something else that can be done instead of, or in addition to, this. Use something like a robust LM, or even LOESS, instead of OLS. You'll get a better fit that way.
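To make the OLS-versus-robust difference concrete, here is a small sketch that uses the Theil-Sen estimator as one simple example of a robust linear fit. That choice is an assumption; the comment above does not name a specific method. The clean points are invented so that they lie exactly on chess.com = lichess - 200, and the two bogus pairs from the issue are appended.

```python
from itertools import combinations
from statistics import median

def ols_line(points):
    """Ordinary least squares fit; returns (slope, intercept)."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    slope = sxy / sxx
    return slope, my - slope * mx

def theil_sen_line(points):
    """Theil-Sen fit: median of all pairwise slopes, then the median
    of the intercepts implied by that slope."""
    slopes = [
        (y2 - y1) / (x2 - x1)
        for (x1, y1), (x2, y2) in combinations(points, 2)
        if x1 != x2
    ]
    slope = median(slopes)
    intercept = median(y - slope * x for x, y in points)
    return slope, intercept

# Hypothetical (lichess, chess.com) pairs on a known line, plus two outliers.
points = [(x, x - 200) for x in range(1000, 2600, 200)]
points += [(17, 1620), (3100, 1658)]

ols_slope, ols_intercept = ols_line(points)
ts_slope, ts_intercept = theil_sen_line(points)
print(f"OLS:       slope={ols_slope:.3f}, intercept={ols_intercept:.1f}")
print(f"Theil-Sen: slope={ts_slope:.3f}, intercept={ts_intercept:.1f}")
```

On this toy data the two outliers pull the OLS slope well away from the underlying line, while the median-based fit stays on it. Real rating data is noisier, so the gap between the two fits would likely be smaller.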

jxu commented 10 months ago

That is true. Another option is linear regression with absolute error as the loss, since it is more robust to outliers, just as the median is more robust than the mean. Development on the project seems to have stalled, though.
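For reference, a minimal sketch of the absolute-error idea. It relies on the fact that, for a straight-line fit, some line minimizing the total absolute error passes through at least two of the data points, so a brute-force search over point pairs finds a minimizer. The names `abs_loss` and `lad_line` and the sample data (apart from the two pairs quoted in the issue) are hypothetical.

```python
from itertools import combinations

def abs_loss(points, slope, intercept):
    """Sum of absolute residuals for the line y = slope * x + intercept."""
    return sum(abs(y - (slope * x + intercept)) for x, y in points)

def lad_line(points):
    """Least-absolute-deviations fit by trying every line through two points."""
    best = None
    for (x1, y1), (x2, y2) in combinations(points, 2):
        if x1 == x2:
            continue  # a vertical line cannot be written as y = slope * x + b
        slope = (y2 - y1) / (x2 - x1)
        intercept = y1 - slope * x1
        loss = abs_loss(points, slope, intercept)
        if best is None or loss < best[0]:
            best = (loss, slope, intercept)
    return best[1], best[2]

# Invented pairs on chess.com = lichess - 200, plus the two bogus pairs.
points = [(x, x - 200) for x in range(1000, 2600, 200)]
points += [(17, 1620), (3100, 1658)]

lad_slope, lad_intercept = lad_line(points)
print(f"LAD: slope={lad_slope:.3f}, intercept={lad_intercept:.1f}")
```

The pair search is cubic in the number of points, so it only suits small samples. On a full dataset the same loss would normally be minimized with linear programming or median (quantile) regression.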