Closed csthunes closed 1 year ago
Option: Create point tiers dynamically based on the data from the top 20 players for each position. Rather than set a point tier range and try and balance the numbers in each range, have it dynamically set by normal distribution percentiles of points for the given year, position, and point type.
For example, when looking at calculating the PPR tiers for RBs in 2022, we should follow the following steps:
Selecting the top however many rows of the dataset results in a very positively skewed distribution, which means it doesn't make sense to assume normality when calculating where percentiles are. This solution is better than the older way, but it needs to be tuned to be more acceptable to the skewed data.
Using median instead of mean and taking top 40 rbs and top 60 wrs instead of standard 20 like other positions alleviates much of this issue.
Closing this...good enough for now
Improve on solution for point tiers for determining the quality of games. In particular, improve discrepancies between positions and where each quality level would fall for that position.
The goal here is to make sure our score variables are well balanced between positions, so we can accurately compare players of different positions to each other.