[BUG] <Missing Rows in load_player_stats()> #204

Closed TheMathNinja closed 1 year ago

TheMathNinja commented 1 year ago

load_player_stats() omits rows for weeks in which a player plays but records no relevant box score statistics.


load_player_stats(
    season = 2021:2022,
    stat_type = c("defense"))

Expected Behavior

I expect this to return a row for every week a player played football. But this function entirely omits weeks in which a player plays but records no box score statistics. Some top DT examples in 2022: Jonathan Allen (Week 7), DeForest Buckner (Week 4), and Chris Jones (Week 15).

I noticed this is also an issue for Offense. Tee Higgins doesn't get a row in Week 5 or Week 14 of 2022 even though he played both weeks.

This causes issues when calculating something like Receptions Per Game across a season (the Games denominator is wrong).
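To make the denominator problem concrete, here is a minimal sketch in base R. The data frame and the `games_played` value are made-up toy numbers, not real player data; the sketch assumes the true game count is known from some other source.

```r
# Hypothetical toy data: one row per week in which the player recorded a stat.
# Weeks 2 and 4 are absent because the player played but recorded nothing.
stats <- data.frame(
  week       = c(1, 3, 5),
  receptions = c(4, 6, 5)
)

# Assumed true number of games played, taken from a separate source.
games_played <- 5

# Counting rows as games overstates the per-game rate.
rec_per_game_rows <- sum(stats$receptions) / nrow(stats)

# Using the true game count gives the intended rate.
rec_per_game_true <- sum(stats$receptions) / games_played
```

With these toy numbers, the row-based rate is 15 / 3 and the true rate is 15 / 5, so the missing weeks inflate the per-game figure.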


I'm wondering if joining participation data might help with this (include snaps as a variable for offense and defense?), but I'm guessing that creates more dependencies, which might not be desirable.

mrcaseb commented 1 year ago

This is not a bug as nflfastR can only use pbp data to compute stats. If a player doesn't record any stat in the pbp data, there is no way to count their games correctly.

I suggest using participation data or maybe PFR stats to count players' games more precisely.
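One way this suggestion could look in practice is sketched below in base R. Both tables are hypothetical toy data: `stats` stands in for the pbp-derived player stats, and `games` stands in for a separate games-played source (for example, one derived from participation data or PFR). The column names are assumptions for illustration, not the actual schema of either source.

```r
# Hypothetical pbp-derived stats: player A has no row for week 2.
stats <- data.frame(
  player_id = c("A", "A", "B"),
  week      = c(1, 3, 1),
  tackles   = c(2, 1, 4)
)

# Hypothetical games-played table from a separate source:
# one row per player-week in which the player appeared.
games <- data.frame(
  player_id = c("A", "A", "A", "B"),
  week      = c(1, 2, 3, 1)
)

# Left join: keep every game played, attach stats where they exist.
full <- merge(games, stats, by = c("player_id", "week"), all.x = TRUE)

# Weeks without a stats row become zeros rather than missing values.
full$tackles[is.na(full$tackles)] <- 0

# Games and per-game rates now use the games-played table as the denominator.
games_per_player <- table(full$player_id)
tackles_per_game <- tapply(full$tackles, full$player_id, mean)
```

The games-played table drives the join, so a week with no recorded stats still contributes a row to the denominator.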