sportsdataverse / sportsdataverse-py

sportsdataverse python package
https://py.sportsdataverse.org
MIT License
78 stars 8 forks source link

Non-matching names in CFB player data #6

Open christophermclement opened 2 years ago

christophermclement commented 2 years ago

When you pull cfb_rosters you can't link those players back to there teams in some cases because the school ID isn't used, just the name, and the names sometimes don't match any of the variations in the teams data, even if you join both the cfbd and espn names

The non-matching teams are:

{'Louisiana Monroe', 'St Francis (PA)', 'Sam Houston State', 'Southeastern Louisiana', 'Connecticut', 'UT San Antonio', 'Prairie View', 'Southern Mississippi', 'Presbyterian College'}