vishaalagartha / basketball_reference_scraper

A python module for scraping static and dynamic content from Basketball Reference.
MIT License
254 stars, 91 forks

Player names sometimes mis-assigned in get_box_scores() #94

Open dgroshan18 opened 1 year ago

dgroshan18 commented 1 year ago

I don't understand what is happening here but it looks like a defect to me; figured I'd share this example in case anyone smarter wants to check it out...

EXAMPLE SCRIPT

from basketball_reference_scraper.box_scores import get_box_scores
z = get_box_scores('2022-11-09', 'CHI', 'NOP', period='GAME', stat_type='BASIC')
print(z)

Note that DeMar DeRozan is returned twice, with two distinct sets of stats:

image

Checking basketball-reference.com, it looks like the second line should actually be assigned to Goran Dragic (note MP, FGA, FG):

image

This doesn't appear to be a one-off; it seems to be a recurring issue specific to certain players. Here are more player/game date examples if curious:

image
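For anyone who wants to hunt for more cases, here is a minimal sketch of a check that flags any player name appearing more than once in a box score. The helper name `duplicated_names` and the sample list are made up for illustration. The commented-out usage at the bottom assumes `get_box_scores()` returns a dict mapping each team to a DataFrame with a 'PLAYER' column, which is how the output above looks to me but is not something I have verified against the library code.

```python
from collections import Counter


def duplicated_names(names):
    """Return the names that occur more than once, in first-seen order."""
    counts = Counter(names)  # Counter keeps insertion order like a dict
    return [name for name, n in counts.items() if n > 1]


# Hypothetical stand-in for one team's player column (not real scraper output)
sample = ["DeMar DeRozan", "Zach LaVine", "DeMar DeRozan", "Nikola Vucevic"]
print(duplicated_names(sample))

# Assumed usage against the real result from the example script
# (assumes z is a dict of team -> DataFrame with a 'PLAYER' column):
# for team, df in z.items():
#     print(team, duplicated_names(df['PLAYER']))
```

This only catches the cases where the wrong name collides with a player already in the box score. A mis-assigned name that appears just once would still need to be compared against basketball-reference.com by hand.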

I also took a quick look at BOS-PHI from 10/18/2022 because Seth Curry returned two stat lines for PHI (he's not even on that team this season)... neither one is truly his, rather they should belong to P.J. Tucker and De'Anthony Melton. Very strange.

image

image

If anyone does take a look I'd be curious to hear thoughts. Thanks!