vishaalagartha / basketball_reference_scraper

A python module for scraping static and dynamic content from Basketball Reference.
MIT License
254 stars 91 forks source link

Schedule data updated to include Arena and Updated Page Headers causes get_player_suffix to hang #77

Closed justinbt21 closed 2 years ago

justinbt21 commented 2 years ago

Issues: 1) Was receiving following error/warnings when trying to get_schedule: image

2) get_player_suffix was hanging in while status_code = 200 loop because header changed. There is not longer and attributes associated to it image

Made a couple changes:

1) Updated df.append to pd.concat to remove warnings Added 'Arena' to result DataFrame since it was added on BballRef Updated del columns logic

2) Change find function to only look for 'h1'