Open startakovsky opened 7 years ago
The issue you're seeing is from the ./sigopt-beats-vegas/boxscores/scraper/srape_all
script. The logic for the scraping is in the scraper.py
module and nba.com regularly updates their API so that is likely the problem you're seeing.
While we do not regularly keep this example up-to-date, we encourage Pull Requests from the greater community.
Ran into the same problem, this really helped!
Can we close this issue?
Should we add comments to the readme maybe and resolve?
Ran into same problem, should fix or at least put into readme.
Actually tried to fix the scraper code, apparently the stats.nba pages uses a different URL and also the content within the page is js injected, so requests doesn't pull any of the stats, may need to use something else like selenium. May take a stab at it later.
The real solution here might be for us to take the original dataset and load it into a publicly available bucket on s3 :) Then the scraper is available for the curious but we don't run into so many web scraping issues
Sounds like a plan!
When I run the setup_env.sh in the sigopt beats vegas module, it just says "downloading season 1" and doesn't do anything. Are there any common problems that arise executing this?