jldbc / pybaseball

Pull current and historical baseball statistics using Python (Statcast, Baseball Reference, FanGraphs)
MIT License
1.19k stars 324 forks source link

Missing data from Statcast #272

Closed vfigueroa closed 1 year ago

vfigueroa commented 2 years ago

I have noticed that when I run data = statcast() to get yesterdays pitching data it returns an empty data frame attached are some pictures showing the last day I was able to get data(May 22nd). I am using a python notebook in VSCode and have restarted my kernel as well as removing my cache to try to troubleshoot this issue.

Screen Shot 2022-05-25 at 6 10 30 PM Screen Shot 2022-05-25 at 6 11 00 PM
tjburch commented 1 year ago

Have you found a resolution to this? I'm able to pull data from that time range.

This reads like it might have been a cache problem, maybe consider clearing that and trying again?

ksullivan1031 commented 1 year ago

I think I am seeing something similar when pulling statcast data for Aug 5,8,9. For some reason only these days are affected

Screen Shot 2022-08-11 at 1 25 34 PM

Can someone explain the proper way to clear the cache?

Thanks

tjburch commented 1 year ago

@ksullivan1031 - there should be a folder in your user directory: ~/.pybaseball/cache. Easiest way is to just clear that directory entirely (or you could try to search for the offending cached query and try to delete just it).

For more info about the cache see the docs. Those are reasonable queries so maybe for your use case you might consider just disabling it.

tjburch commented 1 year ago

Were you able to resolve? Closing if so.

tjburch commented 1 year ago

Closing due to inactivity and inability to reproduce