cpsievert / pitchRx

Tools for scraping MLB Gameday data and Visualizing PITCHf/x
http://cpsievert.github.io/pitchRx/

Games not downloaded #52

Open iayork opened 7 years ago

iayork commented 7 years ago

Using R version 3.3.2 (2016-10-31), pitchRx 1.8.2, macOS 10.12.6.

Example: The second game of a doubleheader between Boston and Anaheim on 7/20/2015 is not downloaded.

library(dplyr)    # provides src_sqlite()
library(pitchRx)  # provides scrape()
db <- src_sqlite("/Users/iayork/Documents/Baseball/PitchFX/pitchfx2015_TEST.db", create = TRUE)
scrape(start = "2015-07-20", end = "2015-07-20", connect = db$con)

With start and end dates of 7/20/2015, only 10 games are downloaded, even though http://gd2.mlb.com/components/game/mlb/year_2015/month_07/day_20/ lists 11 games; the gameday_link "gid_2015_07_20_bosmlb_anamlb_2/" is skipped.
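One way to pin down exactly which links are skipped is to compare the links in the directory listing against the links that ended up in the database. The following is only a minimal sketch: `normalize_gid()` and `missing_gids()` are hypothetical helpers (not part of pitchRx), and the two input vectors are typed in by hand for illustration. In practice the scraped links would presumably come from the `gameday_link` column of one of the scraped tables, which is an assumption about the database layout.

```r
# Hypothetical helper: make links from both sources comparable by
# dropping surrounding whitespace and any trailing slash.
normalize_gid <- function(x) sub("/+$", "", trimws(x))

# Hypothetical helper: links present in the listing but absent from the scrape.
missing_gids <- function(listed, scraped) {
  setdiff(normalize_gid(listed), unique(normalize_gid(scraped)))
}

# Illustrative input only: the "_2" link is the one reported above;
# the "_1" link is assumed for the sake of the example.
listed  <- c("gid_2015_07_20_bosmlb_anamlb_1/",
             "gid_2015_07_20_bosmlb_anamlb_2/")
scraped <- c("gid_2015_07_20_bosmlb_anamlb_1",
             "gid_2015_07_20_bosmlb_anamlb_1")

missing <- missing_gids(listed, scraped)
print(missing)
```

`scrape()` also has a `game.ids` argument, so the resulting vector could be passed as `scrape(game.ids = missing, connect = db$con)` to request the skipped games directly. Whether that works around the problem here is untested.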

Other games are also skipped, but I haven't tracked down which ones. Downloading all of 2015 yields only 159 games for BAL, 160 for NYA, and so on.
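The per-team counts above can be reproduced from the gameday links alone. This is a rough sketch, assuming every link follows the pattern `gid_YYYY_MM_DD_<away>mlb_<home>mlb_<n>` seen in the example; `games_per_team()` is a hypothetical helper, and the sample links are illustrative rather than real scrape output.

```r
# Hypothetical helper: count distinct games per team from gameday links.
games_per_team <- function(gids) {
  # Drop trailing slashes and duplicates (one link per game).
  gids <- unique(sub("/+$", "", gids))
  parts <- strsplit(gids, "_", fixed = TRUE)
  # Fields 5 and 6 hold the away and home codes, e.g. "bosmlb", "anamlb".
  away <- vapply(parts, function(p) p[5], character(1))
  home <- vapply(parts, function(p) p[6], character(1))
  # Keep the three-letter team code and upper-case it (BOS, ANA, ...).
  teams <- toupper(substr(c(away, home), 1, 3))
  table(teams)
}

# Illustrative input: both games of the doubleheader discussed above.
counts <- games_per_team(c("gid_2015_07_20_bosmlb_anamlb_1",
                           "gid_2015_07_20_bosmlb_anamlb_2"))
print(counts)
```

Comparing these counts with each team's published schedule would show which teams are short. Note that the Gameday directories can also contain non-regular-season games, so the links may need filtering by date first.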

Some doubleheaders are downloaded, so it isn't as simple as the gameday_link ending with "_2" instead of "_1".