kruser / atbat-mongodb

A Perl project that pulls data from MLB's AtBat servers and shoves them into a local MongoDB
Other
33 stars 11 forks source link

Minor League Data #8

Open trojanguard25 opened 10 years ago

trojanguard25 commented 10 years ago

I would like to use this tool to download minor league data. I have some Perl skills (though I haven't used MongoDB) so I can probably figure this out on my own. But any thoughts/pointers/suggestions are appreciated.

kruser commented 10 years ago

I haven't even started looking at that. Is the minor league data in similar format to the MLB data XML files?

I don't see much information here: http://gd2.mlb.com/components/game/

trojanguard25 commented 10 years ago

I've only spent about 4 hours trying out your code and looking at the data (and to be perfectly honest, most of that time was spent trying to resize the screen on my Ubuntu Virtual Machine), so I don't have an answer to that yet. However, there does seem to be a decent amount of game data for the minor league games (ex. http://gd2.mlb.com/components/game/aaa/year_2014/month_04/day_29/gid_2014_04_29_chraaa_rocaaa_1/). I don't know if it matches the MLB format, is just a subset, or a completely different schema. I'll have to dig into it and see how it compares.

kruser commented 10 years ago

Ah, I wasn't looking there. If it is the same format then we could probably add an option to the properties file that optionally pulls in data from the different leagues. You'll find the base game day endpoint is also configures in that props file.

As for this project? You'll see I branched recently for v1.0.0. This is meant to be kind of an archive as I'm going to change the data model quite a bit to be a better dimensional model. Flatter properties and more predictable names. Also adding more properties to tge pitches collection. Just an Fyi.