BruceJohnJennerLawso / scrap

Hockey stats analysis done by scraping the data to a csv file, then processing/analyzing them with more python.
3 stars 0 forks source link

Use hockey summary project to fill out shot data going back to 1953 #97

Open BruceJohnJennerLawso opened 7 years ago

BruceJohnJennerLawso commented 7 years ago

The hockey-reference dataset only has shot data going back to the 1988 season, the hsp at flyershistory.com goes back much farther, to the 1953 nhl season, with period breakdowns of shot totals.

flyershistory.com/cgi-bin/hspgames.cgi

This will be tough to scrape directly, thinking some sort of socket system underneath the game object that can hook into an individual game dataset where requested, based on matching the date/time/opponent

BruceJohnJennerLawso commented 7 years ago

example of this would be

http://www.flyershistory.com/cgi-bin/poboxscore.cgi?H19520004

Goal is to get the game object to hook into H19520004 based on date and opponent

BruceJohnJennerLawso commented 7 years ago

Also, interestingly, there are some minimal pieces of box score data available in the early data under the hsp, such as here:

http://www.flyershistory.com/cgi-bin/poboxscore.cgi?H19260010

Its very inconsistent, and it seems to disappear after 1927, so initially we may just ignore that in favor of the consistent data available from 1953 onwards in box scores.