droher / boxball

Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.
Apache License 2.0
120 stars 16 forks

Missing Retrosheet player data #69

Closed: smcpeck closed this issue 1 year ago

smcpeck commented 1 year ago

Describe the bug: When analyzing Retrosheet data, there is no way to join to player information.

Additional context: Retrosheet offers a biofile.csv data source that is not included in the upstream Chadwick repository. It would be very helpful to have this data as a table within the retrosheet schema.

Reference: https://www.retrosheet.org/biofile.htm

droher commented 1 year ago

What about the People table in the Baseball Databank schema? It has a retroid field.
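
For illustration, here is a minimal sketch of that join using an in-memory SQLite database. The table and column names (`people`, `event`, `player_id`, `retro_id`, `batter_id`) and the sample rows are assumptions made up for this example, not the actual boxball schema, so check them against your own database before adapting the query.

```python
import sqlite3

# Toy stand-ins for the two schemas. All table and column names here
# are hypothetical; the real boxball names may differ.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE people (          -- Baseball Databank style table
    player_id  TEXT,           -- Databank's own player ID
    retro_id   TEXT,           -- Retrosheet player ID
    name_first TEXT,
    name_last  TEXT
);
CREATE TABLE event (           -- Retrosheet style table
    game_id   TEXT,
    batter_id TEXT             -- holds a Retrosheet player ID
);
INSERT INTO people VALUES ('aaronha01', 'aaroh101', 'Hank', 'Aaron');
INSERT INTO event  VALUES ('GAME0001', 'aaroh101');
""")

# Join through the Retrosheet ID column, not the Databank player ID.
rows = conn.execute("""
    SELECT e.game_id, p.name_first, p.name_last
    FROM event AS e
    JOIN people AS p ON p.retro_id = e.batter_id
""").fetchall()

# Joining on the Databank player ID matches nothing, because the two
# ID systems use different values for the same player.
naive = conn.execute("""
    SELECT e.game_id, p.name_first, p.name_last
    FROM event AS e
    JOIN people AS p ON p.player_id = e.batter_id
""").fetchall()

print(rows)
print(naive)
```

The second query shows why a direct join on the Databank ID comes back empty: the Retrosheet tables carry Retrosheet IDs, so the match has to go through the column that stores them.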

smcpeck commented 1 year ago

Ahhh! Perfect. I missed that column and was trying to join directly to player_id.

Many thanks!