change date column to str prior to writing to parquet

ImagingDataCommons / idc-index-data

Python package providing the index to query and download data hosted by the NCI Imaging Data Commons

MIT License

1 stars 4 forks source link

change date column to str prior to writing to parquet #22

Closed vkt1414 closed 7 months ago

vkt1414 commented 7 months ago

Unlike CSV, parquet stores schema info as well. Bigquery changes date columns db dtype column. This can cause inconsistencies for downstream operations like writing to pandas df.

In the future, we may need to check for any date columns in upcoming indices (pathology, clinical, etc.)