google / weather-tools

Tools to make weather data accessible and useful.
https://weather-tools.readthedocs.io/
Apache License 2.0
216 stars 40 forks source link

Faster ingestion of Zarr into BQ by converting the chunk into pd.Dataframe #415

Closed DarshanSP19 closed 2 months ago

DarshanSP19 commented 1 year ago

We're having small chunks of a dataset while processing zarr in weather-mv. These changes will convert those chunks into dataframe and then extract rows directly from dataframes. As the chunk size in our control we can control the memory consumption during the pipeline.

Considerable Points

DarshanSP19 commented 2 months ago

The optimization changes are done in #473 .