google / weather-tools

Tools to make weather data accessible and useful.
https://weather-tools.readthedocs.io/
Apache License 2.0
203 stars 38 forks

Faster ingestion of Zarr into BQ by converting each chunk into a pd.DataFrame #415

Open DarshanSP19 opened 8 months ago

DarshanSP19 commented 8 months ago

When processing Zarr in weather-mv, we work with small chunks of the dataset. These changes convert each chunk into a DataFrame and then extract rows directly from that DataFrame. Because the chunk size is under our control, we can control memory consumption during the pipeline.
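The idea can be illustrated with a minimal sketch. This is not the code from this change: the helpers `iter_chunks` and `chunk_to_rows`, the sample dataset, and the chunk sizes are all hypothetical, and a small in-memory xarray dataset stands in for a Zarr store. It only shows the general pattern of slicing a dataset into bounded chunks, converting each chunk with `Dataset.to_dataframe()`, and reading rows off the resulting DataFrame.

```python
import itertools

import numpy as np
import pandas as pd
import xarray as xr


def iter_chunks(ds, chunk_sizes):
    """Yield sub-datasets of ds, at most chunk_sizes[dim] long along each listed dim.

    Hypothetical helper: dims not listed in chunk_sizes are kept whole.
    """
    dims = list(chunk_sizes)
    ranges = [range(0, ds.sizes[d], chunk_sizes[d]) for d in dims]
    for starts in itertools.product(*ranges):
        indexer = {d: slice(s, s + chunk_sizes[d]) for d, s in zip(dims, starts)}
        yield ds.isel(indexer)


def chunk_to_rows(chunk):
    """Convert one chunk to a DataFrame, then to a list of row dicts."""
    # reset_index() turns the coordinate MultiIndex into ordinary columns.
    df = chunk.to_dataframe().reset_index()
    return df.to_dict(orient="records")


# Small in-memory dataset standing in for a Zarr store (assumption).
ds = xr.Dataset(
    {
        "temperature": (
            ("time", "latitude", "longitude"),
            np.arange(24.0).reshape(2, 3, 4),
        )
    },
    coords={
        "time": pd.date_range("2024-01-01", periods=2),
        "latitude": [10.0, 20.0, 30.0],
        "longitude": [0.0, 90.0, 180.0, 270.0],
    },
)

# Only one chunk's DataFrame is materialized at a time inside the loop.
rows = []
for chunk in iter_chunks(ds, {"time": 1, "latitude": 2}):
    rows.extend(chunk_to_rows(chunk))
```

In this sketch, peak memory for the conversion step depends on the chunk size passed to `iter_chunks` rather than on the size of the full dataset, which is the property the description relies on.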

Points to Consider