openclimatefix / ocf_datapipes

OCF's DataPipe based dataloader for training and inference
MIT License

Use Threadpool Map to load Zarr concurrently #180

Closed: jacobbieker closed this 1 year ago

jacobbieker commented 1 year ago

Relates to #177

Pull Request

Description

This PR uses a thread pool in the PVMetNet datapipe to try loading the spatially and temporally sliced data in parallel.
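As a rough illustration of the idea (not the code in this PR), the sketch below maps a load call over several lazily evaluated slices with `concurrent.futures.ThreadPoolExecutor`. `LazySlice`, `load_concurrently`, and the `max_workers` value are hypothetical names and settings chosen for the example; in the real pipeline the slices would be Zarr-backed arrays.

```python
from concurrent.futures import ThreadPoolExecutor
import time


class LazySlice:
    """Hypothetical stand-in for a lazily loaded, already-sliced array."""

    def __init__(self, name, values):
        self.name = name
        self._values = values

    def load(self):
        # Simulate the I/O latency of reading chunks from storage.
        time.sleep(0.05)
        return list(self._values)


def load_concurrently(slices, max_workers=4):
    """Call .load() on every slice in a thread pool, preserving input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map yields results in the order of the inputs,
        # regardless of which thread finishes first.
        return list(pool.map(lambda s: s.load(), slices))


slices = [
    LazySlice("sat", range(0, 3)),
    LazySlice("nwp", range(3, 6)),
    LazySlice("pv", range(6, 9)),
]
loaded = load_concurrently(slices)
print(loaded)
```

Threads are only likely to help when the load step is dominated by I/O or by code that releases the GIL, which is the assumption behind this sketch.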

How Has This Been Tested?

By running the pipeline and comparing it to the default.
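A comparison of this kind could be sketched as follows. This is an assumed harness, not the one used for this PR: `fake_load` stands in for loading one slice, and `timed` is a hypothetical helper.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_load(i):
    # Hypothetical stand-in for loading one slice; sleeps to mimic I/O.
    time.sleep(0.02)
    return i * i


def timed(fn):
    """Run fn() and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start


def run_sequential(items):
    return [fake_load(i) for i in items]


def run_threaded(items, max_workers=8):
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fake_load, items))


items = list(range(8))
sequential, t_seq = timed(lambda: run_sequential(items))
threaded, t_thr = timed(lambda: run_threaded(items))

# Both paths should produce identical data; only the wall time should differ.
print(f"sequential: {t_seq:.3f}s, threaded: {t_thr:.3f}s, same output: {sequential == threaded}")
```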

codecov[bot] commented 1 year ago

Codecov Report

Merging #180 (78a8259) into main (1a391bc) will decrease coverage by 1.01%. The diff coverage is 22.22%.

@@            Coverage Diff             @@
##             main     #180      +/-   ##
==========================================
- Coverage   78.86%   77.85%   -1.01%     
==========================================
  Files         122      123       +1     
  Lines        4967     5054      +87     
==========================================
+ Hits         3917     3935      +18     
- Misses       1050     1119      +69     
| Impacted Files | Coverage | Δ |
|---|---|---|
| ocf_datapipes/utils/future.py | 19.75% <19.75%> | (ø) |
| ocf_datapipes/training/metnet_pv_site.py | 24.35% <28.57%> | (+0.74%) |
| ocf_datapipes/training/common.py | 100.00% <100.00%> | (ø) |