Open koleckt opened 3 years ago
Thanks for this suggestion. We will work on a new feature that provides a way to parallelize product downloads, and we will keep you informed of the progress in this issue.
See if the download method can accept a parameter referencing a generic pool / worker scheduler, compatible with Dask or other solutions.
Accept an executor as a parameter, such as a Dask Client or a concurrent.futures executor, which expose the same interface. See https://distributed.dask.org/en/latest/client.html
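A minimal sketch of what such a parameter could look like, not eodag's actual API. The names download_all, download_one and fake_download are hypothetical, introduced only for illustration. The sketch relies only on the executor having submit() and on the returned futures having result(), which both concurrent.futures executors and the Dask Client provide:

```python
from concurrent.futures import ThreadPoolExecutor


def download_all(products, download_one, executor=None):
    """Download every product, optionally through a caller-supplied executor.

    `download_one` is a hypothetical callable that downloads one product and
    returns its local path. `executor` is any object exposing
    `submit(fn, *args)` that returns a future with a `result()` method.
    """
    if executor is None:
        # No executor given: fall back to plain sequential downloads.
        return [download_one(product) for product in products]
    # Schedule all downloads first, then collect results in input order.
    futures = [executor.submit(download_one, product) for product in products]
    return [future.result() for future in futures]


def fake_download(name):
    """Stand-in for a real download (assumption: returns the archive path)."""
    return f"/tmp/{name}.zip"


with ThreadPoolExecutor(max_workers=4) as pool:
    paths = download_all(["S1A_0001", "S1A_0002"], fake_download, executor=pool)

sequential_paths = download_all(["S1A_0001", "S1A_0002"], fake_download)
```

Since only submit() and result() are used, a dask.distributed Client could in principle be passed in place of the thread pool, provided the callable and its arguments can be serialized to the workers.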
Some serialization might be needed (already implemented in eodag):
We can save a lot of processing time, especially for large sets of products (time series), if downloading and unzipping (for example, for S1 products) can be done in two different threads.
Currently, while downloading, the CPU is not used at full capacity, and while unzipping, the network is not used.
Doing both in parallel can significantly speed up processing code.
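To illustrate the idea, here is a rough sketch of a two-thread pipeline using only the Python standard library: one thread produces archives while the other extracts those already available. Everything here is an assumption for illustration: fake_download merely writes a small zip file locally instead of fetching a product, and download_and_unzip is not an eodag function.

```python
import queue
import tempfile
import threading
import zipfile
from pathlib import Path

_DONE = object()  # sentinel telling the extractor that no more archives will come


def fake_download(name, dest_dir):
    """Stand-in for a real download: writes a small zip archive to dest_dir."""
    archive = Path(dest_dir) / f"{name}.zip"
    with zipfile.ZipFile(archive, "w") as zf:
        zf.writestr(f"{name}/manifest.safe", "placeholder")
    return archive


def downloader(names, dest_dir, archives):
    """Network-bound stage: push each finished archive onto the queue."""
    try:
        for name in names:
            archives.put(fake_download(name, dest_dir))
    finally:
        archives.put(_DONE)  # always signal completion, even on error


def extractor(archives, extracted):
    """CPU/disk-bound stage: unzip archives as soon as they are available."""
    while True:
        archive = archives.get()
        if archive is _DONE:
            break
        target = archive.with_suffix("")  # e.g. S1A_0001.zip -> S1A_0001/
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        extracted.append(target)


def download_and_unzip(names, dest_dir):
    """Run both stages concurrently and return the extracted directories."""
    archives = queue.Queue()
    extracted = []
    workers = [
        threading.Thread(target=downloader, args=(names, dest_dir, archives)),
        threading.Thread(target=extractor, args=(archives, extracted)),
    ]
    for worker in workers:
        worker.start()
    for worker in workers:
        worker.join()
    return extracted


with tempfile.TemporaryDirectory() as tmp:
    results = download_and_unzip(["S1A_0001", "S1A_0002"], tmp)
    extracted_names = [path.name for path in results]
    manifests_exist = all(
        (path / path.name / "manifest.safe").is_file() for path in results
    )
```

With real products, the second archive would be downloading while the first is being extracted, so network and CPU are both kept busy. Whether threads are enough or a process pool is preferable depends on how much of the extraction time is spent outside the interpreter lock.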