Apologies for not being able to attend on 17/1 at 9.30. Regarding my task above: if the API being referred to is this one, https://www.ecmwf.int/en/forecasts/access-forecasts/ecmwf-web-api, then the protocol being used is WebDAV (HTTP(S)). It appears to rely on basic GET and POST requests, see https://github.com/ecmwf/ecmwf-api-client/blob/master/ecmwfapi/api.py
Not sure if this is of interest, since it depends on the amount of data to be downloaded, but there are some interesting comments regarding group (bulk) downloads and efficiency here: https://confluence.ecmwf.int/display/WEBAPI/Retrieval+efficiency
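To make the bulk-download point concrete, here is a minimal sketch of batching a date range into one request per calendar month, which is my reading of the retrieval-efficiency notes rather than anything confirmed for our setup. The helper names (`monthly_date_ranges`, `build_requests`, `download_all`), the `target_pattern` file naming and the contents of `base_request` are all hypothetical; only `ECMWFDataServer` and its `retrieve` method come from the ecmwf-api-client package.

```python
import calendar
from datetime import date


def monthly_date_ranges(start, end):
    """Split [start, end] into 'YYYY-MM-DD/to/YYYY-MM-DD' strings, one per calendar month."""
    if end < start:
        raise ValueError("end must not be before start")
    ranges = []
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        # Clip the first and last month to the requested period.
        first = max(start, date(year, month, 1))
        last_day = calendar.monthrange(year, month)[1]
        last = min(end, date(year, month, last_day))
        ranges.append(f"{first.isoformat()}/to/{last.isoformat()}")
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return ranges


def build_requests(start, end, base_request, target_pattern="data_{index:03d}.grib"):
    """Copy base_request once per month, filling in 'date' and 'target' (hypothetical naming)."""
    requests = []
    for index, date_range in enumerate(monthly_date_ranges(start, end)):
        request = dict(base_request)  # shallow copy so base_request stays untouched
        request["date"] = date_range
        request["target"] = target_pattern.format(index=index)
        requests.append(request)
    return requests


def download_all(requests):
    """Submit each request in turn; needs ecmwf-api-client installed and API credentials set up."""
    # Imported lazily so the helpers above can be used without the client installed.
    from ecmwfapi import ECMWFDataServer

    server = ECMWFDataServer()  # picks up the API key from ~/.ecmwfapirc or environment variables
    for request in requests:
        server.retrieve(request)
```

Usage would be along the lines of `download_all(build_requests(date(2021, 12, 15), date(2022, 2, 3), base_request))`, where `base_request` holds the remaining MARS keywords (dataset, param, levtype and so on) for whichever dataset ends up being used. Whether one request per month is the right grouping depends on how that dataset is archived, so this should be checked against the Confluence page before relying on it.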
@avgils it may be interesting to know whether container security (e.g., storing creds in HashiCorp Vault) is still something you would be interested in. @avgils if it turns out that snakemake is something you want to pursue and you need help with it, let me know. I do not know it well myself, but I have some colleagues who may be able to answer questions.
@backeb at the end of the sprint week or during the post-sprint recap, it would be nice if the LSDA could provide a brief overview of the solution implemented to date. Knowing a bit more about the technical setup and usage would allow me to comment in more detail.
31 Jan - 4 Feb
Sprint 3: 17-21 Jan 2022
Sprint objectives and tasks
Automate the running of WFLOW and the downloading of data
Workflow for WFLOW + pre/post-processing scripts
WFLOW Singularity container
@avgils developed a Singularity container for WFLOW; a test run with ERA5 input data was successful
Protocols or clients used by ECMWF API
Refer to https://github.com/c-scale-community/use-case-high-res-land-surface-drought-analysis/issues/15#issuecomment-980083634
Soil moisture data downloads from Copernicus Global Land Service
Workflow orchestration
Objectives for next sprint