cms-dpoa / cloud-processing

Exploring the usage of public cloud resources for CMS open data processing
GNU General Public License v3.0
0 stars 0 forks source link

Quantify copy time from the processing pod to nsf vs bucket #43

Open katilp opened 2 months ago

katilp commented 2 months ago

When the job processing ends (step runpfnano-template in the workflow, it copies the resulting file to the shared area (NSF vs bucket, depending on the workflow).

Quantify how long it takes to copy a typical output file to NSF and bucket, respectively.

To do this,

katilp commented 2 months ago

For nfs, check the exit timestamp for the argo job with

argo logs pfnano-process-<NNN> -n argo | grep exited | grep pfnano

(pls check)

and compare that to the closing message of the last file, for example

argo logs pfnano-process-<NNN> -n argo | grep -5 "MessageLogger Summary" | grep Closed

Probably grep Closed | tail -1 would do as well.

For the bucket workflow, you get this from the job logs, e.g.

time="2024-08-20T13:24:35.190Z" level=info msg="sub-process exited" argo=true error="<nil>"
time="2024-08-20T13:24:35.306Z" level=info msg="/code/scatter -> /var/run/argo/outputs/artifacts/code/scatter.tgz" argo=true
time="2024-08-20T13:24:35.308Z" level=info msg="Taring /code/scatter"
time="2024-08-20T13:25:15.325Z" level=info msg="archived 2 files/dirs in /code/scatter"

but do check that there's no difference in doing as for nfs