Open abaumann opened 5 months ago
@xbrianh if you're still active around these parts, would love to get your thoughts. We're seeing this issue in production Terra fairly often. Thanks!
cc @benedictpaten
Hi @aednichols It’s been a while, but I’ll try to take a look this weekend.
@abaumann @aednichols @benedictpaten
I've created a draft PR that addressed some of the ungraceful exit issues raised by @abaumann. This is by no means ready to merge but it should point you in the right direction, especially this snippet.
I'd be open to a brief discussion with whomever maintains this repo, just for old times sake :)
We have a reproducible test case from a collaborator trying to use getm within Terra for TCGA data - this happens transiently, but can be reproduced when trying to download at scale from TCGA buckets. I am not sure why it happens as there are simply Google signed urls, but getm doesn't handle the error gracefully, leaving a process open and causing the WDL task to hang indefinitely (costing $ for the user).
In order to help resolve this I wanted to check if the following could be updated in getm:
Thank you!
Here is output from getm running in very verbose logging mode which shows the error: