Closed sjspielman closed 2 years ago
CC @jaclyn-taroni
The error is coming from the first step in the analysis pipeline: scripts/get-tcga-capture_kit.py
. It has something to do with scraping from this external URL : https://api.gdc.cancer.gov/files
On line 60,
gdc_response = gdc_response.json()
I am seeing gdc_response
ends up defined as
{'warnings': {}, 'data': {'hits': [], 'pagination': {'count': 0, 'page': 0, 'sort': '', 'total': 0, 'size': 5000, 'from': 0, 'pages': 0}}}
As a consequence, subsequent parsing at line 64 is yielding empty lists without expected fields; capture_kits
ends up as an empty list after the loop.
My quickest suggestion here is just to add a try
/except
where we print a message and effectively pass
on the except
(?) to avoid a CI failure. pass
is not necessarily a great choice for handling errors, though.
Noting also this script using the v14
data release.
I don't think this actually gets used anymore. So if the results are captured in the repo, I think we can deprecate the module and remove it from CI.
Closed with #1278
Recently builds are failing with this error: This module has not recently been updated so source is not clear.