ebi-ait / hca-ebi-wrangler-central

This repo is for tracking work related to wrangling datasets for the HCA, associated tasks and for maintaining related documentation.
https://ebi-ait.github.io/hca-ebi-wrangler-central/
Apache License 2.0

GSE109816 GSE121893 - HeartReconstructionPostHF #1239

Open idazucchi opened 7 months ago

idazucchi commented 7 months ago

Project short name:

HeartReconstructionPostHF

Primary Wrangler:

Ida

Secondary Wrangler:

Associated files

Published study links

Ingest

Key Events

idazucchi commented 7 months ago

Data

I tried to download the data but it failed, so I've asked the SRA help desk for help with the error. Enrique tried to download both accessions and failed. Wei tried to download just one accession and failed.

I think this is due to an error in AWS, so the only thing I can do is wait for SRA's reply.

cell suspension

There are too many cell suspensions (CS) for the analysis file input, so we need to make plate-based CS. However, some plate labels are shared between accessions.
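One possible way to handle the shared plate labels is to key each plate-based cell suspension on the accession plus the plate label. This is only a sketch: the record fields (`accession`, `plate`, `cell_id`), the plate label `P1` and the cell IDs are hypothetical and do not come from the dataset metadata.

```python
from collections import defaultdict


def plate_suspension_ids(cells):
    """Group per-cell records into one cell suspension per (accession, plate)."""
    groups = defaultdict(list)
    for cell in cells:
        # Prefixing the plate label with the accession keeps labels that are
        # shared between accessions from collapsing into one suspension.
        cs_id = f"{cell['accession']}_{cell['plate']}"
        groups[cs_id].append(cell["cell_id"])
    return dict(groups)


# Hypothetical records: the same plate label "P1" appears in both accessions.
cells = [
    {"accession": "GSE109816", "plate": "P1", "cell_id": "c1"},
    {"accession": "GSE109816", "plate": "P1", "cell_id": "c2"},
    {"accession": "GSE121893", "plate": "P1", "cell_id": "c3"},
]
suspensions = plate_suspension_ids(cells)
```

With these example records, the two accessions end up as two separate plate-based suspensions even though they share a plate label.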

arschat commented 5 months ago

Ida tried to download the data but it failed, so she asked the SRA help desk for help with the error. Enrique tried to download both accessions and failed. Wei tried to download just one accession and failed.

Arsenios tried to download just one accession and failed.

We will try another strategy: downloading individual donors by searching for the donor name in the Run Selector search bar. This works for healthy donor N2; we will continue with the other donors and track the progress here.
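If a metadata table is exported from the Run Selector, the per-donor run lists could also be built with a small script rather than by searching each donor by hand. This is a rough sketch only: the column names `Run` and `donor` are assumptions about the exported table, and the run accessions and donor N3 in the example are made-up placeholders.

```python
import csv
import io
from collections import defaultdict


def runs_by_donor(table_text, run_column="Run", donor_column="donor"):
    """Return {donor: [run accessions]} from CSV text with a header row."""
    groups = defaultdict(list)
    for row in csv.DictReader(io.StringIO(table_text)):
        groups[row[donor_column]].append(row[run_column])
    return dict(groups)


# Placeholder table; real column names and accessions will differ.
example_table = "Run,donor\nSRR0000001,N2\nSRR0000002,N2\nSRR0000003,N3\n"
donor_runs = runs_by_donor(example_table)
```

Each donor's run list can then be downloaded and ticked off separately.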

Healthy individuals

arschat commented 4 months ago

All files have been downloaded to s3://hca-ncbi-cloud-data/. Created an hca-util area for HeartReconstructionPostHF: 854f5cac-7550-4369-8491-415bc8f74879.

HeartReconstructionPostHF_SRR_accessions.txt
HeartReconstructionPostHF_add_to_hca-util_area.txt
HeartReconstructionPostHF_remove_from_cloud_delivery_area.txt

idazucchi commented 4 months ago

The submission is too large to upload to ingest. I've generated UUIDs for all the entities and I will need Enrique's help to generate the submission.
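For reference, pre-generating UUIDs for entities can be done with Python's standard `uuid` module. This is a minimal sketch, not the script actually used here; the entity identifiers are hypothetical, and the use of random (version 4) UUIDs is an assumption.

```python
import uuid


def assign_uuids(entity_ids):
    """Map each entity identifier to a freshly generated random (version 4) UUID."""
    return {entity_id: str(uuid.uuid4()) for entity_id in entity_ids}


# Hypothetical entity identifiers, for illustration only.
entity_uuids = assign_uuids(["donor_1", "specimen_1", "cell_suspension_1"])
```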

idazucchi commented 4 months ago

Generating the submission with a script is not feasible (it takes 3+ days of monitoring the script), so this dataset is stalled until we can address the reason for the timeout (?) or otherwise make sure that ingest can process large submissions.