Open ESapenaVentura opened 9 months ago
Projects that are not in exported status.
All set in graph valid, besides Reprogrammed_Dendritic_Cells
which needs update #1204
project_short_name | ingest_status | project_uuid | dcp | ingest |
---|---|---|---|---|
c844538b-8854-4a95-bd01-aacbaf86d97f |
||||
e255b1c6-1143-4fa6-83a8-528f15b41038 |
||||
8559a8ed-5d8c-4fb6-bde8-ab639cebf03c |
||||
41fb1734-a121-4616-95c7-3b732c9433c7 |
||||
77780d56-03c0-481f-aade-2038490cef9f |
||||
a2a2f324-cf24-409e-a859-deaee871269c |
||||
aff9c3cd-6b84-4fc2-abf2-b9c0b3038277 |
||||
c302fe54-d22d-451f-a130-e24df3d6afca |
||||
3cfcdff5-dee1-4a7b-a591-c09c6e850b11 |
||||
e526d91d-cf3a-44cb-80c5-fd7676b55a1d |
||||
78b2406d-bff2-46fc-8b61-20690e602227 |
||||
77dedd59-1376-4887-9bca-dc42b56d5b7a |
||||
5b910a43-7fb5-4ea7-b9d6-43dbd1bf2776 |
||||
9a23ac2d-93dd-4bac-9bb8-040e6426db9d |
||||
ede2e0b4-6652-464f-abbc-0b2d964a25a0 |
||||
116965f3-f094-4769-9d28-ae675c1b569c |
||||
Fetal/Maternal Interface | Graph validating | f83165c5-e2ea-4d15-a5cf-33f3550bffde |
Valid Publication Link | No Publication Data |
Enrique and I are exporting GompertsAirwatCfCells for the lung atlas, I've updated the doi as well and will check off the box as soon as the export has gone through
Hit validation for Fetal/Maternal Interface but takes some time to validate (Note to self @arschat , to see on monday if is stuck).
Update: Monday 201123 still stuck in graph validating
Update on 21 Nov 3 remaining projects
Short Name | uuid | State | Fixed doi in ingest |
---|---|---|---|
Fetal/Maternal Interface | f83165c5-e2ea-4d15-a5cf-33f3550bffde |
Graph validating | True |
humanOligodendrocytesCulture | ede2e0b4-6652-464f-abbc-0b2d964a25a0 |
Exporting | True |
Reprogrammed_Dendritic_Cells | 116965f3-f094-4769-9d28-ae675c1b569c |
Metadata valid | True |
if we have capcity this week we can look into the stuck projects - worst case scenario Fetal/Maternal Interface can be pushed to graph valid manually retry exporting for humanOligodendrocytesCulture
Short Name | uuid | State | Comment |
---|---|---|---|
Fetal/Maternal Interface | f83165c5-e2ea-4d15-a5cf-33f3550bffde |
Graph valid | Already been validated in the past - No need to re-run it now since you only modified project |
humanOligodendrocytesCulture | ede2e0b4-6652-464f-abbc-0b2d964a25a0 |
Exported | Got stuck in exporting - But project metadata has been exported, so we're cool |
Reprogrammed_Dendritic_Cells | 116965f3-f094-4769-9d28-ae675c1b569c |
Graph valid | Pushed to graph valid - Will need update in the future |
Fetal/Maternal interface dataset has the issue described for DCP1 dataset: Missing fields in the fastq metadata.There is a script to fill in the missing fields: https://github.com/ebi-ait/hca-ebi-dev-team/tree/master/scripts/fill_dcp1_file_metadata
While trying to run the script fill_dcp1_file_metadata in dry-run mode for the Fetal/Maternal Interface
, after some of the 15290 files get fixed I got the following error:
I run the script multiple times and I get that error different file each time.
Enrique has started downloading the files in order to re-submit via the update-uuids script.
current solution for Fetal/Maternal interface : reconstruct submission with the script and DCP2ify the dataset
Enrique to modify the script to create each type of entity separately while keeping track of the uuid changes - when capacity allows, probably in the new year
The script was already modified to be re-launched - and it works! However, the second part of the script creates the linking and relies on accessing several thousand of entities through the API, one by one, so it takes a big amount of time to execute
Currently modifying the script so that this data can be retrieved previously/stored during execution and accessed from memory. Will push the changes once I confirm they work
Putting the Fetal/Maternal interface on hold.
Doing it completely programmatically is borderline impossible with the amount of times the script needs to be retried and the manual work to ensure retries are clean.
I had another idea to make the process cleaner and easier: I have created another 2 scripts, complimentary to the first. Instead of submitting programmatically, we can submit to ingest, creating a submission with completely new UUIDs, and the new scripts will create a map by comparing IDs in the spreadsheet AND the submission.
Problem is, creating the submission in ingest has crashed it, giving an error and the submission can't be deleted.
I need to prioritize other work at the moment, but @arschat or @idazucchi if you can help me create the submission correctly (I can send you the spreadsheet) I should be able to finish the job!
Ida deleted the submission, and I created a new submission with this spreadsheet. fetalMaternal_interface.xlsx
However, the amount of entities crushed ingest (too many kubernetes requests).
('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))
It will be put on hold for now due to low dev capacity, but will have to tackle that again in a couple of weeks.
As discussed in ops review for the Reprogrammed_Dendritic_Cells
#1204 the protocol that caused graph invalid is fixed, and exported only metadata so the publication DOI will be updated. The whole project update however, is yet to be completed.
Since the DOI is fixed, we can consider this project as complete here.
@gabsie to inform @amnonkhen about the Fetal/ Maternal Interface problem, but put on hold for now.
Description of the issue
As per the message left by Jason on slack, there are a set of projects with DOI's that are either incorrect or need update: