ebi-ait / hca-ebi-wrangler-central

This repo is for tracking work related to wrangling datasets for the HCA, associated tasks and for maintaining related documentation.
https://ebi-ait.github.io/hca-ebi-wrangler-central/
Apache License 2.0
7 stars 2 forks source link

phs001457.v1.p1 - TubularCellLupusNephritis #1037

Open ofanobilbao opened 1 year ago

ofanobilbao commented 1 year ago

Project short name:

TubularCellLupusNephritis

Primary Wrangler:

Ida

Secondary Wrangler:

@Wkt8

Link to Ingest

https://contribute.data.humancellatlas.org/projects/detail?uuid=97fca723-d9e9-4263-9f67-335416086f47&tab=project

Associated files

Published study links

Key Events

idazucchi commented 1 year ago

The paper and the metadata table mentions 8 blood samples, but there are no cell counts available for them so I've excluded them from the dataset

idazucchi commented 1 year ago

this is a large dataset and the graph validation is failing --> ops ticket

idazucchi commented 1 year ago

passed graph validation locally, still need to update the status in ingest but it's ready for secondary review!

Wkt8 commented 1 year ago

Secondary Review: Donor_Organism: Donors with Ethnicity.Text = Black have Ontology ID and Ontology Label empty. We should add Ethnicity Ontology ID and Ethnicity Ontology Label Ontology ID for African-American from HANCESTRO

Good job!

idazucchi commented 1 year ago

I've edited the spreadsheet but the dataset is stuck in graph validating and I can't update the project I ned dev help to:

  1. put the project back to metadata valid
  2. put the projet to graph valid once I've done the update - the project gets stuck due to its size but passed graph validation locally
idazucchi commented 1 year ago

I've added the missing ontology terms This needs to be moved to graph valid by a dev since the graph validation gets stuck in ingest

Something weird with the biomaterials: submission: 19293 spreadsheet: 19293 api: 19293 project, metadata tab: 19317

I'd like to make sure there are no extra biomaterials in the submission before we export

idazucchi commented 1 year ago

I've followed Amnon's suggestion: I deleted the submission and re-uploaded the spreadsheet

idazucchi commented 1 year ago

Project is graph valid - currently waiting for ebi-ait/dcp-ingest-central#926 to be in production before trying to export This project has ~19.000 biomaterials and ~19.000 links so it will likely fail spreadsheet generation due to the large linkingMap

idazucchi commented 1 year ago

Exporting for R27

arschat commented 1 year ago

Check if exported @Wkt8

Wkt8 commented 1 year ago

Did not export - needs re-exporting

idazucchi commented 1 year ago

I've hit export ~11.30am but no files have been exported, are the export jobs stuck in a queue? did the export fail for other reasons?

ofanobilbao commented 1 year ago

Amnon still has not found the cause. He will detach investigation from the release and this dataset. Export seems fine in his local computer. @amnonkhen to export it and let @idazucchi know when it's ready to submit import form

gabsie commented 1 year ago

This dataset is still problematic and is at the end of the queue.

idazucchi commented 1 year ago

Tried to export again at the end of R29 but the export got stuck again