ebi-ait / hca-ebi-wrangler-central

This repo is for tracking work related to wrangling datasets for the HCA, associated tasks and for maintaining related documentation.
https://ebi-ait.github.io/hca-ebi-wrangler-central/
Apache License 2.0
7 stars 2 forks source link

Triage new datasets - 56 new additions #993

Closed ESapenaVentura closed 1 year ago

ESapenaVentura commented 1 year ago

To triage

ofanobilbao commented 1 year ago

5th on the list: https://contribute.data.humancellatlas.org/projects/detail?uuid=81e5fdc9-7caf-4607-9908-04914bd8a9c9. I believe this is the pre-print for a paper that we already pushed to DCP: https://contribute.data.humancellatlas.org/projects/detail?uuid=5b328561-4a97-40ac-b7ad-6a90fc59d374&tab=project. Probably a wrangler will need to look into it. But this might keep happening. How do we want to deal with it? Do we include both DOI in the publication tab and only keep one project entry? We probably should right? Would the discovery script deal well with it?

ofanobilbao commented 1 year ago

7th on the list: https://contribute.data.humancellatlas.org/projects/detail?uuid=cc1558e6-e294-4fa8-b3df-5821fc89de26&tab=project. I've not marked for Catalogue yet. Want to confirm eligibility with the wranglers and Gabs. It all seems like Disease data

ofanobilbao commented 1 year ago

8th on the list: https://contribute.data.humancellatlas.org/projects/detail?uuid=260f11d4-5f2b-4ec1-a1b6-ad17ae40666a&tab=project. I've not marked for Catalogue yet. Want to confirm eligibility with the wranglers and Gabs. It all seems like Disease data

idazucchi commented 1 year ago

Can we put together a black-list of doi that get filtered ou by the script? This could be a temporary solution to the doi problem

10.1101/2020.01.15.897066
idazucchi commented 1 year ago

8th on the list: https://contribute.data.humancellatlas.org/projects/detail?uuid=260f11d4-5f2b-4ec1-a1b6-ad17ae40666a&tab=project. I've not marked for Catalogue yet. Want to confirm eligibility with the wranglers and Gabs. It all seems like Disease data, it's managed access and it's just one donor