ebi-ait / hca-ebi-wrangler-central

This repo is for tracking work related to wrangling datasets for the HCA, associated tasks and for maintaining related documentation.
https://ebi-ait.github.io/hca-ebi-wrangler-central/
Apache License 2.0

Contributor - AdultHumanBrain #1013

Open ESapenaVentura opened 1 year ago

ESapenaVentura commented 1 year ago

Project short name:

AdultHumanBrain

Primary Wrangler:

Ida

Secondary Wrangler:

TBD

Associated files

Published study links

Key Events

idazucchi commented 1 year ago

Fastq: The data in nemoArchive is stored in a tree structure, with top-level directories named after the 10x sample ID. The samples are described in supplementary table 1. Problem: there are 606 samples in the metadata but only 260 directories in Nemo, so I'm sending an email to ask for clarification.
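
A check like the one above (metadata samples vs. archive directories) could be scripted roughly as follows. This is only a sketch: the function names are made up for illustration, and it assumes the sample IDs have already been read from the supplementary table and that the top-level directory names are available from a local listing or mirror of the archive.

```python
from pathlib import Path


def list_sample_directories(archive_root):
    """Return the names of the top-level directories under archive_root.

    Assumes archive_root is a local path (e.g. a mirror or mounted copy);
    this does not talk to any remote archive.
    """
    return [p.name for p in Path(archive_root).iterdir() if p.is_dir()]


def compare_samples(metadata_ids, directory_names):
    """Compare metadata sample IDs against directory names.

    Returns a tuple (missing, unexpected):
      missing    - IDs in the metadata with no matching directory
      unexpected - directories with no matching ID in the metadata
    """
    expected = set(metadata_ids)
    present = set(directory_names)
    missing = sorted(expected - present)
    unexpected = sorted(present - expected)
    return missing, unexpected
```

Passing the IDs from the metadata and the output of `list_sample_directories` to `compare_samples` gives the list of samples to ask the authors about, plus any directories that the metadata does not describe.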

In the meantime I'm filling in the biomaterial and protocol tabs and mapping the organ part terms to the ontology (this is very specific neuroanatomy that I'm not familiar with, so it will take some time).

idazucchi commented 1 year ago

Peter confirmed that some fastq files are missing and the project is on hold until this is resolved (thread)

idazucchi commented 1 year ago

All the data is in Nemo right now, but the project has lower priority than Lung v2. I've sent an email to Peter to ask for neuroanatomical term mappings before I proceed with the work anyway

idazucchi commented 1 year ago

Data: All the fastq files have been uploaded to Nemo; however, the majority are under restricted access, so I still don't have full access to the raw files. I've just emailed Peter about this to see if it's intentional or if there is a different way to download the data.

Organ part ontology: I've mapped all the organ part terms manually and sent the list for review - see it here

This is stalled until I have access to the raw files.

idazucchi commented 1 year ago

The authors sent back comments on the neuroanatomical term mappings but no information on fastq access, so I'm still stalled.

idazucchi commented 1 year ago

Neuroanatomical terms: I checked the neuroanatomical term mappings and found that one term suggested by the authors is in OLS but not in HCAO - anterior cortical amygdaloid nucleus (UBERON:0034991).

If we'll be phasing out HCAO soon, there's no point in opening a ticket about this issue.

Fastqs: I pinged Peter again to see if we can get access to all the fastq files.

idazucchi commented 11 months ago

I'm trying to download the files again; it looks like they've become available now.