ebi-ait / hca-ebi-wrangler-central

This repo is for tracking work related to wrangling datasets for the HCA, associated tasks and for maintaining related documentation.
https://ebi-ait.github.io/hca-ebi-wrangler-central/
Apache License 2.0
7 stars 2 forks source link

EGAS00001006374 - PathogenicVariantsCardiomyopathies #1206

Open idazucchi opened 11 months ago

idazucchi commented 11 months ago

Project short name:

PathogenicVariantsCardiomyopathies

Primary Wrangler:

Ida

Secondary Wrangler:

Associated files

Published study links

Ingest

Key Events

idazucchi commented 11 months ago

was wrangled to cellxgene by Lattice, I'm checking if we can wrangle it to the DCP or if they're planning on doing it

idazucchi commented 11 months ago

Jennifer confirmed we can wrangle it

idazucchi commented 11 months ago

Controls 12 controls are reused (first 12 - see supp table 1 from HeartSingleCellsAndNucleiSeq) -- for the same reason I'm excluding HCAHeart***premrna_filtered_matrix.h5 files and BS_H15 BS_H20 BS_H25 BS_H26 BS_H35 BS_H37 I think these healthy controls are actually noted as ED* rather than BS* in the sample manifest

Medical records there is a lot of medical information/ maybe it's better to attach the info as a tsv rather than cram everything in the test result ?

Sequencer unknown but can't reach out for now because it's a wave 2 datasets

Libraries were sequenced on an Illumina HighSeq 4000 or NovaSeq with a targeted read number of 30,000-50,000 reads per nucleus

idazucchi commented 10 months ago

Ready for secondary review!

arschat commented 10 months ago

Hello Ida! Excellent work on a complex and demanding dataset! I agree on the supplementary table instead of filling the test result field with multiple data.

I will make only a couple of very minor comments:

Donor

Collection protocol

Dissociation protocol

Analysis files

Sequencing protocol

On EGA only the Illumina HiSeq 4000 sequencer is mentioned

idazucchi commented 10 months ago

applied suggestions, waiting to have permission to email authors to check the sequencer