Closed ipediez closed 1 year ago
Taking this dataset for secondary review
@ipediez will review today
Project
Donor
Specimen from organism
Thanks @ipediez really good points! The last one was a mistake, i would usually model it as you said re: the cell line and specimen ids in the cell suspension tab.
I checked the cell count estimates by comparing to the GEO matrix cell counts. The values pretty much correspond, with some discrepancy as expected because of quality filtering of the data post cell sorting and library preparation.
The cell counts you were unable to find makes sense because they are bulk samples that were sequenced, hence no single-cell estimates.
Here is a table just fyi of the cell counts. I have entered 33,844 as the total cell count in the project tab.
<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns="http://www.w3.org/TR/REC-html40">
GEO matrix cell counts | -- | -- | file name | GEO matrix cell count GSM4487943_Mouse_Nasal_IFNa_NS12_dge.txt.gz | 1603 GSM4487938_Mouse_Nasal_IFNa_NH11_dge.txt.gz | 1799 GSM4487941_Mouse_Nasal_IFNa_NH22_dge.txt.gz | 1467 GSM4487940_Mouse_Nasal_IFNa_NH21_dge.txt.gz | 1321 GSM4487939_Mouse_Nasal_IFNa_NH12_dge.txt.gz | 1744 GSM4487942_Mouse_Nasal_IFNa_NS11_dge.txt.gz | 1696 GSM4487944_Mouse_Nasal_IFNa_NS21_dge.txt.gz | 1453 GSM4487945_Mouse_Nasal_IFNa_NS22_dge.txt.gz | 1590 GSE148829_BEAS_Basal_Pops_TPM.txt.gz (human) | bulk GSE148829_Human1_Basal_Pops_TPM.txt.gz | bulk GSE148829_Human2_Basal_Pops_TPM.txt.gz | bulk Human Nasal ISSS | 8313 GSE148829_Human_Ileum_absorptiveAndCryptentero_dge.csv.gz | 11856 GSE148829_Human_lung_epithelial_cell_raw_counts.txt.gz | 1002 Mouse basal stim | bulk | Summary: publication versus GEO cell counts | | from supplementary table & publication (Irene's estimate) | from GEO matrix cell count Human Adult Inferior Turnibate Scraping (SeqWell1 and 3): 10.111 cells | 8313 Mouse nasal mucosa (seqWell 3): 11.738 cells | 12673 Human ileal small intestine (10x v2 3'): 22.220 cells | 11856 Human lung (seqWell 3): 1.637 cells | 1002 | | SUM | 33844
Project short name:
CovidCellTypes
Primary Wrangler:
Ami
Secondary Wrangler:
Irene
Associated files
Google Drive https://docs.google.com/spreadsheets/d/1WiPlkBzBZBrCu4SbOt2uGCe0vIxE-bGHvB5WKrI_Rdo/edit#gid=817397388
Google Sheet https://docs.google.com/spreadsheets/d/1WiPlkBzBZBrCu4SbOt2uGCe0vIxE-bGHvB5WKrI_Rdo/edit#gid=1646211853
Ingest: Updated link to project with submission --> project
Published study links
Paper: https://doi.org/10.1016/j.cell.2020.04.035
NOTE: in the paper supplementary files, they note the analysis of already published datasets from a previous study. We didn't have that study in ingest, so I have added it as a separate project using the paper doi: Ingest:
https://contribute.data.humancellatlas.org/projects/detail?uuid=326b36bd-0975-475f-983b-56ddb8f73a4d&tab=projectPaper: https://www.nature.com/articles/s41586-018-0449-8#Sec2Key Events