chanzuckerberg / single-cell-curation

Code and documentation for the curation of cellxgene datasets
MIT License
38 stars 23 forks source link

Draft Danio rerio (zebrafish) #995

Closed brianraymor closed 1 week ago

brianraymor commented 2 months ago

Design

October 21 2024 Superseded by draft 5.2.1-experimental.md


This draft design reflects additions to corresponding sections in schema 5.2.0. Reviewers are expected to be familiar with the CELLxGENE schema.

Editorial Notes that are inlined in the design below will not be surfaced in the schema.


Required Ontologies

Ontology OBO Prefix Release Download
Zebrafish Anatomy Ontology ZFA 2022-12-09 zfa.owl

Editorial Notes

Reviewing:


Required Gene Annotations

Source Required version Download
ENSEMBL (Zebrafish) GRCz11.112 (Ensembl 112) Danio_rerio.GRCz11.112.gtf

Editorial Notes

Also see Lawson Lab zebrafish transcriptome annotation


obs (Cell Metadata)

cell_type_ontology_term_id

No schema changes are required.

Editorial Notes

Also see Add CL bridge between species-specific and taxon-neutral ontology terms


development_stage_ontology_term_id

If organism_ontolology_term_id is "NCBITaxon:7955" for Danio rerio this MUST be the most accurate ZFS term and MUST NOT be ZFS:0000000 for Unknown.

Editorial Notes

Also see Development Stages

This may be outdated, but potential recommendations:

UBERON:0000104 life cycle
    UBERON:0000068 embryo stage
        UBERON:0000106 zygote stage
        UBERON:0000107 cleavage stage
            UBERON:0007232 2 cell stage
            UBERON:0007233 4 cell stage
            UBERON:0007236 8 cell stage
            ZFS:0000005 Cleavage:16-cell (Danio)
            ZFS:0000006 Cleavage:32-cell (Danio)
            ZFS:0000007 Cleavage:64-cell (Danio)
        UBERON:0000108 blastula stage
            ZFS:0000008 Blastula:128-cell (Danio)
            ZFS:0000009 Blastula:256-cell (Danio)
            ZFS:0000010 Blastula:512-cell (Danio)
            ZFS:0000011 Blastula:1k-cell (Danio)
            ZFS:0000012 Blastula:High (Danio)
            ZFS:0000013 Blastula:Oblong (Danio)
            ZFS:0000014 Blastula:Sphere (Danio)
            ZFS:0000015 Blastula:Dome (Danio)
            ZFS:0000016 Blastula:30%-epiboly (Danio)
        UBERON:0000109 gastrula stage
            ZFS:0000017 Gastrula:50%-epiboly (Danio)
            ZFS:0000018 Gastrula:Germ-ring (Danio)
            ZFS:0000019 Gastrula:Shield (Danio)
            ZFS:0000020 Gastrula:75%-epiboly (Danio)
            ZFS:0000021 Gastrula:90%-epiboly (Danio)
            ZFS:0000022 Gastrula:Bud (Danio)
        UBERON:0000110 neurula stage
        UBERON:0000111 organogenesis stage
            ZFS:0000049 segmentation (Danio)
                ZFS:0000023 Segmentation:1-4 somites (Danio)
                ZFS:0000024 Segmentation:5-9 somites (Danio)
                ZFS:0000025 Segmentation:10-13 somites (Danio)
                ZFS:0000026 Segmentation:14-19 somites (Danio)
                ZFS:0000027 Segmentation:20-25 somites (Danio)
                ZFS:0000028 Segmentation:26+ somites (Danio)
            UBERON:0004707 pharyngula stage
                ZFS:0000029 Pharyngula:Prim-5 (Danio)
                ZFS:0000030 Pharyngula:Prim-15 (Danio)
                ZFS:0000031 Pharyngula:Prim-25 (Danio)
                ZFS:0000032 Pharyngula:High-pec (Danio)
    UBERON:0000092 post-embryonic stage
        ZFS:0007000 hatching (Danio)
            ZFS:0000033 Hatching:Long-pec (Danio)
            ZFS:0000034 Hatching:Pec-fin (Danio)
        UBERON:0000069 larval stage
            ZFS:0000035 Larval:Protruding-mouth (Danio)
            ZFS:0000036 Larval:Day 4 (Danio)
            ZFS:0000037 Larval:Day 5 (Danio)
            ZFS:0000038 Larval:Day 6 (Danio)
            ZFS:0000039 Larval:Days 7-13 (Danio)
            ZFS:0000040 Larval:Days 14-20 (Danio)
            ZFS:0000041 Larval:Days 21-29 (Danio)
        UBERON:0000066 fully formed stage
            UBERON:0000112 sexually immature stage
                ZFS:0000042 Juvenile:Days 30-44 (Danio)
                ZFS:0000043 Juvenile:Days 45-89 (Danio)
            UBERON:0000113 post-juvenile

disease_ontology_term_id

No schema changes are required.

Editorial Notes

Propose to replace Disease Ontology with MONDO


organism_ontolology_term_id

organism_ontolology_term_id is "NCBITaxon:7955" for Danio rerio


sex_ontology_term_id

No schema changes are required.

Editorial Notes

There is ZFA:male organism and ZFA:female organism.


organism_cell_type_ontology_term_id

If organism_ontolology_term_id is "NCBITaxon:7955" for Danio rerio this MUST be the most accurate descendant of ZFA:0009000 for cell or "unknown" ...

Editorial Notes

Also see Add CL bridge between species-specific and taxon-neutral ontology terms


organism_tissue_ontology_term_id

If organism_ontolology_term_id is "NCBITaxon:7955" for Danio rerio this MUST be the most accurate descendant of ZFA:0100000 for zebrafish anatomical entity and MUST NOT be ZFA:0009000 for cell or any of its descendants.

Editorial Notes

There is the potential to block other terms after review.

This allows the use of ZFA:0001093 for unspecified

Also see Add UBERON bridge between species-specific and taxon-neutral ontology terms


tissue_ontology_term_id

No schema changes are required.

Editorial Notes

Also see Add UBERON bridge between species-specific and taxon-neutral ontology terms


tissue_type

Editorial Notes

Pending discussion about the requirements for organoids and cell cultures.


var and raw.var (Gene Metadata)

feature_reference

Key feature_reference
Annotator CELLxGENE Discover MUST annotate.
Value str. This MUST be the reference organism for a feature:

Reference Organism MUST Use
Danio rerio "NCBITaxon:7955"


References

Zebrafish Information Network Zebrafish information network, the knowledgebase for Danio rerio research ZFIN Data Load Sheets Template

From multiallele fish to nonstandard environments, how ZFIN assigns phenotypes, human disease models, and gene expression annotations to genes

Alliance of Genome Resources

Zebrafish Book

Zebrahub and Zebrahub – Multimodal Zebrafish Developmental Atlas Reveals the State-Transition Dynamics of Late-Vertebrate Pluripotent Axial Progenitors

A single-cell transcriptome atlas for zebrafish development

ZSCAPE - Zebrafish Atlas of Perturbed Embryos

The zebrafish anatomy and stage ontologies: representing the anatomy and development of Danio rerio

Line Designations and Wild-type Lines. Also Variants/Alleles.

A single cell transcriptome atlas of the developing zebrafish hindbrain

Demystifying Zebrafish

From Keir Balla: Tracing cell-type evolution by cross-species comparison of cell atlases An improved zebrafish transcriptome annotation for sensitive and comprehensive detection of cell type-specific genes

BGEE Reference

nayib-jose-gloria commented 1 week ago

sex_ontology_term_id No schema changes are required. Editorial Notes There is ZFA:male organism and ZFA:female organism.

@brianraymor to clarify, we are currently choosing not to accept these ZFA terms for this first run at an experimental validator, correct? We could do so, but it would violate schema 5.2.0 as written, as I believe they are not descendants of PATO:0001894

brianraymor commented 1 week ago

@nayib-jose-gloria - you can safely ignore Editorial Notes which are just my musings/research. That's why I wrote "No schema changes are required.". Per the DP call, I am drafting an experimental schema which (fingers crossed) will be ready by EOD.