Open brianraymor opened 8 months ago
under this proposal, a request for a specific cellxgene-schema version within the cellxgene-ontology-guide API will result in the same tissue_general data that is published in the schema.
At what point should tissue_general or any of the other hand curated lists appear in the cellxgene-ontology-guide API? Should they be accessible in the API outside of a pinned cellxgene-schema version?
On cell-science-platform, @bkmartinjr wrote:
The issue is that any given Census requires the following to be built:
- a CxG schema (which implies ontology versions) _- a specific mapping for tissuegeneral mapping
Currently, there is no versioning on the latter outside of the Census builder. We would (ideally) like to have a fully pinned specification for any given "schema" version, that includes both the ontologies, and how those derived IDs are generated (the mapping).
tissue_general is metadata that is specific to Census and its applications such as Gene Expression. Why not simply define the specific mapping in the version of the cell census schema which is updated when the dataset schema is updated?
Currently, the Census schema points to source code as documentation which is not the best of practices from my perspective. (also @bkmartinjr @pablo-gar - shouldn't the reference now be cell-census code per _WMG used to also infer tissuegeneral, but now reads it from the Census.?
For example, list the set of UBERON terms that are appropriate for census schema N.N and describe the logic for the mapping per:
Changes to the list will be reflected in new census schema versions.