brain-bican / taxonomy_curation

A repo for tracking taxonomy curation efforts - including taxonomies from published work as well as prepub.

Curate annotation transfer from MTG to Siletti taxonomies #9

Open dosumis opened 11 months ago

dosumis commented 11 months ago

Depends on

dosumis commented 8 months ago

MTG taxonomy is here: https://github.com/hkir-dev/human-neocortex-middle-temporal-gyrus

Annotation transfer mappings (on name) are here: https://docs.google.com/spreadsheets/d/1cC6L9oOGk7tb8pFtBOtol-caYculMdkALcPF-MxhRmM/edit#gid=1294367862 (Column F)

Relevant part of CAS:

transferred_annotations (list)

Example:

label: MGE_241
labelset: Cluster
accession: CS202210140_242
transferred_annotations:
    - transferred_cell_label: sst
      source_taxonomy:  # we need to publish to PURL
      source_node_accession: 8fa477a378  # Publishing will allow us to generate a namespaced accession
      algorithm_name:  # N/A - they used a bespoke method
      comment: "We performed PCA (50 components) on our full dataset, trained a random forest classifier (scikit-learn, class_weight='balanced', max_depth=50) on the MTG labels, and then predicted labels for all cells. We labeled each cluster with the mode of its constituent cells if two conditions were met: more than 0.8 of predicted labels matched the mode, and the mean probability of these predictions was greater than 0.8."
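
For reference, the cluster-labelling rule quoted in the comment can be sketched in a few lines of Python. This is only an illustration of the two thresholds, not the authors' code: the function name `label_cluster` and its inputs are hypothetical, and it assumes "the mean probability of these predictions" means the mean over the cells whose predicted label equals the mode. The PCA and random forest steps are not reproduced here.

```python
from collections import Counter
from statistics import mean


def label_cluster(predicted_labels, probabilities,
                  min_fraction=0.8, min_mean_prob=0.8):
    """Return the modal predicted label for a cluster, or None.

    predicted_labels: one predicted MTG label per cell in the cluster.
    probabilities: classifier probability of each cell's predicted label.
    (Hypothetical helper; thresholds taken from the quoted comment.)
    """
    if not predicted_labels:
        return None

    # Most common predicted label among the cluster's cells.
    mode_label, mode_count = Counter(predicted_labels).most_common(1)[0]
    fraction = mode_count / len(predicted_labels)

    # Assumption: average only over the cells that match the mode.
    mode_probs = [p for lab, p in zip(predicted_labels, probabilities)
                  if lab == mode_label]

    if fraction > min_fraction and mean(mode_probs) > min_mean_prob:
        return mode_label
    return None
```

A cluster that fails either condition returns `None`, which would correspond to a cluster with no transferred annotation.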

Strategy:

  1. Work branches on Siletti repos # Turns out that this breaks TDT (presumably unless everything worked on is in .gitignore)
  2. Install CAS-tools. Map AT on names (tracking anything that doesn't map).
  3. Generate CAS objects & insert into
  4. Run validator built into CAS.
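
A rough sketch of the name-mapping part of step 2, in plain Python. It does not use the CAS-tools API; the function name `build_transferred_annotations` and both input dictionaries are hypothetical stand-ins for the spreadsheet column and the MTG taxonomy. The output field names follow the example above. Matching names case-insensitively after trimming whitespace is an assumption, and anything that does not map is collected rather than dropped.

```python
def build_transferred_annotations(cluster_to_label, label_to_accession,
                                  source_taxonomy=""):
    """Map transferred labels to MTG accessions by name.

    cluster_to_label: Siletti cluster label -> transferred MTG label
        (e.g. read from the mapping spreadsheet).
    label_to_accession: MTG cell set label -> accession
        (e.g. read from the MTG taxonomy).
    Returns (annotations, unmapped); both inputs are hypothetical shapes.
    """
    # Assumption: names are compared case-insensitively, ignoring
    # surrounding whitespace.
    normalised = {name.strip().casefold(): acc
                  for name, acc in label_to_accession.items()}

    annotations = {}
    unmapped = []
    for cluster, label in cluster_to_label.items():
        accession = normalised.get(label.strip().casefold())
        if accession is None:
            # Track anything that doesn't map, for manual review.
            unmapped.append((cluster, label))
            continue
        annotations[cluster] = [{
            "transferred_cell_label": label,
            "source_taxonomy": source_taxonomy,
            "source_node_accession": accession,
        }]
    return annotations, unmapped
```

For example, with `{"MGE_241": "sst"}` as the cluster mapping and `{"Sst": "8fa477a378"}` as the MTG lookup, `MGE_241` receives one entry with `source_node_accession` set to `8fa477a378`, and the unmapped list is empty.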