brain-bican / taxonomy-development-tools

Tools to build and edit Cell Annotation Schema taxonomies.
Apache License 2.0
3 stars 1 forks source link

S1. Preparing a new taxonomy for TDT annotation #98

Open hkir-dev opened 6 months ago

hkir-dev commented 6 months ago

Epic:

Description

User: A scientist with a lot of biological and computational skills and basic knowledge of GitHub. Use Case: I have just completed a version of a taxonomy (AIT99) that I am happy with and I’m ready to share it with colleagues for joint annotation. What I have:

Tasks

dosumis commented 6 months ago

What fields (“columns”) are required for input files to make sure my taxonomy can be annotated with TDT?

  • There are no compulsory columns. Users need to submit a configuration file that adds some general metadata and documents what the columns in the input table refer to. We will improve doc (#44). We also plan to support configuration through the UI in future.
    • How do I need to format the input files prior to sharing? Is an AIT h5ad file (+ an annotation table?) sufficient?

If the taxonomy is represented in both an AIT h5ad file AND a table - the two must be in sync - columns names must match OBS names, taxonomy structure will match. However, we test this on initial load and report any problems back to users.

What, specifically, do I share with my colleagues so that they can annotate as well?

Currently the best way to do this is to share the repo. Your colleagues can then check out the repo and set it up TDT as per doc.

I’ve heard about GitHub and PURLs and CAS but I don’t know what they are. Do I need to worry about that?

See #111

GitHub - Generally no, except that all users MUST have a GitHub Account. Users also currently need to be able to check out (clone) a GitHub repo (our doc will recommend using GitHub Desktop to do this as it wraps the command line setup & linking to account etc). In future we hope to be able to wrap this step with our UI.

PURLs - Only in as far as they provide a simple way to generate and share versions as URLs via never change.

CAS - No need to know, although computationally savvy users familiar with Python may want to learn to use CAS-Tools.