Add region specific cortical cell types

dosumis commented 7 months ago

Jorstad, Nikolas L., Jennie Close, Nelson Johansen, Anna Marie Yanny, Eliza R. Barkan, Kyle J. Travaglini, Darren Bertagnolli, et al. 2023. “Transcriptomic Cytoarchitecture Reveals Principles of Human Neocortex Organization.” Science 382 (6667): eadf6812. https://doi.org/10.1126/science.adf6812.

This paper identifies a number of cell types that are only found in specific cortical regions - visual cortex in particular:

In order to add these to CL we would need distinguishing properties. Laminar location and grouping by well known marker (SST) may work in some cases, but in both of these cases, it is important to pay attention to the details. e.g. L4 is a transcriptomic grouping that does not correspond strictly to location in L4 - but ~to L4. SST as a transcriptomic grouping may not correspond to SST expression in all subtypes.

NS-Forest markers would be ideal solution to finding distinguishing properties @scheuerm - did you & Renne already do work on these?

I think that these are also candidates for adding data-linked definitions to terms in CL (currently only in PCL).

Potential issue - naming via simple numbering has high potential to clash between papers. NS -Forest => potential solution for that too.

### Tasks
- [ ] https://github.com/obophenotype/cell-ontology/issues/2339
- [ ] https://github.com/obophenotype/cell-ontology/issues/2343
- [ ] https://github.com/obophenotype/cell-ontology/issues/2344
- [ ] https://github.com/obophenotype/cell-ontology/issues/2345
- [ ] https://github.com/obophenotype/cell-ontology/issues/2346
- [ ] https://github.com/obophenotype/cell-ontology/issues/2355
- [x] add transcriptomic definition of cortinal Sncg interneurons
- [x] add transcriptomic definition of cortical Pax6 interneuron
- [x] add transcriptomic definition of cortical Lamp5 interneuron
- [ ] add transcriptomic definition of Lamp5 Lhx6 interneuron
- [x] add transcriptomic definition of cortical Pax6 interneuron

### Tasks

scheuerm commented 7 months ago

@dosumis We've not analyzed these data yet, but we'll look into it.

AvolaAmg commented 7 months ago

I am building a work in progress list with definition for these cortical cells, with notes for discussion and tickets. This might change as the discussion follows. The astrocytes already have a Pull Request (#2328).

Initially I started to mine the v1 specialised cells based on the relative number of nuclei dissected from areas that contribute to each cell type (Figure 5E).

However, it is important to notice that the MERFISH data and the definition of cells as "v1 specialised" refer to another set of cells and it is not necessarly based on how enlarged is the population in that are (see image below). The specialisation of cells is based on transcriptomic distinctiveness of cell types in each area compared to the same type of cell in another area. For Instance, in V1, cell types that had a different transcriptomic profile compared to the same cell types in other brain regions were considered V1 specific. These scores are present on the supplementary table S11 and they are visualised in Figure 6A.

To determine the V1 specialised cell types I suggest:

Follow the scores from table S11
Consider the MERFISH results of figure 6 to categorise V1 specialised cell types
Specify v1 specialised cell types based on their soma location in v1 and based on which layer their soma is located
Mention in the comment of the description that their specialisation was attributed to their transcriptomic distinctiveness score >0.6.
Potentially add NF forest markers scores as they would provide the minimum number of markers to define the cell types.

Side note, L5 ET projecting

L5 ET project present distinct markers for brain area, some of them are showed in Figure 7D and supplementary table S13. I suggest to include an ACC L5 ET specific type, A1 specific type, DFC specific type based on their clear markers profile.

Finally, it might be necessary to revise the grouping terms for cortical neurons some examples in this list. Happy to discuss about it.

for instance It might be necessary to modify the grouping on CL for cell types downstream of intratelencephalic-projecting glutamatergic cortical neuron and add subgrouping like i.e. IT projecting primary motor cortex neuron, IT middle temporal gyrus etc, IT visual cortex.

dosumis commented 7 months ago

From discussion in DOS group: Consensus is to add types that group all region specific clusters under a specific subclass. e.g. for L4 IT.

The challenge is that we have 2 ways to do this.

We could use cross area clustering + origin of dissection. e.g. v1 is in blue below:

=> reference = combination of L2,3,4 and 6

OR

We could use region specificity of within area clusters (fig 6 in paper). This => L1-7 for within area

With this we can also add location from MerFISH.

In both cases we could calculate NS forest markers.

IN either case, in order to add a link to data, we really need a corresponding annotation on the CxG dataset.

dosumis commented 7 months ago

For terms already in CL for cross area subclasses, we should add links to Jorstad as reference data, consider adding NS-FOREST markers. In all cases we should also review definition, e.g. definitions based on laminar location or markers are not always reliable for transcriptomically defined types.

AvolaAmg commented 7 months ago

Find attached here some discussions/note and questions to ask to the author of the paper in this document. Please comment if you want to add anything relevant to discuss with the author.

Here there is the tracker for potential cell terms to add based on the paper, there are two tabs, one refers to new terms to add based on the Cross-area taxonomy (Fig. 5E) and one refers to new terms to add based on V1 specialisation, area specific taxonomy and MERFISH data (Fig. 6).

scheuerm commented 7 months ago

Here is the email message I sent to them earlier today:

Ed/Trygve/Rebecca,

We are starting to work on preparing NS-Forest marker sets for the neocortical cell types reported in the Jorstad et al. 2023 Science paper. Before we get too far along, we wanted to get some clarification for how the different datasets (Supercluster vs Dissection) made available through CellxGene relate to each other and understand the distinction between the CrossArea_cluster and WithinArea_cluster annotations and how they related to each other. Would you have a couple of minutes to walk us through the data?

obophenotype / cell-ontology

Add region specific cortical cell types #2330