AllenInstitute / abc_atlas_access

Documentation and examples demonstrating how to access data from the Allen Brain Cell Atlas
https://alleninstitute.github.io/abc_atlas_access/
Other
52 stars 22 forks source link

Add imputed notebook #54

Closed morriscb closed 3 months ago

morriscb commented 3 months ago

Add example for imputed data.

Merge after data is added to ABC Atlas.

lydiang commented 3 months ago

@morriscb - I drafted some intro section of the notebook and description markdown - let's start with these. Please fix language issues as you see fit.

Here is a draft intro section for the impute notebook

MERFISH whole brain spatial transcriptomics cells with imputed genes

In the MERFISH tutorial (part 1), we discussed how each set form the MERFISH data was mapped to the whole mouse brain taxonomy using a hierarchical correlation method. To further integrate the transcriptomics and spatial profiles of each cell type, 10Xv3 expression was projected or imputed into the MERFISH space. The basic idea is to compute the k-nearest neighbors (KNNs) among the 10Xv3 cells for each MERFISH cell and use the average expression of these neighbors for each gene as the impute values. Further details can be found in the methods section of Yao et. al.

In this notebook, we will show the basics on how to access the imputed gene data and perform some illustrative comparison between expression 10X, MERFISH measured and impute expression similar to Extended Data Fig 8 in the manuscript.


And draft intro for the data description markdown file

Imputed MERFISH spatial transcriptomics of a single adult mouse brain

The MERFISH spatial transcriptomics dataset (MERFISH-C57BL6J-638850) was mapped to the whole mouse brain taxonomy (WMB-taxonomy) using a hierarchical correlation method. To further integrate the transcriptomics and spatial profiles of each cell type, 10Xv3 expression was projected or imputed into the MERFISH space. The basic idea is to compute the k-nearest neighbors (KNNs) among the 10Xv3 cells for each MERFISH cell and use the average expression of these neighbors for each gene as the impute values. Further details can be found in the methods section of Yao et. al.

lydiang commented 3 months ago

@morriscb - feedback on the notebook

In this example, we create a dataframe comprising of expression of the our 3 selected genes for all the cells in the dataset. We can group the cells by various different metadata. For example, grouping expression by dissection region of interest shows that each of these genes have distinct spatial patterns. The MERFISH data allows us to visualize these patterns in anatomical context.

Change text from : "In this next section, we will look at the overlap of our selected genes with the C57BL6J MERSHFISH dataset."

To: "In this next section, perform some illustrative comparison between MERFISH measured and impute expression similar to Extended Data Fig 8 in the manuscript."

(Please add hyperlink to figure)

lydiang commented 3 months ago

For data description file

morriscb commented 3 months ago

Updated the files as requested. Take a look to see if you like the changes.

lydiang commented 3 months ago

More minor text edits for imputed notebook

In the description markdown file, the "Current Versions" links does not go to the right location in S3