LieberInstitute / spatialDLPFC

spatialDLPFC project involving Visium (n = 30), Visium SPG (n = 4) and snRNA-seq (n = 19) samples
http://research.libd.org/spatialDLPFC/
16 stars 3 forks source link

Explore _position_ enrichment results #140

Closed lcolladotor closed 1 year ago

lcolladotor commented 1 year ago

Now that #124 is done, we'll need your help @kmaynard12 checking the position enriched genes for either or for both the analyses with and without dropping the white matter spots (Sp02D01 spots). That is, dropping 14,353 spots (https://github.com/LieberInstitute/spatialDLPFC/blob/5a7a09f032660de72d751063796ef688c62dab6f/code/analysis/16_position_differential_expression_noWM/logs/01_create_pseudobulk_data_noWM.9.txt#L109). In these analyses we are adjusting for the spatial domain, age, and sex. While I did the analysis for k from 3 to 28, the results below are only for k = 9 (Sp09). Note that Sp09, Sp09D06 gets completely wiped out https://github.com/LieberInstitute/spatialDLPFC/blob/5a7a09f032660de72d751063796ef688c62dab6f/code/analysis/16_position_differential_expression_noWM/logs/01_create_pseudobulk_data_noWM.9.txt#L331.

To summarize a bit what we have now, here's a little list:

Here are the links to these apps and files:

With white matter

Without white matter spots (aka Sp02D01 ones)

Example

As an example, the 2nd ranked gene for anterior > rest without WM (aka https://libd.shinyapps.io/spatialDLPFC_Visium_Sp09_position_noWM/) is GPM6A https://www.genecards.org/cgi-bin/carddisp.pl?gene=GPM6A&keywords=GPM6A that has the following info:

Entrez Gene Summary for GPM6A Gene Predicted to enable calcium channel activity. Involved in neuron migration and stem cell differentiation. Located in extracellular exosome. [provided by Alliance of Genome Resources, Apr 2022]

GeneCards Summary for GPM6A Gene GPM6A (Glycoprotein M6A) is a Protein Coding gene. Diseases associated with GPM6A include Retinitis Pigmentosa 29. Gene Ontology (GO) annotations related to this gene include calcium channel activity. An important paralog of this gene is GPM6B.

UniProtKB/Swiss-Prot Summary for GPM6A Gene Involved in neuronal differentiation, including differentiation and migration of neuronal stem cells. Plays a role in neuronal plasticity and is involved in neurite and filopodia outgrowth, filopodia motility and probably synapse formation. GPM6A-induced filopodia formation involves mitogen-activated protein kinase (MAPK) and Src signaling pathways. May be involved in neuronal NGF-dependent Ca(2+) influx. May be involved in regulation of endocytosis and intracellular trafficking of G-protein-coupled receptors (GPCRs); enhances internalization and recycling of mu-type opioid receptor. ( GPM6A_HUMAN,P51674 )

Screenshot 2022-12-21 at 3 25 18 PM

Similarly, it's also the 2nd ranked gene for anterior > rest at https://libd.shinyapps.io/spatialDLPFC_Visium_Sp09_position/

Screenshot 2022-12-21 at 3 28 52 PM

Though BASP1 https://www.genecards.org/cgi-bin/carddisp.pl?gene=BASP1&keywords=BASP1 goes from 9th ranked with WM to 3rd ranked without WM.

I suspect that if we did a Venn diagram of the FDR < 5% genes for a particular statistical test (say anterior > rest), the overlap will be quite high between both analyses (with and without WM) since we are adjusting for the spatial domains while computing these statistics. We could also do scatterplots of the t-statistics. But well, for now, I think that you wanted to check some of these results first before we make any plans for new plots.

Screenshots top 25 enriched genes

With WM

Screenshot 2022-12-21 at 3 43 34 PM Screenshot 2022-12-21 at 3 42 39 PM Screenshot 2022-12-21 at 3 43 02 PM

Without WM

Screenshot 2022-12-21 at 3 44 06 PM Screenshot 2022-12-21 at 3 44 26 PM Screenshot 2022-12-21 at 3 44 50 PM

(cc @kmartinow)

lcolladotor commented 1 year ago

After meeting with @stephaniehicks @kmartinow and others, we'll go more conservative and instead of dropping Sp02D01, we'll do the k28 domains that are WM annotated.

Screenshot 2022-12-22 at 2 10 03 PM
lcolladotor commented 1 year ago

https://libd.shinyapps.io/spatialDLPFC_Visium_Sp09_position_noWM/ is now updated after this change https://github.com/LieberInstitute/spatialDLPFC/commit/d56af05d8249f4ef1b8402f6e331e9ff4b83f885. I'll post a summary later tonight.

lcolladotor commented 1 year ago

Ok, the new summary for the analysis without white matter (this time, without SP2806, 16, 17, 20 and 28 instead of Sp02D01) is:

Without white matter spots (aka SP28 ones)

Here are the updated screenshots from the top 25 (FDR < 5%) genes for the enrichment tests:

Screenshot 2022-12-22 at 5 58 05 PM Screenshot 2022-12-22 at 5 58 37 PM Screenshot 2022-12-22 at 5 59 07 PM

Notably, in posterior > rest, MBP is the 7th (versus the 6th before) and MOBP is the 13th (same rank) as before when we were dropping Sp02D01. cc @kmartinow @stephaniehicks

Gene set enrichment

Though note that the gene set enrichment that we were seeing earlier with SFARI is gone now at https://libd.shinyapps.io/spatialDLPFC_Visium_Sp09_position_noWM/. Also, the number of genes decreased a bit in this version of the no white matter analysis (vs dropping Sp02D01).

Screenshot 2022-12-22 at 6 02 22 PM

I didn't take a screenshot of the previous version of the no white matter analysis for the gene set enrichment. Here's the one from with white matter https://libd.shinyapps.io/spatialDLPFC_Visium_Sp09_position/

Screenshot 2022-12-22 at 6 04 17 PM

Current status

So right now, I'm still inclined to trust the with white matter results more than the without white matter ones. cc @lahuuki @kmaynard12. This is likely due to some WM genes showing up in some layers / spatial domains as we get closer to WM, and without having a clear WM cluster (Sp09D06) https://github.com/LieberInstitute/spatialDLPFC/blob/c8fbcbf051a13def3f57f2a4201e00ca6ac1e8da/code/analysis/16_position_differential_expression_noWM/logs/01_create_pseudobulk_data_noWM.9.txt#L333 (in this version Sp09D09 is very rare https://github.com/LieberInstitute/spatialDLPFC/blob/c8fbcbf051a13def3f57f2a4201e00ca6ac1e8da/code/analysis/16_position_differential_expression_noWM/logs/01_create_pseudobulk_data_noWM.9.txt#L336) the without white matter analysis suffers (more) at trying to avoid WM vs GM genes.

Anyways, that's one way of seeing it. There are likely other ones.

If @kmaynard12 (and/or @kmartinow) like one version of this analysis, we can move on to some ideas @lahuuki and I were discussing today. These include (I'm likely missing some other ideas):

lcolladotor commented 1 year ago

@kmaynard12 let me know if you have questions about these results.

lcolladotor commented 1 year ago

I need to make the supplementary figure describing these results after what we discussed in a recent "spatial meeting".

lcolladotor commented 1 year ago

Completed with plots at https://github.com/LieberInstitute/spatialDLPFC/tree/main/plots/09_position_differential_expression, mainly https://github.com/LieberInstitute/spatialDLPFC/blob/main/plots/09_position_differential_expression/density_histogram_enriched_tstats.pdf and https://github.com/LieberInstitute/spatialDLPFC/blob/main/plots/09_position_differential_expression/sce_pseudo_gene_explanatory_vars_k09_large.pdf