Closed AlexanderPico closed 3 years ago
these signatures contain only 3,077 unique genes
Let's just count the C2/CP collection. The other "CGP" sets are just DE gene lists from perturbation experiments and are thus not comparable to curated pathways.
https://www.gsea-msigdb.org/gsea/msigdb/genesets.jsp?collection=CP
There are 4 sets in C2_CP for Alzheimers, from 4 sources (WP, KEGG, BIOCARTA and REACTOME). Total unique genes 89. Spreadsheet here: https://www.dropbox.com/s/2hel6rllbsj0juf/MSigDB_Counts_C2-CP.xlsx?dl=0 And screenshot of the summary here:
Wait, how are the total unique only 89 when KEGG and WP each have 166 and 150?
Updated: Combined unique is 257. Updated spreadsheet.
(The 89 count were the dups, sorry)
This page (https://www.gsea-msigdb.org/gsea/msigdb/genesets.jsp?collection=C2) has 11 signatures with "Alzheimer" in their name.
Click on each to find a "download gene set" option, e.g., GMT and TXT formats.
Can you help get a count for unique genes across these 11 sets?