Closed matrs closed 5 years ago
Thanks a lot for reporting this.
Apparently, org.Sc.sgd.db
is the only org.db package that doesn't contain a SYMBOL
column.
For org.Sc.sgd.db
the gene symbol seems to be rather stored in the GENENAME
column
> AnnotationDbi::mapIds(org.Sc.sgd.db, keytype="ENTREZID", keys=c("853972","851092","855323"), column="GENENAME")
'select()' returned 1:1 mapping between keys and columns
853972 851092 855323
"PCK1" "FBP1" "CAT8"
and the gene name in the DESCRIPTION
column
> AnnotationDbi::mapIds(org.Sc.sgd.db, keytype="ENTREZID", keys=c("853972","851092","855323"), column="DESCRIPTION")
'select()' returned 1:1 mapping between keys and columns
853972
"Phosphoenolpyruvate carboxykinase; key enzyme in gluconeogenesis, catalyzes early reaction in carbohydrate biosynthesis, glucose represses transcription and accelerates mRNA degradation, regulated by Mcm1p and Cat8p, located in the cytosol"
851092
"Fructose-1,6-bisphosphatase; key regulatory enzyme in the gluconeogenesis pathway, required for glucose metabolism; undergoes either proteasome-mediated or autophagy-mediated degradation depending on growth conditions; glucose starvation results in redistribution to the periplasm; interacts with Vid30p"
855323
"Zinc cluster transcriptional activator; necessary for derepression of a variety of genes under non-fermentative growth conditions, active after diauxic shift, binds carbon source responsive elements; relative distribution to the nucleus increases upon DNA replication stress"
I will ask on the Bioc-devel mailing list where this inconsistency comes from.
As a work-around, you can do the following:
configEBrowser("SYM.COL", "GENENAME")
configEBrowser("GN.COL", "DESCRIPTION")
and try running
eaBrowse(sbea_test)
again.
You can restore the default configuration settings via
configEBrowser()
Note that I just introduced access to these columns, so you will need the latest release version of EnrichmentBrowser (v2.14.2) which will become available under http://bioconductor.org/packages/EnrichmentBrowser within 24 hours.
If you want to try it out right away you can install the devel version (v2.15.6) directly from github via:
BiocManager::install("lgeistlinger/EnrichmentBrowser")
Hey, thank you for your quick answer, I'll try it and I'll let you known
About the missing SYMBOL
column, maybe It's because there are too many missing mappings between any organizations' IDs and HUGO names?
> entrez2genename <- (AnnotationDbi::mapIds(org.Sc.sgd.db, keytype="ENTREZID", keys=rownames(se_test), column="GENENAME"))
'select()' returned 1:1 mapping between keys and columns
> sum(is.na(entrez2genename))
[1] 603
I think the issue comes from the fact that org.Sc.sgd.db
is created based on data from the Saccharomyces Genome Database (SGD), as opposed to other org.db packages such as org.Hs.eg.db
that are created based on data from the NCBI.
And for some reason (likely historical), the SGD uses gene name for what is typically referred to by gene symbol (see eg the SGD entry for PCK1).
I am closing this. Feel free to reopen if you find the suggested work-around to not work for your purposes.
Hello, First, thanks for your package, it's been really useful for me. I've been working with RNA-seq from Saccharomyces cerevisiae. I had a pipeline based on
edgeR
so I take the results from it and made aSummarizedExperiment
object as specified in theEnrichmentBrowser
manual.When I run
Everything appears to be OK. When I look at the results, I got:
But when I try
eaBrowse
, it fails:So I suppose this error is related to the fact that the "org" S. cerevisiae db,
org.Sc.sgd.db
, doesn't have a column forSYMBOL
Is this needed to build the links between "gene names/symbols" and the pages at NCBI showed in the volcano plots? If that's the case, could it be possible to use the already available "entrez ids" to do this?