Closed Natpod closed 1 year ago
It seems that the problem in calcScore
is that it does not recognize many gene symbol identifiers from my ranked expression dataset, so when it reasigns the gene symbols to the ES score df it sets off a data.frame assignment error...
The thing is that I do not want any ID conversion following the score calculations whatsoever, because many genes from my dataset will be discarded in this conversion step for my non-model organism (Brassica napus). This is because there are not many cross annotations between annotated genes in ENA / EMBL databases and NCBI databases for this organism.
Is there any way you can add a functionality that avoids this automatic ID conversion to gene symbols?
a proof of this is that when I try plotting by "fgsea" plots, the gene ranks are plotted correctly without problem. So i'm guessing it has to do with that traceback step of ID conversion, as the 'fgsea' plot functionality does not involve this step
Thank you
Fixed, please use version >= 1.2.0.
plotGSEA(gse, plot_type = "classic",
show_pathway = c('GO:0045490','GO:1901136'),
show_gene = c('BnaA06g28350D','BnaA01g34980D'))
Splendid, thank you for the fast response!
Describe the bug Hello, I was planning to coerce a fgsea (preranked gsea) result onto a plotEnrich function for plotting, with a previous step of gene count and GeneRatio calculation, geneID_symbol mapping and column name changes so that the dataframe looked identical to the model dataframe returned by genGSEA (which I find less flexible than fgsea).
However, when I attempted plotting the results for a single category which I checked was in the gsea_df, i recieved an error
With the following traceback:
To Reproduce reprex_plotGSEA_filtered.xlsx
This is my excel file representing the list of different dataframes I used after preprocessing (with names "gsea_df", "genelist", "geneset", "exponent" and "org" . I am working with Brassica napus
external_gene_name
ENA identifiersI filtered the gsea_result to having only 21 rows in order to preserve confidenciality of my results, but it still has the identifier Im looking forward to create a GSEA plot from, GO:0040029. If this is a problem for test generation, please confirm.
Additional context Any other supplements?