Recently I have realized that there is a more hidden parameters called maxGSSize, which really influence the result of enricher/GSEA analysis. According to the raw code in DOSE,I think it may be involving in gene sets selection before we do some enrichment analysis(like functional enrichment in GO/KEGG or GSEA) based on the number of genes in them.
In practice, more gene sets would be evaluated with a larger maxGSSize and better results would aquired sometimes.
Here are my questions:
why the default of 'maxGSSize' is 500? As I know, many gene sets (for example, in MSigDB, containing thousands of genes) have genes more than 500. Is it because the larger gene sets is not suitable for that kind of analysis(GO/KEGG/GSEA)?
Is it resonable/recommanded if a larger number for maxGSSize is set in practice?
Hi~
Recently I have realized that there is a more hidden parameters called
maxGSSize
, which really influence the result of enricher/GSEA analysis. According to the raw code in DOSE,I think it may be involving in gene sets selection before we do some enrichment analysis(like functional enrichment in GO/KEGG or GSEA) based on the number of genes in them.In practice, more gene sets would be evaluated with a larger
maxGSSize
and better results would aquired sometimes.Here are my questions:
maxGSSize
is set in practice?Thanks~