Word cloud: show most over-represented words, not the most frequent ones

jokergoo / simplifyEnrichment

Simplify functional enrichment results


Some terms are just common among:

- all pathways from a specific database
- all significant pathways someone chooses to summarise

It would be very useful if the word cloud could show the n most over-represented terms (as an optional replacement for the current n most common terms). The user would just need to provide a list of pathways to use as background.

Implementation-wise, I imagine keeping count_word() unchanged and adding an extra step in anno_word_cloud(), conditional on the user providing a background or setting a switch argument. If this sounds good, I will be happy to work on it.
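To make the idea concrete, here is a minimal sketch of one way the extra scoring step could work. It is written in Python purely for illustration and is not simplifyEnrichment code: every function name below is hypothetical, the tokenizer is a naive whitespace split, and the hypergeometric test is just one possible choice of over-representation statistic (the proposal above does not fix one). It assumes the selected pathways are a subset of the background.

```python
from math import comb


def tokenize(term):
    # Naive tokenizer (assumption): lowercase, split on whitespace,
    # and count each word at most once per pathway name.
    return set(term.lower().split())


def doc_freq(terms):
    # Number of pathway names in which each word occurs.
    counts = {}
    for term in terms:
        for word in tokenize(term):
            counts[word] = counts.get(word, 0) + 1
    return counts


def hypergeom_upper_tail(k, n, K, N):
    # P(X >= k) when n names are drawn without replacement from N names,
    # K of which contain the word.
    return sum(
        comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1)
    ) / comb(N, n)


def overrepresented_words(selected, background, top_n=10):
    # Rank words by how surprising their frequency in `selected` is,
    # given their frequency in `background` (smallest p-value first).
    n, N = len(selected), len(background)
    fg, bg = doc_freq(selected), doc_freq(background)
    scored = []
    for word, k in fg.items():
        K = bg.get(word, 0)
        if K < k:
            continue  # word not covered by the background; skip it
        scored.append((word, k, K, hypergeom_upper_tail(k, n, K, N)))
    # Ties are broken by higher foreground count, then alphabetically.
    scored.sort(key=lambda row: (row[3], -row[1], row[0]))
    return scored[:top_n]


# Toy data (made up for this example).
background = [
    "regulation of cell cycle",
    "regulation of transcription",
    "regulation of translation",
    "regulation of apoptosis",
    "regulation of dna repair",
    "regulation of dna replication",
]
selected = ["regulation of dna repair", "regulation of dna replication"]

top = overrepresented_words(selected, background)
ranked_words = [row[0] for row in top]
```

In this toy example "regulation", "of" and "dna" all occur twice in the selected names, so a plain frequency count cannot separate them. With the background taken into account, "dna" ranks first, while "regulation" and "of" drop to the bottom because they occur in every background name.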

| word | freq |
|---|---|
| gene | 30664 |
| protein | 22489 |
| transcript | 11935 |
| mirna | 11481 |
| family | 8837 |
| encoded | 8260 |
| encodes | 7848 |
| proteins | 7586 |
| variants | 6983 |
| involved | 5223 |
| member | 4983 |


Word cloud: show most over-represented words, not the most frequent ones #58