YuLab-SMU / GOSemSim

:golf: GO-terms Semantic Similarity Measures
https://yulab-smu.top/biomedical-knowledge-mining-book/

calculating similarity for a large set of genes #16

Open elise625 opened 6 years ago

elise625 commented 6 years ago

I'm trying to calculate pairwise similarity with mgeneSim for ~15000 Entrez Gene IDs, but it is taking far too long. Is there another, faster way to do this?

llrs commented 6 years ago

Which version are you using? The latest release (2.4.0) is faster than the previous one (2.2.0), as per this PR.

My estimate is that it should now take less than half the time it used to. How long is "way too long"? Keep in mind that you are asking for 15000^2 = 225 million comparisons!
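For a sense of scale, here is a minimal back-of-the-envelope sketch in Python (not GOSemSim code). The helper names `comparison_counts` and `estimated_hours` are made up for illustration, and the 1 ms per-comparison cost is a purely hypothetical placeholder, not a measured figure. The "unique pairs" count only applies if the similarity measure is symmetric and self-comparisons are skipped, which this thread does not confirm for mgeneSim.

```python
def comparison_counts(n_genes: int) -> dict:
    """Count gene-to-gene comparisons for an n_genes x n_genes similarity matrix."""
    return {
        # Every cell of the full matrix, including the diagonal.
        "all_cells": n_genes ** 2,
        # Unordered pairs of distinct genes (one triangle, no diagonal).
        "unique_pairs": n_genes * (n_genes - 1) // 2,
    }


def estimated_hours(n_comparisons: int, seconds_per_comparison: float) -> float:
    """Convert a comparison count into wall-clock hours for a given per-comparison cost."""
    return n_comparisons * seconds_per_comparison / 3600


counts = comparison_counts(15000)
print(counts["all_cells"])     # 225000000
print(counts["unique_pairs"])  # 112492500

# Hypothetical cost of 1 ms per comparison, for illustration only.
print(round(estimated_hours(counts["all_cells"], 0.001), 1))  # 62.5
```

Even under that optimistic assumed cost, the full matrix would take days of single-threaded compute, so a concrete runtime figure helps judge whether what you are seeing is expected.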