OpenGene / dedup

Deduplication for cfDNA sequencing data
MIT License
10 stars, 4 forks

How about recoding it in C/C++/Java? #1

Open billzt opened 6 years ago

billzt commented 6 years ago

The Python script is too slow (several hours) and uses too much memory (~70 GB).

ylxdzsw commented 6 years ago

I have an optimized fork of an older version that adds multiprocessing and a C extension; it is expected to be slightly faster. You can try it at https://github.com/ylxdzsw/dedup. Run make before running it so that the C extension is built and used.

billzt commented 6 years ago

Thanks, I'll give it a try.