StevenWingett / HiCUP

Hi-C data processing pipeline
GNU Lesser General Public License v3.0
31 stars 11 forks source link

Speed up scripts #102

Open fys666 opened 3 months ago

fys666 commented 3 months ago

The current script is too slow to process, have you considered optimizing it? Use multi-process or multi-threading to improve the speed, both the code for filtering and removing PCR duplicates are running too slow, you've done a great job, but still hope you can improve and optimize it, thanks!

fys666 commented 3 months ago

In fact, There's no need to use iterators to process line by line to save memory, everyone is rich in computational resources these days