iqbal-lab / outbryk


Prophage file redundancy #5

Open hcdenbakker opened 8 years ago

hcdenbakker commented 8 years ago

The current sets of prophages are probably highly redundant. We should use some kind of CD-HIT-like clustering procedure to get a set of unique prophage sequences.
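As a rough illustration of the idea, here is a minimal Python sketch of a cheap first pass that could run before any real clustering. It only collapses sequences that are exactly identical (or identical as reverse complements); it is not CD-HIT and will not merge near-identical prophages. The function names (`read_fasta`, `canonical`, `dedupe_exact`) are hypothetical and not part of outbryk.

```python
import hashlib

# Complement table for plain DNA letters; other symbols (e.g. N) pass through unchanged.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def canonical(seq):
    """Return the lexicographically smaller of a sequence and its reverse complement."""
    seq = seq.upper()
    rev_comp = seq.translate(_COMPLEMENT)[::-1]
    return min(seq, rev_comp)


def dedupe_exact(records):
    """Keep the first record for each distinct canonical sequence."""
    seen = set()
    unique = []
    for header, seq in records:
        # Hash the canonical form so the set stays small for long sequences.
        key = hashlib.sha1(canonical(seq).encode()).hexdigest()
        if key not in seen:
            seen.add(key)
            unique.append((header, seq))
    return unique
```

Usage would be along the lines of `dedupe_exact(read_fasta(open("prophages.fasta")))`, where the file name is a placeholder. Sequences that differ by even one base survive this pass, so identity-threshold clustering would still be needed afterwards.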

iqbal-lab commented 8 years ago

Good idea, Henk. The main CD-HIT website is http://weizhongli-lab.org/cd-hit/, and the project has just moved to GitHub: https://github.com/weizhongli/cdhit