rnajena / viralclust

Small pipeline to cluster viral genomes based on their k-mer content. WiP
GNU General Public License v3.0
15 stars 4 forks source link

Preprocessing filter for large stretches of consecutive Ns #11

Open klamkiew opened 3 years ago

klamkiew commented 3 years ago

Even though we'd like to keep all input sequences as they are without filtering out too much, having 40% N's within a genome is of no help for anyone.