shenwei356 / kmcp

Accurate metagenomic profiling && Fast large-scale sequence/genome searching
https://bioinf.shenwei.me/kmcp
MIT License
176 stars 13 forks

Masking prophages in bacterial genomes before building the database, as Phanta does #29

Open shenwei356 opened 1 year ago

shenwei356 commented 1 year ago

Should we update the genome size after removing the masked Ns?

The answer is probably yes.
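As a rough illustration of what "updating the genome size" could mean, here is a minimal sketch that counts the total and the unmasked (non-N) bases in a FASTA file. It is not KMCP's code: the helper names `read_fasta` and `genome_sizes` are hypothetical, and it assumes masked regions are written as `N` or `n`.

```python
from typing import Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (record_id, sequence) pairs from FASTA-formatted lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if name is not None:
                yield name, "".join(chunks)
            fields = line[1:].split()
            name = fields[0] if fields else ""
            chunks = []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def genome_sizes(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (total_bases, unmasked_bases) summed over all records.

    Assumption: masked positions are the characters 'N' or 'n'.
    """
    total = 0
    unmasked = 0
    for _, seq in read_fasta(lines):
        total += len(seq)
        unmasked += len(seq) - seq.count("N") - seq.count("n")
    return total, unmasked


# Toy input: 24 bases in total, 12 of them masked.
demo = [">contig1", "ACGTACGTNNNNNNNN", ">contig2", "NNNNACGT"]
print(*genome_sizes(demo))  # prints: 24 12
```

For a real genome, an open file handle can be passed instead of the list, for example `genome_sizes(open("genome.fna"))`, where `genome.fna` is a placeholder path. The second value is the size that would be recorded if masked bases are excluded.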

shenwei356 commented 9 months ago

Available here: https://1drv.ms/u/s!Ag89cZ8NYcqtjHwpe0ND3SUEhyrp?e=QDRbEC

The files are:

gtdb_masked.part_1.kmcp.tar.gz
gtdb_masked.part_2.kmcp.tar.gz