PacificBiosciences / pb-metagenomics-tools

Tools and pipelines tailored to using PacBio HiFi Reads for metagenomics
BSD 3-Clause Clear License
174 stars 35 forks source link

Filter superbins #80

Closed dportik closed 3 months ago

dportik commented 3 months ago

Identifies superbins output from SemiBin2, which are >100 Mb in size. Including very large superbins (~1GB) will cause crashes in DAS_Tool. This new identification step will move all superbins to a superbin folder in the SemiBin2 output directory. These bins can be inspected to determine their contents, which can sometimes contain interesting eukaryotic genomes.