epi2me-labs / wf-16s

Other
19 stars 3 forks source link

Ways to improve minimap classification? #28

Open osvatic opened 4 weeks ago

osvatic commented 4 weeks ago

Ask away!

I am currently trying to use minimap on a ONT dataset to get a finer resolution, ideally down to species, on some full length amplicons. When I run this dataset using kraken as the classifier (with any database) I get a high percentage of the reads classified to some degree. When I run it with minimap I end up with ~99% unclassified regardless of dataset.

Are there settings that could be tweaked to adjust this? Maybe min_percent_identity?

nggvs commented 1 week ago

Hi @osvatic , Thank you very much for using the workflow! You can update to the latest version1.3.0. There is a new table that shows the unmapped sequences and there is also the number of unclassified in the rest of plots. Hopefully, this may give you an idea on how the min_percent_identity and min_ref_coverage are working and you can decrease this value if you consider it appropriate. To use the species level, you can change the taxonomic resolution to "S" (except in the SILVA database). Hope you find this useful!