Closed lerminin closed 1 year ago
So I must apologize, there is an error in the logic that I presented. In absence of overlap with the plasmid DB contigs will be labeled as plasmid only if they are circular and contain either a rep or mob marker sequences. So the reason why your plasmid is not classified as such is that it is not overlapping with a plasmid in the blast database.
Makes sense, thanks for the clarification
Hi there,
I'm using a custom database with mob_recon to recover plasmids from incomplete assemblies. I have a handful of linear contigs that appear to be fragments of plasmid sequences (hits to rep, MOB, MPF), but are getting classified as "chromosome" with my custom db. When I run them with mob_recon using the reference db, they get classified as plasmids.
Mob-recon contig_report.txt using custom db:
Mob-recon contig_report.txt using reference db (there are no other contigs in this assembly that get assigned to the AA710 cluster):
One explanation for this inconsistency is that there is nothing like these contigs in my custom db but there is something similar in the reference db, which is why they get labeled "plasmid" with the reference db. However based on this comment, I would expect these contigs to get classified as plasmids in novel clusters when using my custom db because they are positive for both Rep and MOB (bullet point 3), even if they are not represented in my custom db.
Is this the correct explanation for why this contig is not assigned the "novel plasmid" label in my custom db? Is a match (mash distance between 0.6 and some maximum threshold) to the db required in all circumstances to obtain the "novel plasmid cluster" classification?