Closed iimog closed 5 years ago
I'm not convinced these are CRISPRs. You can see the "spacer" sequences are suspiciously similar to each other and seem to contain a lot of small repeating motifs. I think this might be that with --minNR 2
the results are a bit too loose. As an alternate source the CRISPRdb also reports no CRISPRs. I think the reason for there appearing to be many spacers is due to a part of the algorithm that attempts to find additional degenerate repeat and spacer pairs at the sides of the array, this in combination with the reduced number of initial repeats is causing some false positives.
I'm sorry for the misleading output; for best results in genomes stick with --minNR 3
or higher.
Thanks for the quick response. I'll stick to --minNR 3
from now on. Thanks again for your nice tool and helpful response.
Running
minced
on a complete Prochlorococcus genome with default parameters ends up empty. When setting--minNR 2
results are reported with Repeat counts of 4, 6, 4, and 11. From how I interpret the--minNR
parameter I expected all of those results to be found with default parameters (--minNR 3
) as well. I'm usingminced 0.3.2
withopenjdk version "1.8.0_152-release"
onNixOS 18.9
atLinux 4.14.91