Closed kriskiil closed 6 years ago
Yes we discovered this too. We contacted the curator @happykhan and he says it is a mistake in the scheme which will be fixed.
But it did highlight that I need to add -culling_limit 1
back to my blastn
call!
Please email enterobase@warwick.ac.uk if pain persists.
In the Senterica scheme the presence of allele aroC-5 results in aroC(5,807,819), since aroC-5 is a supersequence of aroC-807 and aroC-819. Warnings are issued, but no ST is assigned. I'm at a loss as to why subsequences are included as new alleles in the MLST database, but that is of course not your fault. When that is the state of the database, however, I think it should be handled in a transparent manner.
A proposed solution would be to report only the longest exact match, when alternative alleles are covered by that match.
Example output:
Expected output of last line: