Closed fplazaonate closed 3 years ago
The problem occurs due to masking of short ORFs. Use --min-orf 1
to get the correct alignment:
Length=100
Score = 195 bits (495), Expect = 8.90e-72
Identities = 100/100 (100%), Positives = 100/100 (100%), Gaps = 0/100 (0%)
Frame = 1
Query 1 MSGARPKVRGPYHVGRLVVLAQLRSSGLVQGMISRAKVTHSKLQSLRSSRCAGVLKGMLF 180
MSGARPKVRGPYHVGRLVVLAQLRSSGLVQGMISRAKVTHSKLQSLRSSRCAGVLKGMLF
Sbjct 1 MSGARPKVRGPYHVGRLVVLAQLRSSGLVQGMISRAKVTHSKLQSLRSSRCAGVLKGMLF 60
Query 181 QIVVI\PGRVGSPKKVCPYLILLVYGRWDLISQPGVGYCK* 301
QIVVI PGRVGSPKKVCPYLILLVYGRWDLISQPGVGYCK*
Sbjct 61 QIVVI-PGRVGSPKKVCPYLILLVYGRWDLISQPGVGYCK* 100
Thanks Benjamin. I guess this should not happen by working with full contigs? Best, Florian
It can still happen if the segment between stop codons is too short, safer to use the option I suggested.
Hello,
I want diamond to find frameshifts in assemblies generated from IonTorrent data that occur at homopolymers.
I have generated a dummy example with a dna sequence and its corresponding protein.
Then, I have generated a +1 frameshift by adding a C in the original nucleotide sequence
In this case diamond fails to detect the frameshift:
I have tried to add another C to generate a +2 frameshift:
In this case diamond detects the frameshift:
Could you explain me the rationale behind this and how to fix this issue?
Thank you very much for your help, Florian