pedronachtigall / CodAn

CDS prediction in transcripts
Other
23 stars 5 forks source link

Found * in middle of the output PEP sequences #15

Open sivasubramanics opened 4 months ago

sivasubramanics commented 4 months ago

Hi, Thanks for the great tool! I just noticed that some of the output protein sequences contain "" in between the output PEP sequences. What could be the reason? The tail '' defines the STOP codan i assume, what about the initial one.

eg:

>IL_g8815.i10
M*VGKVLRVQKVKRNGTNYEEVPSDLSVHEGLFWNEDCDLLSAPTREIAEHLY*

Cheers, Siva

pedronachtigall commented 2 weeks ago

Hi @sivasubramanics

The automatic translation function in CodAn uses the standard code table. This predicted CDS can be from some organelle genome (e.g., mitochondrial). This can be the reason behind this behavior.

Best, Pedro