Closed linda5mith closed 2 years ago
Hi Linda!
There is an option in the Pyrodigal results to change the letter used for unknown amino acids: it uses X
by default, but you could change it to any other letter (maybe G
?). You can check pyrodigal.Prediction.translate
.
Other option would be to enable region masking (pyrodigal.OrfFinder(mask=True)
), so that no genes can be predicted across unknown nucleotide.
Lovely, thanks for your swift reply!
There was a bug in previous versions that caused the translate
method to always ignore the replacement character, but it was fixed in v0.7.1
. Hopefully this solves your question!
I'm using PhageBoost which depends on pyrodigal to predict the protein sequence but it breaks if there is an unknown AA - 'X'. Is it possible to force prodigal to predict an AA or just exclude the unknown amino acids?