dePamphilis / PlantTribes

PlantTribes is a collection of automated gene family analysis pipelines for comparative plant genomics
GNU General Public License v3.0

KaKsAnalysis pipeline crashes when dealing with proteins that have many alanines, threonines, glycines or cysteines #24

Open shelkmike opened 1 year ago

shelkmike commented 1 year ago

By default, MAFFT automatically detects whether the input contains amino acid or nucleotide sequences. It sometimes mistakes protein sequences for DNA when a protein is rich in alanine (A), threonine (T), glycine (G), or cysteine (C). My dataset contained such proteins, and they caused the KaKsAnalysis pipeline to crash. To fix this, I had to add "--amino" to the MAFFT command in the source code of the KaKsAnalysis pipeline.
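
For anyone hitting the same problem, here is a minimal Python sketch of the workaround. It is not the pipeline's actual code: the function names and file paths are hypothetical, and the other MAFFT options used by KaKsAnalysis are not shown in this issue, so they are left out. The helper `nucleotide_like_fraction` only illustrates why such proteins look like DNA; it is not MAFFT's real detection rule.

```python
import subprocess
from typing import List

# Letters that are valid amino acid codes but also read as nucleotide codes.
NUCLEOTIDE_LIKE = set("ACGTUN")


def nucleotide_like_fraction(sequence: str) -> float:
    """Fraction of residues that could also be interpreted as nucleotides.

    Illustration only: MAFFT's own auto-detection heuristic may differ.
    """
    residues = [c for c in sequence.upper() if c.isalpha()]
    if not residues:
        return 0.0
    return sum(c in NUCLEOTIDE_LIKE for c in residues) / len(residues)


def build_mafft_command(input_fasta: str, force_amino: bool = True) -> List[str]:
    """Build a MAFFT command line (hypothetical helper).

    With force_amino=True, the --amino flag tells MAFFT to treat the input
    as protein instead of guessing the sequence type.
    """
    command = ["mafft"]
    if force_amino:
        command.append("--amino")
    command.append(input_fasta)
    return command


def run_mafft(input_fasta: str, output_fasta: str) -> None:
    """Run MAFFT and write the alignment, which it prints to stdout, to a file."""
    with open(output_fasta, "w") as handle:
        subprocess.run(build_mafft_command(input_fasta), stdout=handle, check=True)
```

Calling `run_mafft("proteins.faa", "proteins.aln")` (placeholder file names) requires `mafft` on the PATH. Forcing `--amino` is only safe for steps where the input is known to be protein; nucleotide alignments would need `--nuc` or the default detection instead.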

Huiting120 commented 1 year ago

Thanks for pointing that out! We really appreciate your feedback.