Open photocyte opened 2 years ago
Hi,
I was able to reproduce your error and your protein profile looks fine to me. The issue seems to be related to the UTR prediction of Augustus in combination with the protein profile mode. I recommend turning off UTR prediction, e.g.:
genome_fasta=Ilumi1.3-grep13255.fasta
prfl_file=elateroidea_luciferase_clade.msa.fa.prfl
augustus --species=fly --proteinprofile=${prfl_file} --UTR=off ${genome_fasta}
If the UTRs are important to you, I can take a closer look at the code causing this bug, but this may take some time.
Best, Lars
Thank you! I can confirm that adding --UTR=off is a workaround. I imagine the UTR training might be limited to highly curated models like --species=fly, but shutting it off explicitly seems like good practice in case there are unexpected interactions.
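For anyone wrapping this in a script, here is a minimal sketch of pinning the workaround in one place so that --UTR=off is always passed explicitly. The function name build_augustus_cmd is hypothetical and is not part of Augustus or the pipeline. It only prints the command (a dry run) and uses just the flags already shown in this thread:

```shell
#!/bin/sh
# Hypothetical helper (an assumption for illustration, not part of Augustus):
# prints the Augustus PPX command with UTR prediction explicitly disabled.
build_augustus_cmd() {
    species=$1        # e.g. fly
    prfl_file=$2      # protein profile (.prfl) file
    genome_fasta=$3   # genome sequence in FASTA format
    printf 'augustus --species=%s --proteinprofile=%s --UTR=off %s\n' \
        "$species" "$prfl_file" "$genome_fasta"
}

# Dry run: print the command rather than executing it.
build_augustus_cmd fly elateroidea_luciferase_clade.msa.fa.prfl Ilumi1.3-grep13255.fasta
```

To actually run the prediction, the printed line could be executed directly; the sketch assumes file names without spaces.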
Hi there,
I've made a small prototype Nextflow pipeline to run Augustus in PPX mode with custom protein profiles, via Docker or Singularity: https://github.com/photocyte/luciferase-PPX-predictor-nf
(Augustus Docker image from quay.io/biocontainers/augustus:3.4.0--pl5321hd8b735c_3, see https://quay.io/repository/biocontainers/augustus?tab=tags)
I think I was able to make a good .prfl file from a custom MSA in FASTA format, but Augustus errors out when I try to use it:
The elateroidea_luciferase_clade.msa.fa.prfl file was made with this command:
Is there something wrong with the .prfl file I am creating? Relevant files attached: Archive.zip