Closed Nicholas-Kron closed 10 months ago
I'm confused of your goals - generally the 'update' command is where UTRs are added and incoporated into annotation and analysis - have you tried your approach with the update command?
So I should not expect any UTR information to be carried over from the external PASA run during predict? For my colleague, the initial predict run incorporated all UTRs predicted from PASA in the predict step, which were further refined with update run afterward (e.g. his predict run resulted in ~30k genes with UTRs, and update improved ~4k of them). Based on his experience I assumed something must have gone wrong that my UTR information was not incorporated during the predict step. If not incorporating UTRs during predict is normal behavior than that is my mistake and I will proceed to update.
Correct, predict
will not have any UTR information. https://funannotate.readthedocs.io/en/latest/update.html
To add UTRs from PASA compare annotations methodology, you can run funannotate update
after running predict. It will utilize the existing PASA alignments from the database and modify existing gene models including adding UTR information if it is present.
Ah I see, my mistake. Thank you for clearing that up. Must have been some miscommunication on our part. I will proceed to update then. Thank you for your assistance!
Are you using the latest release? version used: Funannotate 1.8.15 (latest) via mamba
Describe the bug funannotate predict did not incorporate UTR information from externally generated PASA gff3 file. PASA v2.5.2 run externally annotated 38,552 transcripts as having complete UTRs. Funannotate output reports 0 genes with 5', 3', or complete UTRs. The prediction seems to have worked fine, lots of predicted genes, just no UTRs. A colleague running the same scripts with an external PASA transcriptome on the same HPC cluster with a different organims/genome did not encounter this issue. I have to rerun anyway because GeneMark install broke for some reason, but I don't think that should have affected incorporating the PASA UTR annotation, right? I have done some debugging of perl as that is often the source of problems in my experience on our HPC and rerunning, but wanted to see if maybe something else could have been the cause. Thanks for your help!
The UTRs are in the PASA gff3 as:
in the
pasa.training.tmp.gtf
they are listed:What command did you issue?
Logfiles Please provide relavent log files of the error.
funannotate-predict.log
OS/Install Information