Open FernandoDuarteF opened 1 month ago
To further check what's going on, see branch subworkflows-agat-longest-isoform.
This seems to be related to the agat_sp_keep_longest_isoform.pl filtering. Moving back to excon's GFFREAD solves the issue.
Using samplesheet.csv to run BUSCO requires 27 GB at most, so it's not a memory problem.
I tried with only half of the protein sequences of Vespa velutina extracted with GFFREAD, and BUSCO still hangs on the SEPP step. So it's not related to the file size.
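For anyone who wants to repeat the half-size test, here is a minimal sketch of how a protein FASTA could be cut down to half of its records. How the subset was actually chosen is not stated above, so taking the first half is an assumption, and the function names are hypothetical.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def first_half(records):
    """Keep the first half of the records (assumption: any half will do)."""
    return records[: len(records) // 2]


def write_fasta(records):
    """Serialise (header, sequence) tuples back to FASTA text."""
    return "".join(f">{header}\n{seq}\n" for header, seq in records)
```

The output of `write_fasta(first_half(read_fasta(text)))` can be saved to a file and passed to BUSCO in place of the full protein set.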
When running genomeqc on assets/samplesheet.csv (bees and wasps), BUSCO fails on the SEPP step with a warning message, even though 72 GB of RAM is being used.
SEPP is being used to automatically infer the lineage database. When the lineage database is explicitly set, BUSCO runs successfully.
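As a rough illustration of the workaround, the sketch below builds the two BUSCO command lines: one with an explicit lineage and one relying on auto-lineage. The file names, the `busco_command` helper, and the CPU count are hypothetical. `hymenoptera_odb10` is only assumed to be a suitable dataset for bees and wasps; the lineage actually used is not given above.

```python
import shlex


def busco_command(proteins, out_name, lineage=None, cpus=4):
    """Build a BUSCO protein-mode command as an argument list.

    If `lineage` is given, it is passed with --lineage_dataset, which skips
    automatic lineage selection. Otherwise --auto-lineage is used, which is
    the path that goes through SEPP.
    """
    cmd = [
        "busco",
        "--in", proteins,
        "--out", out_name,
        "--mode", "proteins",
        "--cpu", str(cpus),
    ]
    if lineage:
        cmd += ["--lineage_dataset", lineage]
    else:
        cmd.append("--auto-lineage")
    return cmd


# Explicit lineage (assumed dataset name), printed as a shell-ready string.
print(shlex.join(busco_command("proteins.fa", "busco_out", lineage="hymenoptera_odb10")))
```

Running the same input both ways should help confirm whether the failure is confined to the SEPP-based lineage placement.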