Closed diriano closed 5 years ago
Please contact the GeneMark developers about this issue. I don't think it is a BRAKER issue.
Hello, GeneMark developer here -- the problem might be a small contig size, by default, only contigs longer than 50k are used in GeneMark-ETP training. In your genome, this filters out almost all contigs and GeneMark fails due to lack of training data.
Try setting BRAKER's min_contig option to a smaller number, for example: --min_contig=10000.
I believe this option is only available in the development state version, you will need to clone the latest version of this repository.
Best, Tomas
Thank you for that comment, @tomasbruna ! I now mention this in the README "Common problems" section (https://github.com/Gaius-Augustus/BRAKER/commit/1e8ba0735f11b1f6d591acebc552b95b0e3ff381).
Closing the issue, now.
Katharina
Dear developers,
I am running BRAKER 2.1.2 on a plant genome with around 40K scaffolds. But the job is dying at some stage with GeneMark. GeneMark dies with the following error reported to the STDERR:
braker.log does not report any error. GeneMark-ETP.stdout has the following:
I am running BRAKER as the following:
braker.pl --etpmode --softmasking --species=$SP --genome=$ASSEMBLY --bam=${SP}.scf_gt1000bp.sorted.bam --hints=prot_hintsfile.aln2hints.gff --cores=$NSLOTS --AUGUSTUS_CONFIG_PATH=AugustusCONFIG --AUGUSTUS_BIN_PATH=/Storage/progs/Augustus-3.3.1-tag1/bin --AUGUSTUS_SCRIPTS_PATH=/Storage/progs/Augustus-3.3.1-tag1/scripts
prot_hintsfile.aln2hints.gff are hits to a related species (same family), generated in a previous run of BRAKER (unmasked genome) with GenomeThreader. The genome has been softmasked using RepeatModeller/RepeatMasker.Any suggestion on how to carry on with genome annotation is greatly appreciated. Thanks, Diego