nf-core / genomeannotator

Pipeline for the identification of (coding) gene structures in draft genomes.
https://nf-co.re/genomeannotator
MIT License

Parallelize repeatmasking #17

Open marchoeppner opened 10 months ago

marchoeppner commented 10 months ago

Description of feature

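Currently, RepeatMasker runs on chunks of the assembly, and each chunk contains only whole scaffolds/chromosomes. For highly contiguous assemblies, this can mean excessively long run times, because a single large scaffold is never split across jobs. Investigate whether repeat masking can be performed on sub-regions of the scaffolds in parallel without losing repeat regions, in particular repeats that span the boundary between two sub-regions.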
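
One possible way to frame the investigation, as a rough sketch rather than a proposal for the actual implementation: cut each scaffold into overlapping windows, mask each window independently, then shift the hits back to scaffold coordinates and merge the ones that overlap. The function names, the window and overlap parameters, and the 0-based half-open coordinate convention are all assumptions made for illustration; nothing here calls RepeatMasker or reflects existing pipeline code. Whether a given overlap is large enough to avoid losing or truncating repeats at window edges is exactly what would need testing.

```python
from typing import Iterable, Iterator, List, Tuple

Interval = Tuple[int, int]  # 0-based, half-open (start, end); an assumed convention


def split_windows(length: int, window: int, overlap: int) -> Iterator[Interval]:
    """Yield overlapping windows that together cover [0, length)."""
    if window <= 0 or not 0 <= overlap < window:
        raise ValueError("need window > 0 and 0 <= overlap < window")
    step = window - overlap
    start = 0
    while start < length:
        end = min(start + window, length)
        yield start, end
        if end == length:  # last window reached the scaffold end
            break
        start += step


def lift_and_merge(
    hits_per_window: Iterable[Tuple[int, List[Interval]]]
) -> List[Interval]:
    """Shift window-local hits to scaffold coordinates and merge overlaps.

    hits_per_window holds (window_start, [(local_start, local_end), ...]).
    """
    lifted = sorted(
        (w_start + s, w_start + e)
        for w_start, hits in hits_per_window
        for s, e in hits
    )
    merged: List[Interval] = []
    for s, e in lifted:
        if merged and s <= merged[-1][1]:
            # Overlaps or touches the previous interval: extend it
            merged[-1] = (merged[-1][0], max(merged[-1][1], e))
        else:
            merged.append((s, e))
    return merged
```

For example, `list(split_windows(10, 4, 1))` gives the windows `(0, 4)`, `(3, 7)` and `(6, 10)`. A repeat reported as `(2, 4)` in the first window and as `(0, 2)` in the second (scaffold coordinates `(3, 5)`) would be merged into a single `(2, 5)` interval. Merging only recovers the masked span; annotation details such as the repeat family of a split hit would need separate handling.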