Dfam-consortium / RepeatMasker

RepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences.

Penelope elements missing from *.tbl and repeat landscapes #200

Closed TobyBaril closed 1 year ago

TobyBaril commented 1 year ago

When running RepeatMasker with a library generated by RepeatModeler, Penelope-like elements (PLEs) are missing from the final RepeatMasker quantifications (the *.tbl summary and the repeat landscapes). In the consensus library, RepeatClassifier labels these elements as family#PLE/subclass. I think the cause is that RepBase PLE headers use family#LINE/Penelope, so the inconsistency leads to PLEs not being counted in the final summaries.
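If the header mismatch described above is indeed the cause, one possible stopgap for 4.1.4 is to relabel the affected library entries before masking. The sketch below is only an illustration under that assumption: the function name `relabel_ple_headers` is hypothetical, it has not been tested against RepeatMasker itself, and it discards the PLE subclass (e.g. Chlamys) by collapsing everything to LINE/Penelope.

```python
import re

# Matches a FASTA header whose class label is exactly "PLE" or "PLE/<subclass>",
# e.g. ">rnd-1_family-5#PLE/Chlamys ( ... )". Header layout is assumed from the
# "family#class/subclass" convention quoted in this issue.
_PLE_HEADER = re.compile(r"^(>[^#\s]+)#PLE(?:/\S*)?(?=\s|$)")


def relabel_ple_headers(lines):
    """Yield library lines with '#PLE/...' headers rewritten to '#LINE/Penelope'.

    Sequence lines and headers with any other class label pass through unchanged.
    """
    for line in lines:
        if line.startswith(">"):
            line = _PLE_HEADER.sub(r"\1#LINE/Penelope", line)
        yield line


if __name__ == "__main__":
    example = [
        ">rnd-1_family-5#PLE/Chlamys ( Recon Family Size = 20 )",
        "ACGTACGT",
        ">rnd-1_family-6#LINE/L2",
        "GGCCGGCC",
    ]
    for out_line in relabel_ple_headers(example):
        print(out_line)
```

The function works on any iterable of lines, so it can be applied to an open library file and the result written to a new file, leaving the original library untouched.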

Reproduction steps

Run RepeatMasker with a library from RepeatModeler in which PLEs are named by RepeatClassifier as family#PLE/subclass (e.g. family#PLE/Chlamys).
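To check whether a given library contains such entries before running RepeatMasker, the class labels in its headers can be tallied. This is a minimal sketch, assuming headers follow the family#class/subclass layout quoted above; the function name `count_header_classes` and the "(no class)" placeholder are hypothetical choices for illustration.

```python
from collections import Counter


def count_header_classes(lines):
    """Count top-level class labels (the part between '#' and '/') in FASTA headers."""
    counts = Counter()
    for line in lines:
        if not line.startswith(">"):
            continue  # skip sequence lines
        parts = line[1:].split()
        name = parts[0] if parts else ""  # drop any free-text description
        if "#" in name:
            label = name.split("#", 1)[1]  # e.g. "PLE/Chlamys"
            counts[label.split("/", 1)[0]] += 1  # e.g. "PLE"
        else:
            counts["(no class)"] += 1
    return counts


if __name__ == "__main__":
    example = [
        ">rnd-1_family-5#PLE/Chlamys ( Recon Family Size = 20 )",
        "ACGTACGT",
        ">rnd-1_family-6#LINE/L2",
        "GGCCGGCC",
    ]
    for label, n in count_header_classes(example).items():
        print(label, n)
```

A non-zero count for PLE indicates the library has entries that the reproduction steps above would exercise.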

RepeatMasker version: 4.1.4

Library: Full Dfam 3.7

Operating system: Ubuntu LTS 22.04

rmhubley commented 1 year ago

This is a timely question. The fix is scheduled for the next RepeatMasker release, which should be out within the next week.

rmhubley commented 1 year ago

This has been fixed in RepeatMasker 4.1.5 released today.