RepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences.
Linking LTRs and internal regions for ERV: Further postprocessing necessary? #288
Open
osthomas opened 1 month ago
Dear all,
I am looking into ERVs in the mouse genome (GRCm39), and I am a bit confused about the relevant post-processing steps.
There is a script available to combine ERV LTRs with internal regions, based on the names of the elements (https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13). However, I am not sure if this is (still?) required, or if ProcessRepeats does this already.
Here is one example from the `.out` file in which LTRs and internal regions were already linked via the `ID` column:

In this particular case, joining by name would not even catch it.
ProcessRepeats seems to do something with LTRs/ints. Are there cases that ProcessRepeats might miss, which may benefit from further parsing?
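To make the question concrete, here is a minimal sketch of how one might collect annotations that share an `ID` value in a RepeatMasker `.out` file. It assumes the standard whitespace-separated layout with 15 columns (the `ID` in the last one, optionally followed by `*`) and that LTR elements carry a class/family starting with `LTR`. The function names and the example rows (`LTR_A`, `ERV_B-int`, `SINE_X`, all coordinates and scores) are made up for illustration and are not taken from real output. Grouping by sequence name plus `ID` is a cautious choice in case IDs are not unique across concatenated files.

```python
from collections import defaultdict


def parse_out_line(line):
    """Parse one data row of a RepeatMasker .out file, or return None.

    Assumes the standard 15-column layout; header and blank lines are
    skipped because they do not start with an integer score.
    """
    fields = line.split()
    if len(fields) < 15 or not fields[0].isdigit():
        return None
    return {
        "seq": fields[4],
        "start": int(fields[5]),
        "end": int(fields[6]),
        "strand": fields[8],
        "name": fields[9],
        "class_family": fields[10],
        "id": fields[14],
    }


def group_by_id(lines):
    """Group parsed rows by (sequence name, ID), keeping file order."""
    groups = defaultdict(list)
    for line in lines:
        rec = parse_out_line(line)
        if rec is not None:
            groups[(rec["seq"], rec["id"])].append(rec)
    return groups


def linked_ltr_groups(groups):
    """Keep groups with more than one fragment where every fragment is LTR-class."""
    return {
        key: recs
        for key, recs in groups.items()
        if len(recs) > 1
        and all(r["class_family"].startswith("LTR") for r in recs)
    }


# Made-up rows in .out layout, for illustration only.
EXAMPLE_LINES = [
    "   SW   perc perc perc  query     position in query    matching  repeat",
    "score   div. del. ins.  sequence  begin end   (left)   repeat    class/family",
    "",
    " 1000  5.0  0.0  0.0  chr1  1000  1400  (9000)  +  LTR_A      LTR/ERVK  1    400   (0)  7",
    " 5000  4.0  0.0  0.0  chr1  1401  6000  (4400)  +  ERV_B-int  LTR/ERVK  1    4600  (0)  7",
    " 1000  5.0  0.0  0.0  chr1  6001  6400  (4000)  +  LTR_A      LTR/ERVK  1    400   (0)  7",
    "  300 10.0  0.0  0.0  chr1  8000  8100  (2300)  C  SINE_X     SINE/B2   (0)  100   1    8",
]

if __name__ == "__main__":
    linked = linked_ltr_groups(group_by_id(EXAMPLE_LINES))
    for (seq, rm_id), recs in linked.items():
        print(seq, rm_id, [r["name"] for r in recs])
```

With something like this, elements that are already linked by `ID` could be compared against what a name-based joining script would produce, to see which cases each approach misses.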