Dfam-consortium / RepeatMasker

RepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences.

Merging Multiple .tbl Output Files from RepeatMasker Batches #284

Open meduri-ruthwick opened 1 month ago

meduri-ruthwick commented 1 month ago

Hello,

I am currently running RepeatMasker in multiple batches due to computational resource constraints. Each batch generates several output files, including a .tbl file representing the repeat landscape.

At present, I manually merge the .tbl files by copy-pasting the contents into a common file to create a comprehensive summary of repeats across all batches. However, this manual process is time-consuming and error-prone, especially as the number of batches increases.

Goal: I would like to automate the process of merging the .tbl files generated from each batch into a single comprehensive .tbl file that represents the overall repeat landscape of the entire dataset.

Question: Could anyone suggest an efficient way to automate this merging process? Are there any tools, scripts, or workflows that can help streamline this task?
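One possible approach, offered as a sketch rather than an official workflow: the percentages in each .tbl are computed relative to that batch's own sequence length, so pasting the tables together does not yield valid totals. It may be more reliable to merge the per-batch .out annotation files and regenerate a single summary from the merged file. The Python below assumes each .out file begins with the usual column-header lines and that every annotation row starts with an integer Smith-Waterman score. The function names and file layout are hypothetical. Note that the ID column restarts in each batch, so IDs in the merged file are not unique.

```python
from pathlib import Path
from typing import Iterable


def is_data_line(line: str) -> bool:
    """Return True if the line looks like a .out annotation row.

    Assumption: annotation rows start with an integer score, while the
    column-header lines start with text ("SW", "score") or are blank.
    """
    fields = line.split()
    return bool(fields) and fields[0].isdigit()


def merge_out_files(out_paths: Iterable[Path], merged_path: Path) -> int:
    """Concatenate .out files, keeping non-data lines from the first file only.

    Returns the number of annotation rows written.
    """
    rows = 0
    first_file = True
    with open(merged_path, "w") as merged:
        for path in out_paths:
            with open(path) as handle:
                for line in handle:
                    if is_data_line(line):
                        merged.write(line)
                        rows += 1
                    elif first_file:
                        # Header (and any other non-data line) is copied once.
                        merged.write(line)
            first_file = False
    return rows
```

A call might look like `merge_out_files(sorted(Path("batches").glob("*/*.out")), Path("merged.out"))`, where the `batches/` directory layout is only an example. The merged file could then be passed to a summary script such as `util/buildSummary.pl` from the RepeatMasker distribution; I have not verified its options here, so please check its usage text before relying on it.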

Any advice or recommendations would be greatly appreciated.

Thank you for your help!

Best regards, Ruthwick