Closed mgrew closed 2 years ago
Dear Malte, first of all thank you very much for your interest in our software.
Concerning your problem with repeatModeler The problems with repeatModelersound familiar to me from other people reporting that. You may find a solution in this thread. https://github.com/DerKevinRiehl/TransposonUltimate/issues/3#issuecomment-1115257052 Does this do the trick for you?
Concerning your problem with repeatMasker For your issue with repeatMasker, may you share more files that are in the folder produced by repeatMasker? I would like to dig deeper into it, as you are the first user experiencing this issue.
Looking forward to your answer soon, Best regards, Kevin
Hi Kevin,
thanks for your quick reply! Your solution regarding repeatModeler indeed did the trick. Thanks for pointing me towards that easy fix.
I ran repeatMasker on Galaxy (v4.0.9) with the Dfam.h5 library, which produced the desired output files:
sequence.fasta (normal fasta format) sequence.fasta.masked (normal fasta format with masked Ns) sequence.fasta.cat sequence.fasta.out sequence.fasta.tbl
Thanks for your help! Best wishes, Malte
Hi again, I solved it. The .fasta.out file from the Galaxy version of repeatMasker was tab-delimited, while it had to be space-delimited for the parsing script to work. In retrospect, I could have spotted this earlier. Sorry and thanks again!
Best wishes, Malte
Hi Kevin,
thanks for creating this amazing pipeline. The approach to combine differently sensitive annotation tools is much needed and the detailed documentation allows a quick implementation even for computational rookies like me.
All tools ran successfully (RepeatMasker and ltrPred externally), however, during the "parseAnnotation" step the script gets stuck on the outputs of RepeatModeler and Repeatmasker and gives the following error messages:
I conclude that the structure of the according output files is not as expected, though I can't find any difference in format between the outputs and folder structure in testProject data and mine (only first few lines showed):
repeatModel (sequence_index-families.stk):
RepeatMasker (sequence.fasta.out)
I could of course skip these tools, but since they do a thorough TE annotation according to your paper, I'd prefer to include at least one of them.
Do you have any idea what the problem could be? Am I overlooking something?
Thanks in advance & best wishes, Malte