Closed SergeiF1987 closed 2 years ago
TELR.log
file under the TELR output directory?
- @SergeiF1987 Thanks for opening this issue!
- My initial impression is that the issue may be caused by input files naming or formatting, but I need more information to confirm whether this is the case for your run.
- For the real data run, could you share the
TELR.log
file under the TELR output directory?- Are the output messages you posted in after repeatmasker TELR fails with an error "/bin/sh: 1: file_name.realign.sort.bam: not found" #16 (comment) from the complete standard output/errors? If not, could you share the complete standard output/errors with me? These files should help me better understand what the problem is. If the message file is too large, you could send it to my gmail address (hanshunhua0829).
- Thanks a lot!
thanks for so quick reply! I have attached the log file. TELR.log
Not sure I understand where to find the complete standard output/errors. My TELR run created only a folder "intermediate_files" and 2 files - "log" and "loci_eval.tsv". I'm able to share everything you need but could you specify what kind of file do you want me to share?
Best, Sergei
telr -o test_output -i 101M_passed.part-01.fastq -r 9_genome_contig10.fasta -l dvir_full-size_TEs.fasta &> test.log
- It appears that the file names are not the cause of run failure. I tried renaming the input files for the test run based on your real data file names and the new test run finished successfully.
- By "standard output/errors" I mean the messages that are automatically generated in the terminal when you run TELR interactively. You could either copy the messages from terminal or redirect the both standard output and error messages into a file when you run TELR (see example below). See more in https://linuxize.com/post/bash-redirect-stderr-stdout/.
telr -o test_output -i 101M_passed.part-01.fastq -r 9_genome_contig10.fasta -l dvir_full-size_TEs.fasta &> test.log
Ohh, I get it. Here you are - test.log and terminal_output.log terminal_output.log test.log
>contig_10;
instead of >contig10
. I was able to reproduce the same error message in your run by adding an ;
in the test data.;
and get the TELR run to complete. I can also try to make an update to allow ;
in the contig name.Wow! this one solved. but it comes another one:
ERROR: Requested column 2, but database file /mnt/raid/sergey/bio-first/insertion_analysis/test_output_3/intermediate_files/liftover_report.sort.bed only has fields 1 - 0.
logs are attached. Should I open one more issue? terminal_output.log test.log
9_genome_contig10.fasta
?
- Thanks for the report! @SergeiF1987
- It's possible that this issue is still caused by the contig name format in the reference genome so we can use the same issue for now. Could you share the contig names extracted from the modified
9_genome_contig10.fasta
?- Btw, it may be helpful if I have access to the raw input files through google drive if the issue doesn't turns out to be contig name related.
sorry for the late reply. Thank you for giving much attention to my issue! raw data is here: https://drive.google.com/drive/folders/1TWYXPI4rPjxzlrl6HJ8-0KhRL-4cLraI?usp=sharing
On Jan 19, 2022 we ran the most recent TELR on the dataset provided by @SergeiF1987 and got the same error messages that you mentioned in on Oct 12, 2021. We think the issue is caused by no TE being present in the local contig that can be lifted over to the reference genome (in TELR, for every TE candidate, we align flanking sequences in the TELR-assembled local contig to the reference genome to identify precise insertion coordinate). This might be caused by nested TE or adjacent TE or true negative. We will improve the error handling at the lift over step in cases like this. Thanks for providing this useful test case!
Hi Shunhua,
Thank you for this software! At the first glance, it has everything that I was looking for identification of TE insertions including allele frequency. Test run passed successfully but when I use my own data it fails with an error which looks like this:
Could you take a look at this issue?
Thanks in advance! Best, Sergei