Closed wangnan9394 closed 2 years ago
Does the VCF from pbsv (pbmm2 aligner) also work in this pipeline? :)
Hi @wangnan9394,
Shunhua
Hi Shunhua,
thanks a lot for this software. I would be happy to use it. Unfortunately, I get the issue mentioned previously by wangnan9394 but for test data.
Master RepeatMasker Database: /mnt/raid/sergey/miniconda/envs/TELR/share/RepeatMasker/Libraries/RepeatMaskerLib.embl ( Complete Database: dc20170127 ) Custom Repeat Library: /mnt/raid/sergey/bio-first/insertion_analysis/test_telr_default/output/intermediate_files/library.fasta
Warning...unknown stuff <
File /mnt/raid/sergey/bio-first/insertion_analysis/test_telr_default/output/intermediate_files/reads.vcf_ins.fasta appears to be empty. [Errno 2] No such file or directory: '/mnt/raid/sergey/bio-first/insertion_analysis/test_telr_default/output/intermediate_files/vcf_ins_repeatmask/reads.vcf_ins.fasta.out.gff' Repeatmasking VCF insertion sequences failed, exiting...
do you know what could be a reason for that?
By the way, when I use my own data this step seems to be passed by I get another error.
analyzing file /mnt/raid/sergey/bio-first/insertion_analysis/test/output/intermediate_files/101N_passed.part-01.te.fa identifying matches to dvir_full-size_TEs.fasta sequences in batch 1 of 1 processing output: cycle 1 cycle 2 cycle 3 cycle 4 cycle 5 cycle 6 cycle 7 cycle 8 cycle 9 cycle 10 Generating output... masking done Done
Successfully created the directory /mnt/raid/sergey/bio-first/insertion_analysis/test/output/intermediate_files/telr_reads
Usage: samtools depth [options] in1.bam [in2.bam [...]]
Options:
-a output all positions (including zero depth)
-a -a (or -aa) output absolutely all positions, including unused ref. sequences
-b list of input BAM filenames, one per line [null]
-l
The output is a simple tab-separated table with three columns: reference name, position, and coverage depth. Note that positions with zero coverage may be omitted by default; see the -a option.
/bin/sh: 1: _137386_137390:5972-6022: not found
/bin/sh: 1: _137386_137390.realign.sort.bam: not found
Traceback (most recent call last):
File "/mnt/raid/sergey/miniconda/envs/TELR/bin/telr", line 10, in
but probably I need to open another issue for this.
thanks in advance for your reply. Best, Sergei
reads.vcf_ins.fasta
), which is different from the issue reported in https://github.com/bergmanlab/TELR/issues/5#issue-928853284.
- Thanks for reporting these issues @SergeiF1987 and sorry for the late reply! Last week is a bit crazy.
- For the first issue, it appears that the insertion sequences extracted from SV workflow is empty (
reads.vcf_ins.fasta
), which is different from the issue reported in run in ERROR "Repeatmasking VCF insertion sequences failed, exiting..." #5 (comment).- To make sure I can reproduce the error, could you confirm that you are using the latest version of TELR (47a0e23) and installed the conda environment according to the README? https://github.com/bergmanlab/TELR/blob/master/docs/01_Installation.md
- Also, is the first issue reported in run in ERROR "Repeatmasking VCF insertion sequences failed, exiting..." #5 (comment) from a clean test data run (i.e. no re-run of the same job without removing existing output folder)?
- Could you send me a complete copy of the log file for the test data run? My gmail address is hanshunhua0829. Thanks a lot.
- The second issue doesn't appear to be repeatmasker related. Could you open another issue page and copy the report over to the new issue? Thanks! I will take a look there. My initial impression is that it has something to do with the format of the input reference genome.
Thanks to you reply! It seems that installation TELR via conda creates not the last version of the program. I have reinstalled it by using git clone than switch version (git checkout 47a0e23f8718df918e6f073c25130c2bdd1bd15f). Test run completed successfully but the second issue with my own data unfortunately wasn't solved. I will open a new issue for that. Thanks!
Hi, I am good at test folder. It's a great software. But, when i test a 2.2 Gb reads on my genome. The process was broken, and here is the details. Unfortunately, i could not find where is the key. Could you give me a hand?
Bests, Nan