Closed rashidma closed 6 months ago
can you send the contents of the work folder in which the get_vhla
failed as tar.gz
Dear Rieder, The failed work dir is below: /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/d4/4d565442ef4ad7f8cb1139bc838109
Please find attached the entire d4 directory.
Thank you for your time. I have been trying to run nextNEOpi on demo data since long time. Unfortunately, I am still suffering due to issues. Please help me to an extent so that I can run this on the demo successfully.
Please let me know. Best regards Mamoon
On Mon, Dec 18, 2023 at 12:40 PM Dietmar Rieder @.***> wrote:
can you send the contents of the work folder in which the get_vhla failed as tar.gz
— Reply to this email directly, view it on GitHub https://github.com/icbi-lab/nextNEOpi/issues/67#issuecomment-1859946440, or unsubscribe https://github.com/notifications/unsubscribe-auth/AE6KEOFZSVU7IK7X2IWNIDTYKAFPDAVCNFSM6AAAAABAZDJCE6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNJZHE2DMNBUGA . You are receiving this because you authored the thread.Message ID: @.***>
--
Mamoon Rashid, PhD (Bioinformatics)
Associate Research Scientist
Department of Biostatistics and Bioinformatics
King Abdullah International Medical Research Center (KAIMRC) Kingdom of Saudi Arabia
Hi, can it be that you forgot to attach?
I can see attachment in my previous email. d4.tar.gz Please check.
Mamoon
On Mon, 18 Dec 2023 at 6:20 PM Dietmar Rieder @.***> wrote:
Hi, can it be that you forgot to attach?
— Reply to this email directly, view it on GitHub https://github.com/icbi-lab/nextNEOpi/issues/67#issuecomment-1860792511, or unsubscribe https://github.com/notifications/unsubscribe-auth/AE6KEOCZX5SCPLHYLZ3UDADYKBNLFAVCNFSM6AAAAABAZDJCE6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNRQG44TENJRGE . You are receiving this because you authored the thread.Message ID: @.***>
There is no attachment in github and also not in the notification e-mail that I got from github. Sorry
Now it is there, however the symlinks are not followed, can you please create the tar.gz
using the following command:
tar -chzf work_dir.tar.gz /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/d4/4d565442ef4ad7f8cb1139bc838109
thanks
Done. Please see below: work_dir.tar.gz
Hi and thanks,
it seems that HLA-HD
did not run successfully and created an unexpected output (test_tumor_DNA_final.result.txt
)
can you sent the contents of:
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/88/273dc3f21d5c333657d8e19200b919/test_tumor_DNA/log
and
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/88/273dc3f21d5c333657d8e19200b919/.command.err
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/88/273dc3f21d5c333657d8e19200b919/.command.log
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/88/273dc3f21d5c333657d8e19200b919/.command.out
Okay. Please have the tarballs below: log.tar.gz
I see that your hlahd
installation is missing bowtie2
on your system:
/mnt/e/Mamoon/software/HLAHD/hlahd.1.5.0/bin/hlahd.sh: line 97: bowtie2: command not found
I suggest you install bowtie2
as stated on https://w3.genome.med.kyoto-u.ac.jp/HLA-HD/
Then it should work.
Hi, I installed "bowtie2" and set the path to "bash_profile". I tested that it is working on command prompt. I also set the path of "hlahd_dir/bin" in my PATH variable.
Having said that, nextNEOpi again failed with the same error.
Error executing process > 'get_vhla (test)'
Caused by:
Process `get_vhla (test)` terminated with an error exit status (1)
Command executed:
# merging script
HLA_parser.py \
--opti_out test_optitype_result.tsv \
--hlahd_out test_tumor_DNA_final.result.txt \
--opti_out_RNA test_optitype_RNA_result.tsv \
\
\
--ref_hlas /mnt/e/Mamoon/software/nextNEOpi-master/assets/pVACseqAlleles.txt \
> ./test_hlas.txt
Command exit status:
1
Command output:
(empty)
Command error:
Traceback (most recent call last):
File "/mnt/e/Mamoon/software/nextNEOpi-master/bin/HLA_parser.py", line 423, in <module>
parse_hlahd(hlahd_dna_result, "classII", hlaII_array)
File "/mnt/e/Mamoon/software/nextNEOpi-master/bin/HLA_parser.py", line 73, in parse_hlahd
if "HLA-" in line[1]:
IndexError: list index out of range
Work dir:
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/d9/5783971477e8cda092ee113063e03a
Tip: you can try to figure out what's wrong by changing to the process work dir and showing the script file named `.command.sh`
I checked "test_tumor_DNA_final.result.txt", again failed. I am surprised with this situation.
Thanks a lot
Can you again send the contents of
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/d9/5783971477e8cda092ee113063e03a
As tar.gz with the -chzf
option
Hi and thanks,
it still seems that HLA-HD did not run successfully and created an unexpected output (test_tumor_DNA_final.result.txt)
can you sent the contents of:
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/dd/b93680559f929b4300e69e79c18561/test_tumor_DNA/log
and
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/dd/b93680559f929b4300e69e79c18561/.command.err
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/dd/b93680559f929b4300e69e79c18561/.command.log
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/dd/b93680559f929b4300e69e79c18561/.command.out
Here you go. HLAHD is 1.5, do you think I should use version 1.7?
command.err.tar.gz command.out.tar.gz command.log.tar.gz log.tar.gz
I checked the .command.err file and it looks like bowtie2 is not accessible! But, i installed it at /mnt/e/Mamoon/software/bowtie2-2.5.2-linux-x86_64/bowtie2 and included in the path in bash_profile and bashrc files.
Please comment. Thank you
Can you run hlahd
manually, I mean independent of the nextNEOpi? Is it finding bowtie2
I run hlahd manually using a demo published on their website hlahd-demo.v1.pdf
The command i used is as below :
hlahd.sh -t 4 -m 100 -f /mnt/e/Mamoon/software/HLAHD/hlahd.1.5.0/freq_data ./SRR794547_1.fastq ./SRR794547_2.fastq /mnt/e/Mamoon/software/HLAHD/hlahd.1.5.0/HLA_gene.split.txt /mnt/e/Mamoon/software/HLAHD/hlahd.1.5.0/dictionary SRR794547 .
The output as below:
HLA-HD version 1.5.0
22082652 reads; of these:
22082652 (100.00%) were unpaired; of these:
21913326 (99.23%) aligned 0 times
16595 (0.08%) aligned exactly 1 time
152731 (0.69%) aligned >1 times
0.77% overall alignment rate
22082652 reads; of these:
22082652 (100.00%) were unpaired; of these:
21910199 (99.22%) aligned 0 times
16670 (0.08%) aligned exactly 1 time
155783 (0.71%) aligned >1 times
0.78% overall alignment rate
169326 reads; of these:
169326 (100.00%) were unpaired; of these:
162436 (95.93%) aligned 0 times
2161 (1.28%) aligned exactly 1 time
4729 (2.79%) aligned >1 times
4.07% overall alignment rate
172453 reads; of these:
172453 (100.00%) were unpaired; of these:
166169 (96.36%) aligned 0 times
2126 (1.23%) aligned exactly 1 time
4158 (2.41%) aligned >1 times
3.64% overall alignment rate
169326 reads; of these:
169326 (100.00%) were unpaired; of these:
162555 (96.00%) aligned 0 times
2517 (1.49%) aligned exactly 1 time
4254 (2.51%) aligned >1 times
4.00% overall alignment rate
172453 reads; of these:
172453 (100.00%) were unpaired; of these:
166146 (96.34%) aligned 0 times
2266 (1.31%) aligned exactly 1 time
4041 (2.34%) aligned >1 times
3.66% overall alignment rate
156014 reads; of these:
156014 (100.00%) were unpaired; of these:
149987 (96.14%) aligned 0 times
620 (0.40%) aligned exactly 1 time
5407 (3.47%) aligned >1 times
3.86% overall alignment rate
160131 reads; of these:
160131 (100.00%) were unpaired; of these:
152865 (95.46%) aligned 0 times
742 (0.46%) aligned exactly 1 time
6524 (4.07%) aligned >1 times
4.54% overall alignment rate
156014 reads; of these:
156014 (100.00%) were unpaired; of these:
2764 (1.77%) aligned 0 times
16057 (10.29%) aligned exactly 1 time
137193 (87.94%) aligned >1 times
98.23% overall alignment rate
160131 reads; of these:
160131 (100.00%) were unpaired; of these:
3317 (2.07%) aligned 0 times
16148 (10.08%) aligned exactly 1 time
140666 (87.84%) aligned >1 times
97.93% overall alignment rate
42871 reads; of these:
42871 (100.00%) were unpaired; of these:
41384 (96.53%) aligned 0 times
205 (0.48%) aligned exactly 1 time
1282 (2.99%) aligned >1 times
3.47% overall alignment rate
41708 reads; of these:
41708 (100.00%) were unpaired; of these:
39940 (95.76%) aligned 0 times
238 (0.57%) aligned exactly 1 time
1530 (3.67%) aligned >1 times
4.24% overall alignment rate
169326 reads; of these:
169326 (100.00%) were unpaired; of these:
8638 (5.10%) aligned 0 times
15012 (8.87%) aligned exactly 1 time
145676 (86.03%) aligned >1 times
94.90% overall alignment rate
172453 reads; of these:
172453 (100.00%) were unpaired; of these:
8675 (5.03%) aligned 0 times
15082 (8.75%) aligned exactly 1 time
148696 (86.22%) aligned >1 times
94.97% overall alignment rate
Ì also checked the final result SRR794547/result/SRR794547_final.result.txt and as below:
A HLA-A*31:01:02 HLA-A*02:01:01
B HLA-B*07:02:01 HLA-B*15:01:01
C HLA-C*03:03:01 HLA-C*07:02:01
DRB1 HLA-DRB1*15:01:01 HLA-DRB1*04:04:01
DQA1 HLA-DQA1*01:02:01 HLA-DQA1*03:01:01
DQB1 HLA-DQB1*06:02:01 HLA-DQB1*03:02:01
DPA1 HLA-DPA1*01:03:01 -
DPB1 HLA-DPB1*02:01:02 HLA-DPB1*06:01
DMA HLA-DMA*01:01:01 -
DMB HLA-DMB*01:01:01 -
DOA HLA-DOA*01:01:02 HLA-DOA*01:01:01
DOB HLA-DOB*01:01:01 HLA-DOB*01:04:01
DRA HLA-DRA*01:02:03 HLA-DRA*01:01:01
DRB2 Not typed Not typed
DRB3 HLA-DRB3*02:02:01 -
DRB4 HLA-DRB4*01:03:01 HLA-DRB4*01:03:01
DRB5 HLA-DRB5*01:01:01 -
DRB6 HLA-DRB6*02:01 -
DRB7 HLA-DRB7*01:01:01 -
DRB8 HLA-DRB8*01:01 -
DRB9 HLA-DRB9*01:01 -
E HLA-E*01:01:01 HLA-E*01:03:02
F HLA-F*01:01:01 HLA-F*01:01:03
G HLA-G*01:03:01 HLA-G*01:01:01
H HLA-H*01:01:01 HLA-H*02:04
J HLA-J*01:01:01 -
K HLA-K*01:01:01 HLA-K*01:02
L HLA-L*01:01:01 HLA-L*01:02
V HLA-V*01:01:01 -
So hlahd is running fine manually outside the nextNEOpi. Thanks
Can you post the .command.run
from /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/dd/b93680559f929b4300e69e79c18561/
command.run.tar.gz thanks
Do you run nextNEOpi
on a cluster? If yes, then please make sure that also your compute nodes have bowtie2
installed and located somewhere in your PATH
Hi, I am not running on a cluster. I am running nextNEOpi on a Windows 11 machine using WSL2 ubuntu installed. This workstation has: 48GB RAM 16 CPUs I did set the PATH variable for bowtie2 in the Ubuntu. Any comment? Thanks for your time
Hi, I know my troubleshooting has been painful. Do you know any groups where I can interact with people who is running nextNEOpi successfully? Also, I may find someone running this tool on WSL!
Thank you for your time
Hi, we run it regularly at our institute and it works fine on native Linux. Unfortunately we do not use WSL and we have no information about other groups using WSL, so it is difficult for us to help you with WSL.
It seems, however, that somehow the path to bowtie2
is not correctly set when launching the pipeline.
Can you run the following nextflow
code, and post the output:
nextflow.enable.dsl = 1
process TEST_PATH {
debug true
input:
output:
script:
"""
echo "PATH:"
echo \$PATH
echo ""
"""
}
process TEST_BWT2 {
debug true
input:
output:
script:
"""
echo "BOWTIE2"
bowtie2 --version
echo ""
"""
}
save the code as test.nf
and run it as follows:
NXF_VER=22.10.8 nextflow run test.nf
below is the command:
Analysis/nextNEOpi/demo$ NXF_VER=22.10.8 nextflow run test.nf
below is output:
N E X T F L O W ~ version 22.10.8
Launching `test.nf` [friendly_caravaggio] DSL1 - revision: 0fa69dde75
executor > local (2)
[9f/3abc96] process > TEST_PATH [100%] 1 of 1 ✔
[64/74d8f7] process > TEST_BWT2 [ 0%] 0 of 1
PATH:
/usr/local/go/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.2/bin:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.2/libnvvp:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/libnvvp:/mnt/c/Program Files/Eclipse Adoptium/jdk-11.0.13.8-hotspot/bin:/mnt/c/Program Files/Eclipse Adoptium/jdk-17.0.1.12-hotspot/bin:/mnt/c/windows/system32:/mnt/c/windows:/mnt/c/windows/System32/Wbem:/mnt/c/windows/System32/WindowsPowerShell/v1.0/:/mnt/c/windows/System32/OpenSSH/:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/MATLAB/R2020b/bin:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.0/extras/CUPTI/lib64:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.0/include:/mnt/c/tools/cuda/bin:/mnt/c/WINDOWS/system32:/mnt/c/WINDOWS:/mnt/c/WINDOWS/System32/Wbem:/mnt/c/WINDOWS/System32/WindowsPowerShell/v1.0/:/mnt/c/WINDOWS/System32/OpenSSH/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/NVIDIA Corporation/Nsight Compute 2020.3.0/:/mnt/c/Program Files (xexecutor > local (2)
[9f/3abc96] process > TEST_PATH [100%] 1 of 1 ✔
[64/74d8f7] process > TEST_BWT2 [100%] 1 of 1 ✔
PATH:
/usr/local/go/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.2/bin:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.2/libnvvp:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/libnvvp:/mnt/c/Program Files/Eclipse Adoptium/jdk-11.0.13.8-hotspot/bin:/mnt/c/Program Files/Eclipse Adoptium/jdk-17.0.1.12-hotspot/bin:/mnt/c/windows/system32:/mnt/c/windows:/mnt/c/windows/System32/Wbem:/mnt/c/windows/System32/WindowsPowerShell/v1.0/:/mnt/c/windows/System32/OpenSSH/:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/MATLAB/R2020b/bin:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.0/extras/CUPTI/lib64:/mnt/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.0/include:/mnt/c/tools/cuda/bin:/mnt/c/WINDOWS/system32:/mnt/c/WINDOWS:/mnt/c/WINDOWS/System32/Wbem:/mnt/c/WINDOWS/System32/WindowsPowerShell/v1.0/:/mnt/c/WINDOWS/System32/OpenSSH/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/NVIDIA Corporation/Nsight Compute 2020.3.0/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/WINDOWS/system32:/mnt/c/WINDOWS:/mnt/c/WINDOWS/System32/Wbem:/mnt/c/WINDOWS/System32/WindowsPowerShell/v1.0/:/mnt/c/WINDOWS/System32/OpenSSH/:/mnt/c/Users/dralmazroa/AppData/Local/Programs/Julia-1.10.0/bin:/mnt/c/Users/dralmazroa/.julia/packages/VariantVisualization/1yoNl:/snap/bin:/home/mamoon/.dotnet/tools:/mnt/e/Mamoon/software:/home/mamoon/go/bin:/mnt/e/Mamoon/software/bowtie2-2.5.2-linux-x86_64:/mnt/e/Mamoon/software/HLAHD/hlahd.1.5.0/bin
BOWTIE2
/mnt/e/Mamoon/software/bowtie2-2.5.2-linux-x86_64/bowtie2-align-s version 2.5.2
64-bit
Built on 3da406ad167d
Fri Oct 13 23:41:48 UTC 2023
Compiler: gcc version 9.3.1 20200408 (Red Hat 9.3.1-2) (GCC)
Options: -O3 -msse2 -funroll-loops -g3 -g -O2 -fvisibility=hidden -I/hbb_exe_gc_hardened/include -ffunction-sections -fdata-sections -fstack-protector -D_FORTIFY_SOURCE=2 -fPIE -std=c++11 -DPOPCNT_CAPABILITY -DNO_SPINLOCK -DWITH_QUEUELOCK=1 -DWITH_ZSTD
Sizeof {int, long, long long, void*, size_t, off_t}: {4, 8, 8, 8, 8, 8}
Strange that "PATH:" is printed out twice and that the line below it is different. In the first case the bowtie2
path is missing. However, there should be only one output for "PATH:" if you are running exactly the code from above.
Can you edit the nextNEOpi.nf
and add echo \$PATH
before and after line https://github.com/icbi-lab/nextNEOpi/blob/fe7b21cdc0b97aae38195e5f5ac1b9851674f6b1/nextNEOpi.nf#L5024
Then run the example again and post the .command.log .command.err .command.out
from the hlahd
process work dir.
I added "echo \$PATH" as you mentioned. Surprisingly, "get_vhla (test)" is successful this time. I am really surprised. Do you have an explanation for this behavior? The pipeline is still running. I will update you on the status. Many thanks
Now it broke on "cnvkit (test)" stage. Please see the error below:
Error executing process > 'CNVkit (test)'
Caused by:
Process `CNVkit (test)` terminated with an error exit status (1)
Command executed:
# set Agg as backend for matplotlib
export MATPLOTLIBRC="./matplotlibrc"
echo "backend : Agg" > $MATPLOTLIBRC
cnvkit.py \
batch \
test_tumor_DNA_aligned_sort_mkdp.bam \
--normal test_normal_DNA_aligned_sort_mkdp.bam \
--method hybrid \
--targets S07604514_Covered.bed \
--fasta GRCh38.d1.vd1.fa \
--annotate gencode.v33.primary_assembly.annotation.gtf \
--access access-5kb.GRCh38.bed \
\
-p 16 \
--output-reference output_reference.cnn \
--output-dir ./
cnvkit.py segmetrics \
-s test_tumor_DNA_aligned_sort_mkdp.cn{s,r} \
--ci \
--pi
cnvkit.py call \
test_tumor_DNA_aligned_sort_mkdp.cns \
--filter ci \
-m clonal \
--purity 0.55 \
--sample-sex female \
\
-o test_tumor_DNA_aligned_sort_mkdp.call.cns
cnvkit.py \
scatter \
test_tumor_DNA_aligned_sort_mkdp.cnr \
-s test_tumor_DNA_aligned_sort_mkdp.cns \
-o test_tumor_DNA_aligned_sort_mkdp_scatter.png
cnvkit.py \
diagram \
test_tumor_DNA_aligned_sort_mkdp.cnr \
-s test_tumor_DNA_aligned_sort_mkdp.cns \
--sample-sex female \
\
-o test_tumor_DNA_aligned_sort_mkdp_diagram.pdf
cnvkit.py \
breaks \
test_tumor_DNA_aligned_sort_mkdp.cnr test_tumor_DNA_aligned_sort_mkdp.cns \
-o test_tumor_DNA_aligned_sort_mkdp_breaks.tsv
cnvkit.py \
genemetrics \
test_tumor_DNA_aligned_sort_mkdp.cnr \
-s test_tumor_DNA_aligned_sort_mkdp.cns \
--sample-sex female \
\
-t 0.2 -m 5 \
-o test_tumor_DNA_aligned_sort_mkdp_gainloss.tsv
# run PDF to PNG conversion if mogrify and gs is installed
mogrify -version > /dev/null 2>&1 && \
gs -v > /dev/null 2>&1 && \
mogrify -density 600 -resize 2000 -format png *.pdf
# clean up
rm -f $MATPLOTLIBRC
Command exit status:
1
Command output:
(empty)
Command error:
Summary: #bins=280504, #reads=1184722, mean=4.2236, min=0.0, max=2244.1825396825398
Percent reads in regions: 51.077 (of 2319488 mapped)
Wrote ./test_tumor_DNA_aligned_sort_mkdp.targetcoverage.cnn with 280504 regions
Processing reads in test_tumor_DNA_aligned_sort_mkdp.bam
Time: 2.705 seconds (178542 reads/sec, 15074 bins/sec)
Summary: #bins=40780, #reads=483025, mean=11.8447, min=0.0, max=10338.79365079365
Percent reads in regions: 20.825 (of 2319488 mapped)
Wrote ./test_tumor_DNA_aligned_sort_mkdp.antitargetcoverage.cnn with 40780 regions
Processing target: test_tumor_DNA_aligned_sort_mkdp
Keeping 277 of 280504 bins
WARNING: most bins have no or very low coverage; check that the right BED file was used
Processing antitarget: test_tumor_DNA_aligned_sort_mkdp
Keeping 4 of 40780 bins
Correcting for GC bias...
Correcting for RepeatMasker bias...
WARNING: Most antitarget bins (100.00%, 4/4) have low or no coverage; is this amplicon/WGS?
/usr/local/lib/python3.10/site-packages/cnvlib/cnary.py:492: FutureWarning: The default dtype for empty Series will be 'object' instead of 'float64' in a future version. Specify a dtype explicitly to silence this warning.
return pd.concat(resids) if resids else pd.Series([])
Antitargets are nan x more variable than targets
Wrote ./test_tumor_DNA_aligned_sort_mkdp.cnr with 281 regions
Segmenting ./test_tumor_DNA_aligned_sort_mkdp.cnr ...
Segmenting with method 'cbs', significance threshold 0.0001, in 16 processes
Traceback (most recent call last):
File "/usr/local/bin/cnvkit.py", line 10, in <module>
sys.exit(main())
File "/usr/local/lib/python3.10/site-packages/cnvlib/cnvkit.py", line 10, in main
args.func(args)
File "/usr/local/lib/python3.10/site-packages/cnvlib/commands.py", line 2075, in _cmd_segmetrics
segarr = read_cna(args.segments)
File "/usr/local/lib/python3.10/site-packages/cnvlib/cmdutil.py", line 12, in read_cna
return tabio.read(infile, into=CNA, sample_id=sample_id, meta=meta)
File "/usr/local/lib/python3.10/site-packages/skgenome/tabio/__init__.py", line 75, in read
dframe = reader(infile, **kwargs)
File "/usr/local/lib/python3.10/site-packages/skgenome/tabio/tab.py", line 17, in read_tab
dframe = pd.read_csv(infile, sep="\t", dtype={"chromosome": "str"})
File "/usr/local/lib/python3.10/site-packages/pandas/util/_decorators.py", line 211, in wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python3.10/site-packages/pandas/util/_decorators.py", line 331, in wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python3.10/site-packages/pandas/io/parsers/readers.py", line 950, in read_csv
return _read(filepath_or_buffer, kwds)
File "/usr/local/lib/python3.10/site-packages/pandas/io/parsers/readers.py", line 605, in _read
parser = TextFileReader(filepath_or_buffer, **kwds)
File "/usr/local/lib/python3.10/site-packages/pandas/io/parsers/readers.py", line 1442, in __init__
self._engine = self._make_engine(f, self.engine)
File "/usr/local/lib/python3.10/site-packages/pandas/io/parsers/readers.py", line 1735, in _make_engine
self.handles = get_handle(
File "/usr/local/lib/python3.10/site-packages/pandas/io/common.py", line 856, in get_handle
handle = open(
FileNotFoundError: [Errno 2] No such file or directory: 'test_tumor_DNA_aligned_sort_mkdp.cns'
Work dir:
/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/cd/6aa76a9d0aa4297882852c614adf0a
Tip: view the complete command output by changing to the process work dir and entering the command `cat .command.out`
I attached below the files from work dir: "cd/6aa76a9d0aa4297882852c614adf0a"
command.out.tar.gz command.log.tar.gz command.err.tar.gz command.run.tar.gz
Thanks
can you post the output of ls -la /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/cd/6aa76a9d0aa4297882852c614adf0a
What kind of data is it? WES, WGS or a targeted panel? TO bi sure, it is human, right?
mamoon@DESKTOP-CFF6CD1:/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/cd/6aa76a9d0aa4297882852c614adf0a$ ls -la
total 69000
drwxr-xr-x 1 mamoon mamoon 4096 Jan 10 16:16 .
drwxr-xr-x 1 mamoon mamoon 4096 Jan 10 16:01 ..
-rw-r--r-- 1 mamoon mamoon 0 Jan 10 16:11 .command.begin
-rw-r--r-- 1 mamoon mamoon 7531 Jan 10 16:16 .command.err
-rw-r--r-- 1 mamoon mamoon 7531 Jan 10 16:16 .command.log
-rw-r--r-- 1 mamoon mamoon 0 Jan 10 16:11 .command.out
-rw-r--r-- 1 mamoon mamoon 11950 Jan 10 16:11 .command.run
-rw-r--r-- 1 mamoon mamoon 1777 Jan 10 16:11 .command.sh
-rw-r--r-- 1 mamoon mamoon 0 Jan 10 16:11 .command.trace
-rw-r--r-- 1 mamoon mamoon 1 Jan 10 16:16 .exitcode
lrwxrwxrwx 1 mamoon mamoon 82 Jan 10 16:11 GRCh38.d1.vd1.fa -> /mnt/e/Mamoon/db_nextflow/references/hg38/gdc/GRCh38.d1.vd1/fasta/GRCh38.d1.vd1.fa
lrwxrwxrwx 1 mamoon mamoon 86 Jan 10 16:11 GRCh38.d1.vd1.fa.fai -> /mnt/e/Mamoon/db_nextflow/references/hg38/gdc/GRCh38.d1.vd1/fasta/GRCh38.d1.vd1.fa.fai
-rw-r--r-- 1 mamoon mamoon 1422362 Jan 10 16:14 S07604514_Covered.antitarget.bed
lrwxrwxrwx 1 mamoon mamoon 81 Jan 10 16:11 S07604514_Covered.bed -> /mnt/e/Mamoon/db_nextflow/ExomeCaptureKits/Agilent/hg38/S07604514_Covered_ann.bed
-rw-r--r-- 1 mamoon mamoon 13019076 Jan 10 16:14 S07604514_Covered.target.bed
lrwxrwxrwx 1 mamoon mamoon 112 Jan 10 16:11 access-5kb.GRCh38.bed -> /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/2d/9c7dacbfc621aa5b13944e0417b527/access-5kb.GRCh38.bed
lrwxrwxrwx 1 mamoon mamoon 96 Jan 10 16:11 gencode.v33.primary_assembly.annotation.gtf -> /mnt/e/Mamoon/db_nextflow/references/hg38/annotation/gencode.v33.primary_assembly.annotation.gtf
-rw-r--r-- 1 mamoon mamoon 14 Jan 10 16:11 matplotlibrc
-rw-r--r-- 1 mamoon mamoon 23229970 Jan 10 16:15 output_reference.cnn
-rw-r--r-- 1 mamoon mamoon 1693156 Jan 10 16:14 test_normal_DNA_aligned_sort_mkdp.antitargetcoverage.cnn
lrwxrwxrwx 1 mamoon mamoon 128 Jan 10 16:11 test_normal_DNA_aligned_sort_mkdp.bai -> /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/c3/ad562c927de630a227e9028575c363/test_normal_DNA_aligned_sort_mkdp.bai
lrwxrwxrwx 1 mamoon mamoon 128 Jan 10 16:11 test_normal_DNA_aligned_sort_mkdp.bam -> /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/c3/ad562c927de630a227e9028575c363/test_normal_DNA_aligned_sort_mkdp.bam
-rw-r--r-- 1 mamoon mamoon 14759907 Jan 10 16:14 test_normal_DNA_aligned_sort_mkdp.targetcoverage.cnn
-rw-r--r-- 1 mamoon mamoon 1699632 Jan 10 16:16 test_tumor_DNA_aligned_sort_mkdp.antitargetcoverage.cnn
lrwxrwxrwx 1 mamoon mamoon 127 Jan 10 16:11 test_tumor_DNA_aligned_sort_mkdp.bai -> /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/6d/1d954d7c42f1d93703c4538e708e6c/test_tumor_DNA_aligned_sort_mkdp.bai
lrwxrwxrwx 1 mamoon mamoon 127 Jan 10 16:11 test_tumor_DNA_aligned_sort_mkdp.bam -> /mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo/work/6d/1d954d7c42f1d93703c4538e708e6c/test_tumor_DNA_aligned_sort_mkdp.bam
-rw-r--r-- 1 mamoon mamoon 18729 Jan 10 16:16 test_tumor_DNA_aligned_sort_mkdp.cnr
-rw-r--r-- 1 mamoon mamoon 14762084 Jan 10 16:16 test_tumor_DNA_aligned_sort_mkdp.targetcoverage.cnn
The test data is as below: Test data is taken from: https://apps-01.i-med.ac.at/resources/nextneopi/nextNEOpi_testdata.tar.gz
mamoon@DESKTOP-CFF6CD1:/mnt/e/Mamoon/NRC21R-082-03/Analysis/nextNEOpi/demo$ ls -la testdata/
total 316376
drwxrwxrwx 1 root root 4096 Sep 21 18:37 .
drwxrwxrwx 1 root root 4096 Jan 10 15:35 ..
-rwxrwxrwx 1 root root 52692230 Sep 21 18:37 normal_DNA_R1.fastq.gz
-rwxrwxrwx 1 root root 52770181 Sep 21 18:36 normal_DNA_R2.fastq.gz
-rwxrwxrwx 1 root root 353 Sep 21 18:36 regions_4_test_dataset.txt
-rwxrwxrwx 1 root root 73599182 Sep 21 18:36 tumor_DNA_R1.fastq.gz
-rwxrwxrwx 1 root root 74170100 Sep 21 18:36 tumor_DNA_R2.fastq.gz
-rwxrwxrwx 1 root root 35029431 Sep 21 18:37 tumor_RNA_R1.fastq.gz
-rwxrwxrwx 1 root root 35695498 Sep 21 18:36 tumor_RNA_R2.fastq.gz
I realized the test data will not run successfully at "cnvit" due to limited read coverage. So, I am running the pipeline without "cnvkit". Let us see what happens. Thanks
No more feedback, assuming the issue as solved
After I fixed several issues with your help, the new issue popped up. Below is the error:
Please help to troubleshoot. It looks like "HLA_parser.py" is having some bug in indexing!
Thanks a lot