Closed egnst closed 1 year ago
Hi @egnst, thank you for using NanoVar.
The error you encountered results from different chromosome/contig ids between the built-in hg38 filter bed file and the reference genome you provided. The hg38 built-in filter file uses ids of 'chr1', 'chr2', etc. Can you please let me know the chromosome/contig ids of your reference file? You can easily see all of them in the '.fai' file of the reference fasta.
If that's really the problem, one easy fix is to remove the '-f hg38' the option. But do note that the output variants in the VCF will not be filtered for known false positives.
Thank you for your explanation. I tried to manually edit all my chromosomes to be "chr1" instead of "1", etc. That made my script run longer before it crashed, but I started getting hs-blastn errors instead. I assumed there was just something fundamentally off about my reference genome, so I tried to download a different reference file from UCSC that uses "chr1" notation.
However, I'm still getting these hs-blastn errors. I've tried multiple reference genomes and multiple versions of hs-blastn. I'm out of ideas, so I'm happy to try any ideas you might have. Here's the errors from the log file:
[13/12/2022 12:21:55] - INFO - Make FMD-index
[13/12/2022 12:21:55] - DEBUG - [Tue Dec 13 12:21:55 2022 corelib/line_reader.c:HbnBufferedLineReaderNew:21]
@egnst There might be a problem with the hsblastn index. Can you try rerunning with the --force option to recreate the index? Thanks
I gave it a try, and I'm still getting the same thing:
[13/12/2022 14:27:03] - DEBUG - [Tue Dec 13 14:27:03 2022 corelib/line_reader.c:HbnBufferedLineReaderNew:21]
[13/12/2022 14:33:42] - INFO - Total number of mapped reads: 1803553
[13/12/2022 14:33:42] - INFO - Clustering SV breakends
[13/12/2022 14:34:03] - INFO - Filtering INS and INV SVs
[13/12/2022 14:34:28] - DEBUG - [Tue Dec 13 14:34:28 2022 app/hbnmap/cmdline_args.cpp:ParseHbnProgramCmdLineArguments:831]
@egnst can you confirm you have write access in the folder of your reference genome?
Closed due to no response
Hi Cheng Yong,
Since the last time I used NanoVar, I've upgraded my version of Anaconda, and I've finally upgraded to using hg38, and now my old script stopped working.
I used the following command:
Then I get the following error:
I've tried this using the fastq file and an lra-aligned bam file as input, and get the same error either way. What does this error actually mean? I'm happy for any ideas on what to try troubleshooting first.