Closed biomobot closed 11 months ago
We recommend using:
ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/405/GCA_000001405.15_GRCh38/seqs_for_alignment_pipelines.ucsc_ids/GCA_000001405.15_GRCh38_no_alt_analysis_set.fna.gz
Thanks @cjw85 Unfortunately, still getting the same error using the reference you recommended.
/home/mbio/epi2melabs/workflows/epi2me-labs/wf-human-variation/main.nf
[affectionatekepler] DSL2 - revision: 8546042309
|||||||||| ____ _ __ _
|||||||||| | _| _ | | \/ | __| | | | |
||||| | | | |) | | ) | |\/| | _| ___| |/ ` | ' \/ |
||||| | |_| /| | / /| | | | |_|| | (| | |) _ \
|||||||||| |____|_| |_|___|| ||| ||\,|._/|/
|||||||||| wf-human-variation v1.4.0If you use epi2me-labs/wf-human-variation for your analysis please cite:
[e8/3a5f63] Submitted process > cram_cache (1)
[d6/f6bd0a] Submitted process > getParams
[62/e5a427] Submitted process > snp:getVersions
[49/5c54a5] Submitted process > snp:getParams
[4a/844db9] Submitted process > str:getParams
[74/1a3314] Submitted process > getVersions
[fd/88e0a3] Submitted process > sv:runReport:getParams
[0b/a8f335] Submitted process > basecalling:wf_dorado:make_mmi
[85/ff4333] Submitted process > lookup_clair3_model (1)
[26/33146b] Submitted process > index_ref_fai (1)
[0a/cacb93] Submitted process > sv:runReport:getVersions
[40/ab7970] Submitted process > str:getVersions
[a9/2816a5] Submitted process > basecalling:wf_dorado:dorado (1)
[d9/46877a] Submitted process > basecalling:wf_dorado:dorado (2)
[73/7e22ad] Submitted process > getAllChromosomesBed (1)
[74/962c13] Submitted process > publish_artifact (3)
[21/cbec8d] Submitted process > publish_artifact (1)
[4a/551295] Submitted process > publish_artifact (2)
[5d/19cae9] Submitted process > sv:runBenchmark:intersectBedWithTruthset (1)
Error executing process > 'sv:runBenchmark:intersectBedWithTruthset (1)'
Caused by:
Process sv:runBenchmark:intersectBedWithTruthset (1)
terminated with an error exit status (1)
Command executed:
bedtools intersect -a ${WFSV_EVAL_DATA_PATH}/benchmark.bed -b allChromosomes.bed > target_truthset.bed
if [ ! -s target_truthset.bed ]
then
echo "No overlaps found between truth and target"
echo "Chr names in your target or reference and truthset may differ"
exit 1
fi
Command exit status:
1
Command output:
No overlaps found between truth and target
Chr names in your target or reference and truthset may differ
Command error:
***** WARNING: File /data/wf_human_sv_benchmark/NIST_SVs_Integration_v0.6/benchmark.bed has inconsistent naming convention for record:
1 834131 843115
***** WARNING: File /data/wf_human_sv_benchmark/NIST_SVs_Integration_v0.6/benchmark.bed has inconsistent naming convention for record:
1 834131 843115
Work dir:
/home/mbio/epi2melabs/instances/wf-human-variation_5fb641df-8bd2-4f43-bf27-fd5fd8beb852/work/5d/19cae93c45f8b10d35612620664f41
Tip: when you have fixed the problem you can continue the execution adding the option -resume
to the run command line
WARN: Killing running tasks (2)
The reference that @cjw85 has pointed you to (GCA_000001405.15_GRCh38_no_alt_analysis_set) is the reference we recommend for running the human variation pipeline. However, the SV benchmark data we use from NIST specifically requires alignments to human_g1k_v37 which can be downloaded via http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/.
Please note that the SV benchmarks will only be valid if you have sequenced HG002. If you do not need the benchmark information because you are not sequencing that genome, you can remove the option --sv_benchmark
. If you do not need the benchmarking information you can use the recommended reference (GCA_000001405.15_GRCh38_no_alt_analysis_set) that was linked in the previous comment.
What happened?
Thank you for updating the workflow. I am running the workflow epi2me-labs/wf-human-variation 1.4.0 on EPI2ME Labs. I have tried to use the UCSC human reference downloaded from: rsync -a -P rsync://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz Got an error.
I also tried to use the ensembl reference: Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz
Got a different error.
Not sure what reference I should be using. Maybe you can point me to the correct one? Thank you in advance. Looking forward to hearing from you.
Operating System
ubuntu 20.04
Workflow Execution
EPI2ME Labs desktop application
Workflow Execution - EPI2ME Labs Versions
EPI2ME Labs V4.1.3
Workflow Execution - CLI Execution Profile
None
Workflow Version
1.4.0
Relevant log output