Closed Nour-EddineS closed 1 year ago
Hey! This is unfortunately a known issue, see here: https://github.com/BoevaLab/FREEC/issues/106. the current work around in the pipeline is to provide a len file using this parameter: https://nf-co.re/sarek/3.1.2/parameters#cf_chrom_len
Dear @FriederikeHanssen, How can I create a len file please? Best regards,
If you click "Help" a description opens up:
Control-FREEC requires a file containing all chromosome lenghts. By default the fasta.fai is used. If the fasta.fai file contains chromosomes not present in the intervals, it fails (see: https://github.com/BoevaLab/FREEC/issues/106).
In this case, a custom chromosome length can be specified. It must be of the same format as the fai, but only contain the relevant chromosomes.
Example shown here (it is from a different genome build, so don't use this one): http://bioinfo-out.curie.fr/projects/freec/src/hg18.len
You can check your bed file, and then only keep the chromosomes from the fai of the genome build you are using that are also in the bed file. Then name it to something my_len.len and add it with the above parameter. To download the fai, either check your local igenomes installation if you ave it, or you can get from https://ewels.github.io/AWS-iGenomes/ here
Has this worked? If yes, could you close this issue?
@FriederikeHanssen Yes it works Thanks :)
Title: FREEC_SOMATIC
Process Error: Missing Output File *_sample.cpn
in NFCORE Sarek 3.4.2
Body:
Hi NFCORE Sarek community,
I am currently running the NFCORE Sarek pipeline (version 3.4.2) for somatic variant calling on whole-exome sequencing data. During the execution of the pipeline, I encountered an issue with the FREEC_SOMATIC
process, where the expected output file(s) *_sample.cpn
are missing, causing the process to fail. Below are the details of the command I used and the error message I received:
nextflow run nf-core/sarek -r 3.4.2 -profile docker \
--input samplesheet.csv \
--outdir /home/obiorach/test-work-sarek/WES-test-result \
--genome hg19 \
--dbsnp /home/obiorach/whole-Exon-single-seq/ref-genome/known_sites.vcf/dbsnp_138.hg19.vcf.gz \
--known_indels /home/obiorach/whole-Exon-single-seq/ref-genome/known_indels.vcf/Mills_and_1000G_gold_standard.indels.hg19.sites.vcf.gz \
--max_cpus 20 \
--max_memory '30 GB' \
--igenomes_base 's3://ngi-igenomes/igenomes' \
--wes \
--intervals /home/obiorach/whole-Exon-single-seq/ref-genome/exom_targets.bed/HyperExomeV2_primary_targets.hg19.bed \
--tools mutect2,strelka,vep,manta,tiddit,cnvkit,controlfreec,msisensorpro \
--pon /home/obiorach/whole-Exon-single-seq/ref-genome/panel-of-normal/updated_Mutect2-exome-panel_vcf.vcf.gz \
--germline_resource /home/obiorach/whole-Exon-single-seq/ref-genome/germline-resource/renamed_gnomad.vcf.gz \
--vep_cache /home/obiorach/vep_cache \
--vep_species homo_sapiens \
--vep_genome GRCh37 \
--vep_cache_version 112 \
-c custom.config \
-resume
Aug-29 13:43:20.036 [TaskFinalizer-106] DEBUG nextflow.processor.TaskProcessor - Handling unexpected condition for
task: name=NFCORE_SAREK:SAREK:BAM_VARIANT_CALLING_SOMATIC_ALL:BAM_VARIANT_CALLING_SOMATIC_CONTROLFREEC:FREEC_SOMATIC (Tumour_vs_Normal); work-dir=/home/obiorach/test-work-sarek/work/c0/6ad8417512a5e92448b724cf07213a
error [nextflow.exception.MissingFileException]: Missing output file(s) `*_sample.cpn` expected by process `NFCORE_SAREK:SAREK:BAM_VARIANT_CALLING_SOMATIC_ALL:BAM_VARIANT_CALLING_SOMATIC_CONTROLFREEC:FREEC_SOMATIC (Tumour_vs_Normal)`
process {
// adding a regex to prevent the fully qualified name to take precedence
withName: ".*" {
time = 72.h
}
withName: 'NFCORE_SAREK:SAREK:BAM_VARIANT_CALLING_SOMATIC_ALL:BAM_VARIANT_CALLING_SOMATIC_MUTECT2:MUTECT2_PAIRED' {
memory = 30.GB // Allocate more memory for the Mutect2 process
cpus = 20 // Allocate more CPUs for the Mutect2 process
time = 72.h // Specific time limit for this process
}
}
.cpn
files were found.config.txt
file and command are correct..command.log
, .command.err
, and .command.out
files in the task's workDir
for additional details, but found no specific clues beyond the missing file error.-resume
flag, but the issue persists.Missing Output Files: Could anyone advise on potential causes for the missing *_sample.cpn
file and suggest further troubleshooting steps? Are there specific configuration requirements or dependencies that might lead to this issue?
Control-FREEC Visualization: Is there a special configuration required to enable visualization outputs from Control-FREEC? If so, could you provide guidance on how to set it up?
Thank you in advance for your support!
Best regards,
Obiora Ch.
Description of the bug
Dear SAREK Team, I want to use Somatic variant calling, but pipeline completed with errors. Thanks in advance for your help.
Best regards,
Command used and terminal output
Relevant files
nextflow.log samplesheet.csv tar_bla_cancer2.bed.tar.gz
System information
CPU: Intel(R) Xeon(R) CPU E5-1620 v3 @ 3.50GHz RAM: 32 GB Distribution: Ubuntu 22.04.1 LTS