mcmero / SVclone

A computational method for inferring the cancer cell fraction of tumour structural variation from whole-genome sequencing data.
BSD 3-Clause "New" or "Revised" License
40 stars 10 forks source link

Error while running SV clone using the 'consensus' mode for SNVs #35

Closed poojachandra closed 4 months ago

poojachandra commented 4 months ago

I want to input SNVs from multiple tools and would like to use SV Clone (with SNV) using the 'consensus' mode. The tool does not output the required output files. Is there an example of the file that I could use with the 'consensus' mode ? Thanks

mcmero commented 4 months ago

The consensus SNV input format was intended for SNVs from the PCAWG project, it does not perform any kind of consensus function. I would recommend creating your own consensus SNV file based on overlapping calls or use a tool/method intended for this function.

poojachandra commented 4 months ago

Thanks @mcmero . What should be the format for consensus file ?

mcmero commented 4 months ago

Please see the documentation for the supported SNV VCF formats.

poojachandra commented 4 months ago

Hi, Based on the documentation and using this file https://dcc.icgc.org/api/v1/download?fn=/release_28/Projects/BRCA-EU/simple_somatic_mutation.open.BRCA-EU.tsv.gz as an example, I created a file with the following columns to the consensus file:

But, the SV Clone is not taking the SNV input. Is there a column that might be missing ? Or perhaps if you could point me to an example file (with open access) for the consensus approach that would be helpful.

Thank You

mcmero commented 4 months ago

Can you please post the command and error message from SVclone, as well as a sample of your VCF input? The consensus format has t_ref_count and t_alt_count fields in the INFO field. I don't have a sample on hand as these data are behind ICGC access control.

There is nothing 'special' about this format specifically for consensus calls, it was just what was used in the PCAWG project by the variant calling working group. Any of the supported SNV formats will work when you create your SNV file, it will just depend on the VCF output you have available from your variant callers.

poojachandra commented 4 months ago

Here are the commands:

svclone annotate -i SV_input -b Bam -s sampleID --sv_format simple -cfg svclone_config.ini svclone count -i sampleID/sampleID_svin.txt -b Bam -s sampleID svclone filter -s sampleID -i sampleID/sampleID_svinfo.txt -p Purity_Ploidy_input.txt --cnvs CNV_input.csv --snvs test_snv.txt --snv_format consensus

Attaching the input snv file as well. test_snv.txt

mcmero commented 4 months ago

Your SNV input is not in VCF format. Please see here for the VCF file format specification. An example vcf file is provided in the SVclone repository (this uses the mutect format).