Open anashank opened 1 year ago
Hi,
I think the difference in the vcf input format causes this error. RNAseqCNV expects the output format of GATK. If you compare the accepted format in our README(https://github.com/honzee/RNAseqCNV#2141-vcf-) and your file, there are differences.
If possible, I would generate the vcf files with GATK, otherwise, you could reformat the files you already have.
Also, you could first try reformating the CHROM column as suggested here: https://github.com/honzee/RNAseqCNV/issues/22
Best, Jan
I've encountered the same issue when using "custom" SNV table. In my case the problem was due to SNV filtering on database produced empty table. Passing SNP_to_keep = FALSE
argument to the wrapper bypasses the filtering step.
Good point @DzmitryGB , thank you for the comment.
Hi, when I run using a VCF generated from another pipeline, I get this long error message (See below). The format of the VCF I have is also shown below. It has some differences in the INFO, FORMAT fields compared to the GATK VCF. Can this VCF be used as input? What changes need to be made for the tool to work? Any suggestions would be helpful!
CHROM POS ID REF ALT QUAL FILTER INFO FORMAT Sample
chr1 1168629 . C T . weak_evidence DP=4;MQ=216.96;FractionInformativeReads=1.000;RatioSoftClips=0.00 GT:SQ:AD:AF:F1R2:F2R1:DP:SB:MB 0/1:1.03:3,1:0.250:2,0:1,1:4:3,0,0,1:2,1,1,0
Reading in vcf file..
Extracting depth..
Extracting reference allele and alternative allele depths..
Needed information from vcf extracted
Finished reading vcf
Error in
mutate()
: ℹ In argument:snvOrd = 1:n()
. Caused by error: !snvOrd
must be size 0 or 1, not 2. Traceback:<named list>
), abort(class = c(class, "dplyr:::internal_error"), . dplyr_error_data = data)), parent = c(0L, 1L, 2L, 3L, . 4L, 5L, 6L, 7L, 6L, 9L, 10L, 4L, 12L, 13L, 13L, 15L, 16L, 17L, . 18L, 19L, 13L, 13L, 13L, 23L, 24L, 0L, 26L, 27L, 0L, 0L, 0L, . 0L, 0L, 0L, 34L, 35L, 35L, 37L, 38L, 0L, 40L), visible = c(TRUE, . TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, . TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, . TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, . TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, FALSE), namespace = c("IRkernel", . NA, "IRkernel", NA, "base", "base", "base", "base", "base", "base", . "base", "evaluate", "evaluate", "evaluate", "evaluate", "base", . "base", "base", "base", "base", "base", "base", "evaluate", "base", . "base", "RNAseqCNV", "RNAseqCNV", NA, "dplyr", "dplyr", "dplyr", . "dplyr", "dplyr", "dplyr", "dplyr", "base", "dplyr", NA, "dplyr", . "dplyr", "rlang"), scope = c("::", NA, "local", NA, "::", "local", . "local", "local", "local", "local", "local", "::", ":::", "local", . "local", "::", "::", "local", "local", "local", "::", "::", ":::", . "::", "::", "::", ":::", NA, "::", "::", "::", "::", "::", ":::", . ":::", "::", ":::", NA, "local", ":::", "::"), error_frame = c(FALSE, . FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, . FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, . FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, . FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, . FALSE, FALSE, TRUE, FALSE)), row.names = c(NA, -41L), version = 2L, class = c("rlang_trace", . "rlib_trace", "tbl", "data.frame")), parent = NULL, dplyr_error_data = list( . result_size = 2L, expected_size = 0L), call = dplyr_internal_error("dplyr:::mutate_incompatible_size", . list(result_size = 2L, expected_size = 0L)), use_cli_format = TRUE), class = c("dplyr:::mutate_incompatible_size", . "dplyr:::internal_error", "rlang_error", "error", "condition" . )))