Open yuanzhao0502 opened 6 years ago
Hi @yuanzhao0502
The coverage
field in the parameter file is hard-coded to report what you specify as coverage to the script. It is a legacy feature and not used in the snakemake pipeline.
To determine the coverage, you need to compute this yourself - as you have done.
You should not use any bin size smaller than 500kb if you do not have a matched normal sample. If you use smaller bins, then germline CNVs can confound the tumor fraction estimation. It is usually better to simply use 1Mb bins because it offers the cleanest normalized read coverage signals for estimating the tumor fraction. I am not sure why you are seeing the discrepancy between 1Mb and 500kb. Without having seen the results/plots, I would probably tend to trust the 1Mb results.
Hope this helps. Gavin
I am sorry to ask some basic questions: