morrislab / phylowgs

Application for inferring subclonal composition and evolution from whole-genome sequencing data.
GNU General Public License v3.0
108 stars 55 forks source link

AssertionError when I run create_phylowgs_inputs.py #121

Open llrhys opened 4 years ago

llrhys commented 4 years ago

I've encountered a problem while trying to generate the ssm_data.txt file using a .vcf file

/home/lvli/virtualenv/phyloWGS/lib/python2.7/site-packages/scipy/stats/stats.py:316: RuntimeWarning: divide by zero encountered in log log_a = np.log(a) Traceback (most recent call last): File "/home/lvli/virtualenv/phyloWGS/phylowgs-262325b219e6d31f672791a05c6f927a18963ded/parser/create_phylowgs_inputs.py", line 1356, in <module> main() File "/home/lvli/virtualenv/phyloWGS/phylowgs-262325b219e6d31f672791a05c6f927a18963ded/parser/create_phylowgs_inputs.py", line 1301, in main variant_ids, ref_read_counts, total_read_counts = parse_variants(samples, vcf_files, vcf_types, args.tumor_sample, args.missing_variant_confidence) File "/home/lvli/virtualenv/phyloWGS/phylowgs-262325b219e6d31f672791a05c6f927a18963ded/parser/create_phylowgs_inputs.py", line 1202, in parse_variants total_read_counts = impute_missing_total_reads(total_read_counts, missing_variant_confidence) File "/home/lvli/virtualenv/phyloWGS/phylowgs-262325b219e6d31f672791a05c6f927a18963ded/parser/create_phylowgs_inputs.py", line 1124, in impute_missing_total_reads assert np.sum(total_reads <= 0) == np.sum(np.isnan(total_reads)) == 0 AssertionError

The command is python ~/virtualenv/phyloWGS/phylowgs-262325b219e6d31f672791a05c6f927a18963ded/parser/create_phylowgs_inputs.py --cnvs S1=438CNVs.txt --cnvs S2=449CNVs.txt --cnvs S3=453CNVs.txt --cnvs S4=489CNVs.txt --cnvs S5=500CNVs.txt --cnvs S6=417CNVs.txt --vcf-type S1=vardict --vcf-type S2=vardict --vcf-type S3=vardict --vcf-type S4=vardict --vcf-type S5=vardict --vcf-type S6=vardict S1=438.hg19_multianno.vcfsubclone.vcf S2=449.hg19_multianno.vcfsubclone.vcf S3=453.hg19_multianno.vcfsubclone.vcf S4=489.hg19_multianno.vcfsubclone.vcf S5=500.hg19_multianno.vcfsubclone.vcf S6=417.hg19_multianno.vcfsubclone.vcf

I tried and find there is only one sample '417' is not useful.And I don't know why.Can anyone help me? 417CNVs.txt

head_417.vcf.txt