SURVIVOR filter based on AF has no effect #96

Open oleraj opened 5 years ago

oleraj commented 5 years ago


I have Manta VCFs for >100 individuals that I've merged using SURVIVOR merge with this command:

SURVIVOR merge sample_files.txt 1000 1 1 1 0 0 Manta_merged.vcf 

Then I tried to filter based on AF using this command:

SURVIVOR filter Manta_merged.vcf NA -1 -1 0.10 10 Manta_merged.filt.AF10.vcf


SURVIVOR filter Manta_merged.vcf NA -1 -1 1.00 10 Manta_merged.filt.AF100.vcf

However, there seems to be no difference in the output, so I think the filter is not working. Both files have the same line count:

wc -l Manta_merged.filt.AF10.vcf Manta_merged.filt.AF100.vcf 
     64111 Manta_merged.filt.AF10.vcf
     64111 Manta_merged.filt.AF100.vcf

I also noticed that SURVIVOR merge doesn't add an AF field to the VCF. Is it supposed to add this?


Any other suggestions for filtering by AF?
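A quick way to confirm whether a merged VCF carries an AF field at all is to count the records whose INFO column contains an `AF` key. The following is a minimal sketch in Python, not part of SURVIVOR; the function name `count_af_records` is hypothetical, and it assumes a plain-text, tab-separated VCF with INFO in the eighth column.

```python
def count_af_records(vcf_lines):
    """Return (total_records, records_with_AF) for an iterable of VCF lines.

    Assumption: uncompressed, tab-separated VCF text; INFO is column 8.
    """
    total = 0
    with_af = 0
    for line in vcf_lines:
        # Skip header lines and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        total += 1
        info = line.rstrip("\n").split("\t")[7]
        # Collect the INFO keys (flags have no "=value" part).
        keys = {field.split("=", 1)[0] for field in info.split(";")}
        if "AF" in keys:
            with_af += 1
    return total, with_af
```

Passing an open file handle for `Manta_merged.vcf` to `count_af_records` would show how many records have an AF value; if the second number is zero, a filter that relies on AF has nothing to act on.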



fritzsedlazeck commented 5 years ago

Hi Andrew, the easier one first: SURVIVOR filter looks for the AF tag, so it won't work if that tag is not there.

SURVIVOR merge currently does not add an AF tag to the VCF. It's a nice idea to include it, and the code is there. I just don't take the genotype into account, so it would be the frequency of samples. Would that be OK?

Thanks, Fritz
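To illustrate what a "frequency of samples" would look like, here is a rough Python sketch that divides the SUPP value (number of supporting samples) by the total number of samples and keeps records at or above a threshold. This is not SURVIVOR's code: the function names, the `n_samples` and `min_freq` parameters, and the choice to keep records with frequency greater than or equal to the threshold are all assumptions made for illustration. It ignores genotypes, so it is a sample frequency rather than a true allele frequency.

```python
def sample_frequency(info, n_samples):
    """Return SUPP / n_samples for one INFO string, or None if SUPP is absent."""
    for field in info.split(";"):
        key, _, value = field.partition("=")
        if key == "SUPP":
            return int(value) / n_samples
    return None


def filter_by_sample_frequency(vcf_lines, n_samples, min_freq):
    """Yield header lines unchanged, plus records with SUPP/n_samples >= min_freq.

    Assumptions: tab-separated VCF text, INFO in column 8, and SUPP holding
    an integer count of supporting samples.
    """
    for line in vcf_lines:
        if line.startswith("#"):
            yield line  # pass the header through untouched
            continue
        if not line.strip():
            continue  # skip blank lines
        info = line.rstrip("\n").split("\t")[7]
        freq = sample_frequency(info, n_samples)
        if freq is not None and freq >= min_freq:
            yield line
```

For a merged file of 100 samples, calling `filter_by_sample_frequency(fh, 100, 0.10)` would keep only the variants supported by at least 10 samples.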

oleraj commented 5 years ago

Hi Fritz,

I'm not sure why you're not able to take genotype into account -- is it because the genotype calls from different SV callers are not consistent or trustworthy?

As an alternative, I'm thinking I could use the SUPP tag for filtering with bcftools. However, the data type is not specified correctly in the header: it should be Integer but says String, so bcftools can't use it for filtering:

##INFO=<ID=SUPP,Number=1,Type=String,Description="Number of samples supporting the variant">

Could you update the data type for SUPP and other tags in the header (e.g., CIEND, CIPOS) to Integer?
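Until the header is changed upstream, one possible workaround is to rewrite the `Type=` declaration in the relevant `##INFO` lines before handing the file to bcftools. The Python sketch below does only that; the function names are hypothetical, it assumes `Type=` appears before `Description=` in the header line (as in the SUPP line quoted above), and it does not check that the values in the records actually parse as integers.

```python
import re


def set_info_type(header_line, tag_id, new_type):
    """Replace Type=... in header_line if it is the ##INFO line for tag_id."""
    # The trailing comma keeps ID=SUPP from also matching ID=SUPP_VEC.
    if not header_line.startswith("##INFO=<ID=" + tag_id + ","):
        return header_line
    # Only the first Type=... is replaced, leaving the Description untouched.
    return re.sub(r"Type=[A-Za-z]+", "Type=" + new_type, header_line, count=1)


def retype_header(vcf_lines, tag_ids, new_type="Integer"):
    """Yield VCF lines with the listed INFO tags re-declared as new_type."""
    for line in vcf_lines:
        if line.startswith("##INFO="):
            for tag_id in tag_ids:
                line = set_info_type(line, tag_id, new_type)
        yield line
```

Writing the output of `retype_header(fh, ["SUPP"])` to a new file should then let a bcftools expression such as `INFO/SUPP>=10` treat SUPP numerically, provided every SUPP value is a plain integer.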



fritzsedlazeck commented 5 years ago

I could, but some callers do not report genotypes and some are not very robust. Maybe I just should... sorry for thinking out loud.

oleraj commented 5 years ago

No problem, that makes sense. For now I think the highest priority would be to update the type for these tags to Integer as I mentioned.