This script accepts an arg for the number of non-ref samples per site and filter a VDS to only those sites. It then calculates the number of those sites that are shared between pairs. For example, if the passed arg non_ref_samples is equal to 3, the script will filter the VDS to sites where hail's n_non_ref calculates to 3. It then grabs the samples found at that site, creates a list of sample pairs, and tallies the number of sites where n_non_ref=3 per pair. The script has an additional het_only filter if you just want to consider sites where no hom_var exists.
This script accepts an arg for the number of non-ref samples per site and filter a VDS to only those sites. It then calculates the number of those sites that are shared between pairs. For example, if the passed arg
non_ref_samples
is equal to 3, the script will filter the VDS to sites where hail's n_non_ref calculates to 3. It then grabs the samples found at that site, creates a list of sample pairs, and tallies the number of sites where n_non_ref=3 per pair. The script has an additionalhet_only
filter if you just want to consider sites where no hom_var exists.