This script takes two files containing phased read names from Pepper-Marginphase-DeepVariant and compares them to the phasing described in Shasta's Phasing.csv. It prints summary statistics and writes each read on which the two phasings disagree to a CSV.
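To illustrate what "disagreement" between two phasings can mean, here is a minimal sketch. It is not the script's actual logic. It assumes each phasing has already been loaded into a dict mapping read name to haplotype `0` or `1`, and that haplotype labels are arbitrary, so the two labelings may be swapped relative to each other. The function name `find_disagreements` and the dict layout are hypothetical.

```python
def find_disagreements(phases_a, phases_b):
    """Return the reads, phased by both sources, whose assignments conflict.

    phases_a, phases_b: dict mapping read name -> 0 or 1 (assumed layout).
    Haplotype labels are arbitrary, so both orientations are tried and the
    one producing fewer conflicts is kept.
    """
    # Only reads present in both phasings can be compared.
    shared = sorted(set(phases_a) & set(phases_b))

    # Conflicts if the labels are taken at face value.
    direct = [r for r in shared if phases_a[r] != phases_b[r]]
    # Conflicts if one labeling is flipped (0 <-> 1): reads whose raw labels match.
    flipped = [r for r in shared if phases_a[r] == phases_b[r]]

    return direct if len(direct) <= len(flipped) else flipped


if __name__ == "__main__":
    a = {"r1": 0, "r2": 0, "r3": 1, "r4": 1}
    b = {"r1": 1, "r2": 1, "r3": 0, "r4": 1, "r5": 0}
    print(find_disagreements(a, b))
```

Reads that appear in only one of the two phasings (`r5` here) are ignored in this sketch; whether the real script reports them is not stated above.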
Example stdout:
(where ARI = adjusted Rand index)
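For readers unfamiliar with the metric, the adjusted Rand index measures how well two labelings of the same items agree, corrected for chance and independent of how the labels are named. The following is a self-contained sketch of the standard formula, not necessarily how this script computes it; the function name `adjusted_rand_index` is chosen here for illustration.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two labelings of the same items."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same items")

    total_pairs = comb(len(labels_a), 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: nothing to disagree on

    # Contingency counts: items per (label_a, label_b) combination.
    joint = Counter(zip(labels_a, labels_b))
    sum_joint = sum(comb(c, 2) for c in joint.values())
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())

    expected = sum_a * sum_b / total_pairs
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:
        return 1.0  # degenerate case: both labelings are trivial
    return (sum_joint - expected) / (max_index - expected)


if __name__ == "__main__":
    # Same partition with swapped haplotype names.
    print(adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0]))
```

A value of 1 means the two phasings partition the reads identically (even if haplotype names are swapped), values near 0 mean agreement no better than chance, and negative values mean agreement worse than chance.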
Example CSV log: