if the sites did not include reference C's, then this could be used for BS-Seq data.
currently, the sites file includes 38399 sites and 27978 of them are not C's and 30415 of them are not C to T.
So we could just distribute a sites file with no C/T variants.
if the sites did not include reference C's, then this could be used for BS-Seq data.
currently, the sites file includes 38399 sites and 27978 of them are not C's and 30415 of them are not C to T. So we could just distribute a sites file with no C/T variants.