Closed by guilhermesena1 2 years ago
That was my mistake. We don't need the awk step in the middle. That kind of filtering keeps only the significant CpGs, so merge sees no non-significant sites between them and just combines them all into one region spanning the whole chromosome (which is expected behavior).
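To illustrate why the pre-filtering has this effect, here is a minimal toy sketch. It is not the actual merge implementation: it assumes only that merging joins consecutive significant rows and that a non-significant row ends the current region. The function name `merge_significant`, the 0.01 cutoff, and the example sites are all made up for illustration.

```python
from typing import List, Tuple

Site = Tuple[str, int, float]        # (chrom, position, p-value)
Region = Tuple[str, int, int, int]   # (chrom, start, end, number of sites)


def merge_significant(sites: List[Site], cutoff: float = 0.01) -> List[Region]:
    """Join runs of consecutive significant sites into regions (toy model)."""
    regions: List[Region] = []
    current = None  # [chrom, start, end, count] of the region being built
    for chrom, pos, pval in sites:
        if pval < cutoff:
            if current is not None and current[0] == chrom:
                # Extend the open region to include this site.
                current[2] = pos + 1
                current[3] += 1
            else:
                if current is not None:
                    regions.append(tuple(current))
                current = [chrom, pos, pos + 1, 1]
        elif current is not None:
            # A non-significant site "breaks" the region.
            regions.append(tuple(current))
            current = None
    if current is not None:
        regions.append(tuple(current))
    return regions


# Hypothetical input: two significant clusters separated by one constant CpG.
sites = [
    ("chr1", 100, 0.001),
    ("chr1", 150, 0.002),
    ("chr1", 5000, 0.8),
    ("chr1", 90000, 0.003),
    ("chr1", 90040, 0.004),
]

unfiltered = merge_significant(sites)
prefiltered = merge_significant([s for s in sites if s[2] < 0.01])

print(unfiltered)   # two separate regions
print(prefiltered)  # one region spanning everything
```

With the constant CpG left in the input, the two clusters stay separate. Once it is filtered out beforehand, nothing is left to break the run, so the toy merge returns a single region from the first to the last significant site.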
Datasets from MethBase:
Gao-Human-2015/*Blood*.meth
Roadmap-Human-2015/*Esophagus*.meth
all from hg38
design-matrix.txt:
Commands:
Result: 2803905886 (which is almost the size of the human genome).
This is probably not the desired behavior, since many CpGs are expected to be constant and should "break" the genomic regions.