CenMAP
A centromere mapping and annotation pipeline for T2T human genome assemblies implemented in Snakemake
.
|
|
Verkko
or hifiasm
human genome assemblies
- PacBio HiFi reads used in the assemblies
CHM13
reference genome assembly
- (Optional) Unaligned BAM files with 5mC modifications at CpG sites.
- Complete and correctly assembled centromere sequences and their regions.
- Centromere alpha-satellite higher order repeat (HOR) array lengths.
RepeatMasker
and HumAS-HMMER
alpha-satellite HOR monomer annotations and plots.
ModDotPlot
sequence identity plots.
- Combined sequence identity and HOR array structure plots.
- (Optional) Centromere dip region (CDRs) with
CDR-Finder
Read the docs on the CenMAP
wiki.
To run tests, refer to the wiki page.