ernstlab / full_stack_ChromHMM_annotations

Data of genome annotation from full-stack ChromHMM model trained with 1032 datasets from 127 reference epigenomes
32 stars 1 forks source link

full_stack_ChromHMM_annotations

Data of genome annotation from full-stack ChromHMM model trained with 1032 datasets from 127 reference epigenomes. Please refer to the published manuscript at Genome Biology

Download links:

Data of full-stack genome annotations for reference assemblies hg19 can be found here . Within this folder:

├── hg19_genome_100_browser.bb: big bed file of the hg19's full stack annotation. 
This file can only be viewed properly on the ucsc genome browser through the trackhub link:
https://public.hoffman2.idre.ucla.edu/ernst/2K9RS///full_stack/full_stack_annotation_public_release/hub.txt
├── hg19_genome_100_browser.bed.gz: bed.gz version of the bigbed file hg19_genome_100_browser.bb. 
This file can be viewed as a custom track on UCSC genome browser
├── hg19_genome_100_segments.bed.gz: bed.gz version of the segmentation with only 4 columns corresponding to 
chrom, start, end, full_stack state
├── state_annotation_processed_publish.csv: annotations of the states 
(a more complete, excel-format version of this file is Additional File 3 in our published paper) 
├── state_annot_README.md: read me file for file state_annotation_processed_publish.csv
└── trackDb.txt

Data of full-stack genome annotations for reference assemblies hg38 can be found here . Full-stack annotations in hg38 were created by lifting-over the annotation from hg10 to hg38. The whole liftOver pipeline's code is provide here . Within this folder:

├── hg38_genome_100_browser.bb: bigbed file of the hg38's full-stack annotation lifted over from hg19. 
This file can only be viewed properly on the ucsc genome browser through the trackhub link:
https://public.hoffman2.idre.ucla.edu/ernst/2K9RS///full_stack/full_stack_annotation_public_release/hub.txt
├── hg38_genome_100_browser.bed.gz: bed.gz version fo the bigbed file hg38_genome_100_browser.bb. 
This file can be viewed as a custom track on UCSC genome browser
├── hg38_genome_100_segments.bed.gz: bed.gz version of the segmentation with only 4 columns corresponding to 
chrom, start, end, full_stack state
├── state_annotations_processed.csv: annotations of the states 
(a more complete, excel-format version of this file is Additional File 3 in our published paper)
├── state_annot_README.md: read me file for file state_annotation_processed_publish.csv
└── trackDb.txt