Closed balwierz closed 5 months ago
That is a fundamental problem with the tool liftOver itself that we do not know yet how to fix.
(2) After lifting 200-bp segments from hg19 to hg38, we would like to get rid of regions in hg38 that are mapped from multiple regions in hg19. We have coded this step with the implicit assumption that the length of 200-bp segments from hg19 mapped to hg38 are 200-bp. Therefore, it missed this case where the the end of one segment (148712700) is greater than the end of the overlapping segments (146872566, 146872966, 146873766).
Our solution at the moment: We reran the liftOver to resolve all these issues by simply removing all segments in hg38 that are mapped from 200-bp segments in hg19 but are not strictly 200-bp in hg38. It means that we strictly limit the liftOver to segments in hg19 have a unique 200-bp mapped hg38 segment. The data to download these files can be found at https://public.hoffman2.idre.ucla.edu/ernst/2K9RS//full_stack/full_stack_annotation_public_release/hg38. These are the files you can use:
I typed my answer into a word document, and not sure why it does not allow pasting as text. So here is the link where you can download the hg38 chromatin state maps (please see above answer for the files that are of interests to you): https://public.hoffman2.idre.ucla.edu/ernst/2K9RS//full_stack/full_stack_annotation_public_release/hg38
Hello! We have updated our readme with new links to the data, such that the problem of overlapping annotations are no longer present. Is it okay if we close this issue? We will keep it open in 3 days and then will close it!
Hi, I found that there are cases where regions overlap. For instance the first line below overlaps with the 3 subsequent regions.