GW-HIVE / biomuta-old

Documentation: https://biomuta.readthedocs.io/en/latest/
0 stars 0 forks source link

Liftover hg19 -> hg38 #17

Open mariacuria opened 1 week ago

mariacuria commented 1 week ago

Secondary issue.

mariacuria commented 3 days ago

Part of #16

mariacuria commented 2 days ago

@jeet-vora I have over 28 million GRCh37 records. UCSC LiftOver command line tool was able to map all but ~14K of them. The bulk of unmapped positions are records on chr24 which is presumably chrY (~8K) and mitochondrial DNA positions (~4K). My thoughts:

Do you have advice on how to deal with it?

mariacuria commented 1 day ago

https://crossmap.sourceforge.net/ Command-line tool and chain files that ENSEMBL uses (I'm going to use only the chain file to convert positions that UCSC tool was unable to map).