zengxiaofei / HapHiC

HapHiC: a fast, reference-independent, allele-aware scaffolding tool based on Hi-C data
https://www.nature.com/articles/s41477-024-01755-3
BSD 3-Clause "New" or "Revised" License
140 stars 10 forks source link

Wrong assembly of X chromosome #46

Closed gotouerina closed 2 months ago

gotouerina commented 3 months ago

I use haphic to anchor a genome to chromosome level but the results may be wrong. The X chromosome was not successfully assembled.

zengxiaofei commented 3 months ago

It's very normal. Current scaffolding tools cannot guarantee perfect results. Manual adjustment in Juicebox is typically necessary. In addition, there are also other reasons that may lead to poor results. For example, if you are scaffolding a haplotype-resolved assembly, the Hi-C depths of the non-PAR regions on the sex chromosomes will be higher than those of autosomes, which can disrupt the clustering step during the scaffolding. In this case, you can refer to the parameters we used for the human genome HG002:

$ haphic pipeline <asm.fa> <filtered_HiC.bam> 46 --remove_allelic_links 2 --density_upper 1 --remove_concentrated_links --normalize_by_nlinks

HG002_X

zengxiaofei commented 2 months ago

Close this issue as there has been no response for two weeks.