Open ddelgadillod opened 3 years ago
Hi @ddelgadillod When scaffolding the tetraploid sugarcane genome, I first assigned contigs into 10 homologous groups based on its close relatives, sorghum genome following this wiki guideline (https://github.com/tangerzhang/ALLHiC/wiki/ALLHiC:-scaffolding-an-auto-polyploid-sugarcane-genome). After that, contigs in each group were subject to ALLHiC phasing pipeline as you showed above. Did you use the same method when you mean that you ran every step in the wiki tutorial for sugarcane? There are a couple of factors that may affect the phasing and scaffolding according our experiences:
Can you perform the dotplot analysis between the ALLHiC scaffolds and diploid reference? This will help you check the chimeric scaffolds in ALLHiC results.
Hi, The problem was the group assignment to the homologous groups. We divided the contigs into the 12 homologous chromosomes divided by two, thus 24 (chri_1, chri_2) because we are using as a reference a haplotype assembly of a diploid potato. For sugarcane, were you able to recover the haplotype assembly for each chromosome? What is your suggestion to obtain a haplotype assembly of a tetraploid potato using a diploid reference assembly to separate the contigs?
Did you use purge_haplotigs or something similar after using allhic?
Looking forward to an answer Best Regards
Hi
I'm running AllHiC to phasing/scaffolding tetraploid potato genome, with draft assembly of 6692 contigs and 2.4 Gb length I run every step in the wiki tutorial for sugarcane, How can I improve grouping and scaffolding, and how can I avoid extremely longer contig, I can see that intermediate group1 files (txt, .tour) are much longer than other group' files.
I run:
At partition step I run with values k = 4,6,12,48, the restriction enzyme is DpnII GATC same that MBOI
At this point, I run a Quast analysis and see that with every k value, I obtain a supercontig of 2.2 Gb and a total length of 2.4Gb in groups.asm.fasta
I review every step and resulting logs and everything run success.