chhylp123 / hifiasm

Hifiasm: a haplotype-resolved assembler for accurate Hifi reads
MIT License
528 stars 86 forks source link

The kmer graph distribution is strange #232

Open pzx6 opened 2 years ago

pzx6 commented 2 years ago

Hello,

This is similar to issue#93 but for me it is unsolved yet. I am running hifiasm v 0.16.1-r375 to assemble ~1Gb genome. My hifi fasta file size is about 50Gb . All files are output normally, but The test.p_ctg.gfa size (2.3Gb)is much larger than estimated, and the contig (~300kb) is also much shorter than that obtained by ONT(~800kb),Genomic survey revealed a heterozygosity of 0.56%, repetitive sequences ~60% My command hifiasm --primary -t 30 -o test test.fa.gz 2>test.log

The log is attached. test.log

76b671383040a519a0e15bd40fce8ec

Any idea what's wrong there?

lh3 commented 2 years ago

This is probably not HiFi data.

pzx6 commented 2 years ago

This is probably not HiFi data. Part of the file sendpix1

chhylp123 commented 2 years ago

As Heng said, it might be not HiFi reads or low-quality HiFi reads. There are several examples (see: https://hifiasm.readthedocs.io/en/latest/faq.html#why-does-hifiasm-stuck-or-crash).