Closed mvolar closed 4 months ago
@mvolar sorry for the late reply. Something might be wrong with confusum_ont_20k_filt.fasta
since hifiasm cannot see base pairs. In addition, by looking at the k-mer plot, the HiFi reads also look not good. It would be better to double check both of them.
Yeah it was the fasta file, franky I don't know what happened to it, I have redownloaded it and everything worked out fine, producing a decent N50 in the assembly. Although UL integration joined 2 chromosomal arms over the centromere in a single conting, producing a chromosome 90mb in size (our whole genome is cca 300mb) but everything else worked fine.
That's great!
Hello,
using hifiasm:
I get the attached log. Previously we have managed to run hifiasm on multiple PacBio only data, for similar genome sizes, for this species we have decided to use UL integration as we have the data. However the assembly fails at the UL integration step.
The worrysome part is the end of logging, where it looks like that the UL reads have not even been read into the assembly process, i.e. it finds the correct number of reads (~270k, but no bases):
However, the reads appear to be normal FASTA reads, and the output of
seqtk comp
is here:Logging of the assembly process:
logging.log