Closed SHuang-Broad closed 3 years ago
Hello Steve, I suspect that some of the reads are too large to align on the graphics card so they are left for the CPU aligner? No idea really, maybe @tijyojwad can answer this inquiry.
Best regards, Robert
Yeah, that's what I suspect too. If that indeed is the case, maybe it is limited by the memory on GPU?
Hi @SHuang-Broad , that's indeed what's happening. Right now we have some upper limits on the size of sequences per alignment. I'm working on a chance to racon right now where instead of hard coding the upper limit we calculate that based on all the overlaps available. I think this should allow many more of the overlaps to be aligned on the GPU. I'll be submitting a PR for this in a few days, so hopefully that'll speed things up for you. Will ping this PR when it's done.
That's awesome, Joyjit!
Thank you both!
I will leave this open and you can close
this ticket in the PR, if appropriate.
Hi @SHuang-Broad - can you try the new version of racon
with some updates to CUDA alignment integration? the code is now handling the task distribution better, so more alignments should go to the GPU. empirically --cudaaligner-batches 8
gives good results
Hi Robert,
while I'm running racon on my draft asm using GPUs, I observed that there are two
aligning overlaps
steps, the first using GPU, relatively quick, and the second using CPU, taking relatively longer time.Am I setting parameters in a wrong way, or is this expected?
My biggest contig is about 90M, NG50 ~ 11M, LG50 ~70, out of 3000 ~ 4000 (un-scaffolded) contigs on a primate genome.
Thanks, Steve