Closed ymcki closed 1 year ago
Hi @ymcki! It's possible that your basecalling processes are bottlenecked by IO depending on where the data is coming from. There is also an overhead for modbase calling. There are some ongoing discussions about throughput over on the Dorado repository that you may find useful: https://github.com/nanoporetech/dorado/issues?q=performance.
Thanks for your reply. I added the following to nextflow.config
process { withLabel:gpu { maxForks = 1 } }
and then increase basecaller_chunk_size and batch size to dorado until it fills up my VRAM. Now it is running at around 70C and about 30% faster.
What happened?
I have a A100x4 DGX box at work. I am running workflow 1.5.2 passing "-b 256" for fixed batch size to dorado and set basecaller_chunk_size to a value that is one fourth of the number of fast5/pod5. I found that while the GPU are running at 100%. Their temperature is only around 51C to 57C. The power draw is only 194W to 252W.
I was expecting that if a GPU is fully utilized, the temperature should be around 70-80C and power draw should be very close to 300W. Am I under-utilizing my GPU? Is there anything I can do to make it run faster? Thanks a lot in advance.
Operating System
ubuntu 20.04
Workflow Execution
Command line
Workflow Execution - EPI2ME Labs Versions
No response
Workflow Execution - CLI Execution Profile
None
Workflow Version
1.5.2
Relevant log output