Closed osilander closed 1 year ago
Hey @osilander
We have presets for 8GB cards and 12GB cards but not 10GB (i.e. NVIDIA GeForce 3080). You can see the batch size presets for 8GB cards here https://github.com/nanoporetech/dorado/blob/master/dorado/utils/cuda_utils.cpp#L137
I will add support but in the meantime, you can specify manually like so:
$ dorado basecaller dna_r10.4.1_e8.2_260bps_sup@v4.0.0 pod5s/ -b 128 > calls.sam
Hey @osilander
We have presets for 8GB cards and 12GB cards but not 10GB (i.e. NVIDIA GeForce 3080). You can see the batch size presets for 8GB cards here https://github.com/nanoporetech/dorado/blob/master/dorado/utils/cuda_utils.cpp#L137 I will add support but in the meantime, you can specify manually like so:
$ dorado basecaller dna_r10.4.1_e8.2_260bps_sup@v4.0.0 pod5s/ -b 128 > calls.sam
That matrix must have taken a decent amount of benchmarking to optimize, thanks to the dorado team for doing this! It is super, super handy. Other issues aside, the few dorado runs I've tried with standard configs have been a breeze for efficiency. Guppy was a lot of work to not waste GPU node time from inefficient VRAM+SM utilization. We already crossed that hurdle, but it is nice to see it is soon to (usually) be a thing of the past :)
For ease of reference: <html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns="http://www.w3.org/TR/REC-html40">
batch sizes for SUP basecalling -- from https://github.com/nanoporetech/dorado/blob/master/dorado/utils/cuda_utils.cpp#L137 VRAM | Batch size 08 GB | 128 12 GB | 192 16 GB | 256 24 GB | 512 32 GB | 640 40 GB | 1024
I'm trying to basecall on Ubuntu 20.04.5 LTS with two NVIDIA GeForce 3080. Basecalling with high accuracy dna_r10.4.1_e8.2_400bps_hac@v4.0.0 works fine. Using super high accuracy dna_r10.4.1_e8.2_400bps_sup@v4.0.0 results in an immediate out of memory core dump.