Open s-kyungyong opened 1 year ago
I got the same error,
INFO:root:Failed to generate my_output4/ranked_0.pdb due to CUDA out of memory. Tried to allocate 7.80 GiB (GPU 0; 31.75 GiB total capacity; 24.66 GiB already allocated; 5.82 GiB free; 24.92 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
my command is omegafold a.fa my_output4 --model 2 --subbatch_size 1 --num_cycle 1
and model 1 works just fine and the length of my sequence is 311. any suggestion? thanks
Edit: OOM message was printed within RecycleEmbedder in my case, however I didn't find this class is using subbatch_size.
Hi!
It looks like the run was killed due to the issues with GPU memory usage when model2 is used. However, the same input sequence runs fine with model 1. Do you have any clues?
Using a better GPU didn't help.
Using --subbatch_size also didn't help.
Thanks!