2) Is there a way to use Multiple GPUs for loading Share Captioner?
3) Are there Quantization methods (4bit, 8bit) available for Share Captioner?
I have tried to run Share Captioner but I have got CUDA Out of Memory error on a Rtx 3090:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 172.00 MiB. GPU 0 has a total capacty of 24.00 GiB of which 0 bytes is free. Including non-PyTorch memory, this process has 17179869184.00 GiB memory in use. Of the allocated memory 23.10 GiB is allocated by PyTorch, and 3.24 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
Hi
1) How much VRAM is required for Share Captioner?
2) Is there a way to use Multiple GPUs for loading Share Captioner?
3) Are there Quantization methods (4bit, 8bit) available for Share Captioner?
I have tried to run Share Captioner but I have got CUDA Out of Memory error on a Rtx 3090:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 172.00 MiB. GPU 0 has a total capacty of 24.00 GiB of which 0 bytes is free. Including non-PyTorch memory, this process has 17179869184.00 GiB memory in use. Of the allocated memory 23.10 GiB is allocated by PyTorch, and 3.24 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF