Hi, I am trying to train the model llama-7b-hf on a single GPU. I tried reducing some parameters, but I don't know whether the new values are any better.
Components of my PC:
Command execution:
Error:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 774.00 MiB (GPU 0; 11.76 GiB total capacity; 10.58 GiB already allocated; 697.94 MiB free; 10.61 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 496499) of binary: /usr/bin/python3
I only recently started taking an interest in AI, so thank you in advance to anyone who can help.
Edit: If there is also a way to train using only the CPU, I am interested in that as well.
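For context on the out-of-memory message in this post, here is a rough back-of-the-envelope check of how much memory the model weights alone would need. It is only a sketch: the parameter count of about 7 billion is my assumption from the model name, the 11.76 GiB figure is taken from the error message, and the estimate ignores gradients, optimizer state, and activations, which all add more on top during training.

```python
# Rough estimate of the memory needed just to hold the model weights.
# Assumption: ~7 billion parameters, guessed from the name "llama-7b-hf".
GIB = 1024 ** 3

N_PARAMS = 7_000_000_000   # assumed parameter count (not measured)
GPU_TOTAL_GIB = 11.76      # "total capacity" reported in the error message


def weights_gib(n_params: int, bytes_per_param: int) -> float:
    """Return the size of the weights in GiB for a given precision."""
    return n_params * bytes_per_param / GIB


if __name__ == "__main__":
    for name, nbytes in [("fp32", 4), ("fp16", 2), ("int8", 1)]:
        need = weights_gib(N_PARAMS, nbytes)
        verdict = "fits" if need < GPU_TOTAL_GIB else "does not fit"
        print(f"{name}: {need:.2f} GiB of weights, {verdict} in {GPU_TOTAL_GIB} GiB")
```

If this estimate is in the right range, half-precision weights alone would already exceed the card's capacity, so lowering a few training parameters may not be enough by itself. Also, the error reports 10.61 GiB reserved against 10.58 GiB allocated, which are nearly equal, so the max_split_size_mb hint about fragmentation probably does not apply here.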