Closed — Eronponce closed this issue 1 year ago
Try resetting the runtime. Maybe that would work.
You can try using Google Colab to generate the model, then copy the files to your server or Hugging Face instance afterwards.
Which model are you using? Maybe a smaller model will help. Also look up gradient accumulation; it can reduce the memory required per batch while keeping the same effective batch size.
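To illustrate the idea behind gradient accumulation: splitting one large batch into several micro-batches and summing their gradients gives the same averaged gradient as processing the whole batch at once, but each micro-batch needs far less memory. Here is a toy pure-Python sketch (the per-sample gradient function is a made-up placeholder, not from any library):

```python
def grad_fn(x):
    # Hypothetical per-sample gradient for illustration only.
    return 2 * x

def full_batch_grad(samples):
    # Gradient averaged over the whole batch in one pass (high memory in practice).
    return sum(grad_fn(x) for x in samples) / len(samples)

def accumulated_grad(samples, micro_batch_size):
    # Same average, computed by accumulating over small micro-batches
    # (low memory in practice, since only one micro-batch is live at a time).
    total = 0.0
    for i in range(0, len(samples), micro_batch_size):
        micro = samples[i:i + micro_batch_size]
        total += sum(grad_fn(x) for x in micro)
    return total / len(samples)

samples = [1, 2, 3, 4, 5, 6, 7, 8]
print(full_batch_grad(samples))         # 9.0
print(accumulated_grad(samples, 2))     # 9.0 — identical result, smaller peak memory
```

In a real training loop the framework accumulates tensors rather than floats, but the equivalence is the same, which is why it helps on small GPUs.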
Use a lower batch size in args()
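If you're using the Hugging Face `Trainer` (an assumption; adjust to whatever training setup you actually have), lowering the batch size and enabling gradient accumulation is a config change in `TrainingArguments`, for example:

```python
from transformers import TrainingArguments

# Sketch of memory-saving settings; the output_dir and step counts are placeholders.
args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=1,   # smallest possible micro-batch
    gradient_accumulation_steps=8,   # effective batch size of 1 * 8 = 8
    fp16=True,                       # half precision roughly halves activation memory
)
```

With `per_device_train_batch_size=1` and `gradient_accumulation_steps=8` you get the gradient quality of a batch of 8 at the memory cost of a batch of 1.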
I already tried reducing my batch size to 1 and I'm looking for help on how to run it better. My GPU has 6 GB of dedicated VRAM and it still runs out of memory. Maybe I need to reduce my sample size?