Open todayisYu opened 2 weeks ago
Thanks for reaching out. I think you are right. Ideally, the training code needs about slightly less than 40GB VRAM which can be trained with an A100 40G. You can try to use a smaller batch size. I do not think torch version will solve the issue.
Thanks for reaching out. I think you are right. Ideally, the training code needs about slightly less than 40GB VRAM which can be trained with an A100 40G. You can try to use a smaller batch size. I do not think torch version will solve the issue.
Thanks for replying! Actually, I have 4 2080Ti 11G, Can I have a try? If possible , how can I modify the code?
I am not 100% sure but I think you can give it a try. But you need to use some model/data/pipeline parallelism trick. Use deepseek might also be helpful. You need to try to add these modules to the current code.
Hi,I also have a problem with training TWOSOME in Tomato Salad environment
pygame 2.4.0 (SDL 2.26.4, Python 3.9.20) Hello from the pygame community. https://www.pygame.org/contribute.html You are using the default legacy behaviour of thesh scripts/tomato_salad_ppo_llm.sh
and encountered the following error: