Hello, I am very grateful for your detailed open source work. I want to repeat your experiment on llama3-8b, but when I run Taylor's experiment, it appears CUDA out of memory. I use one A100 GPU with 40G memory. could you please provide some solutions.
Hello, I am very grateful for your detailed open source work. I want to repeat your experiment on llama3-8b, but when I run Taylor's experiment, it appears CUDA out of memory. I use one A100 GPU with 40G memory. could you please provide some solutions.