Closed double-fire-0 closed 1 month ago
Are there any other considerations besides saving GPU memory?
It's from LLaVA.
BTW, the pretrain stage of Bunny-v1.1-Llama-3-8B-V is under zero3.
thx
Are there any other considerations besides saving GPU memory?