Luo-Z13 closed this issue 4 months ago
Hi @Luo-Z13, thank you for your interest. We trained the model on 4 A100 40 GB GPUs. You can also train on a single 80 GB A100, or on a single 40 GB A100 by using the quantised models in 4-bit or 8-bit.
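For a rough sense of why 4-bit or 8-bit quantisation helps on a smaller GPU, here is a back-of-the-envelope sketch that estimates the memory needed for the model weights alone at different precisions. The parameter count (`NUM_PARAMS`) is a hypothetical placeholder, since the thread does not state the model size, and the helper `weight_memory_gib` is made up for illustration. The estimate ignores activations, gradients, and optimizer state, so real training needs noticeably more than these figures.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for the weights alone, in GiB."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Hypothetical parameter count (assumption; not stated in this thread).
NUM_PARAMS = 7e9

for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: {weight_memory_gib(NUM_PARAMS, bits):.1f} GiB for weights")
```

Halving the bits per weight halves the weight footprint, which is what frees up room for activations and optimizer state on a single 40 GB card.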
How long did your model training take?
Hi @vvuonghn, we fine-tuned the model for around 10 hours on the complete dataset, and then fine-tuned it further for 4-5 hours on the grounding part of the dataset. Please let me know if you have any further queries.
Hello, I'm wondering about the minimum GPU memory required for training. Could you provide some information on this?