mbzuai-oryx / GeoChat

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
https://mbzuai-oryx.github.io/GeoChat
356 stars 23 forks source link

【GPU Memory】 #6

Closed Luo-Z13 closed 4 months ago

Luo-Z13 commented 6 months ago

Hello, I'm wondering about the minimum GPU memory required for training. Could you provide some information on this?

KjAeRsTuIsK commented 6 months ago

Hi @Luo-Z13 , thank you for your interest. We trained the model on 4 A100 40 GB gpus. You can train on one A100 80GB or on a single 40 GB A100 by using the quantised models,in 4 or 8 bit.

vvuonghn commented 5 months ago

How long your model training?

KjAeRsTuIsK commented 4 months ago

Hi @vvuonghn, we finetuned the model for around 10 hrs for the complete dataset, and further fine-tuned for 4-5 hours on the grounding part of the dataset. Please let me know if you have any further queries.