Open msw6468 opened 5 months ago
Thanks for the nice research!
In the case of the Vision-Language Model using the 3M dataset, I am curious about what setting you used for pertaining(which GPUs, how many GPUs) and how much time it took.
Thank you.
Thanks for the nice research!
In the case of the Vision-Language Model using the 3M dataset, I am curious about what setting you used for pertaining(which GPUs, how many GPUs) and how much time it took.
Thank you.