Dmmm1997 / SimVG

[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
https://arxiv.org/abs/2409.17531
MIT License
44 stars 0 forks source link

when to release the code #1

Closed VoyageWang closed 1 week ago

VoyageWang commented 1 month ago

Hi, when will the code be released? The training computational resources you used are a single 3090 GPU or 8 GPUs, which is not clearly stated in your paper. Thanks for your great work!

Dmmm1997 commented 1 month ago

Thank you for your attention to our work. The code is being sorted out.

Pre-training requires 8 RTX3090 to train for about 1 day. A single dataset such as refcoco/+/g only requires one RTX3090 to train for about half a day.