Open leizhu-angus opened 1 year ago
Does the ALBEF model support fine-tuning RefCOCO+ in a fully supervised setting for visual grounding tasks?
Does the ALBEF model support fine-tuning RefCOCO+ in a fully supervised setting for visual grounding tasks?