Luodian / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
https://otter-ntu.github.io/
MIT License
3.54k stars 242 forks source link

HI, I'm evaluating Otter in Coco Image captioning and am getting quite poor performance for 0-shot vs 2-shot or 4-shot. #289

Closed essamsleiman closed 9 months ago

essamsleiman commented 9 months ago

Before you open an issue, please check if a similar issue already exists or has been closed before.

When you open an issue, please be sure to include the following

Thank you for your contributions!