GAIR-NLP / anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
https://huggingface.co/spaces/ethanchern/Anole
618 stars 33 forks

How's the image understanding performance? #24

Open. MonolithFoundation opened this issue 1 month ago

MonolithFoundation commented 1 month ago

How's the image understanding performance?

JoyBoy-Su commented 1 month ago

Hi, thanks for your interest! We will evaluate Anole's image understanding performance on some benchmarks as soon as possible.