LLaVA-VL / LLaVA-NeXT

Apache License 2.0
3k stars 258 forks source link

Does LLaVA-NeXT support 336x336 image inputs, like LLaVA-1.5? #257

Open hskim98 opened 2 months ago

hskim98 commented 2 months ago

Hi there! 😊

First of all, thank you so much for your amazing work on LLaVA-NeXT! I was reading about the performance improvements and how it maintains the minimalist design and data efficiency of LLaVA-1.5, which is fantastic! 🎉

I was wondering, does LLaVA-NeXT also support 336x336 image inputs, similar to LLaVA-1.5?

Thanks again for all your hard work—looking forward to hearing from you! 😄

mylesgoose commented 2 months ago

"image_aspect_ratio": "anyres_max_9", "image_crop_resolution": null, "image_grid_pinpoints": [ [ 384, 384 ], [ 384, 768 ], [ 384, 1152 ], [ 384, 1536 ], [ 384, 1920 ], [ 384, 2304 ], [ 768, 384 ], [ 768, 768 ], [ 768, 1152 ], [ 768, 1536 ], [ 768, 1920 ], [ 768, 2304 ], [ 1152, 384 ], [ 1152, 768 ], [ 1152, 1152 ], [ 1152, 1536 ], [ 1152, 1920 ], [ 1152, 2304 ], [ 1536, 384 ], [ 1536, 768 ], [ 1536, 1152 ], [ 1536, 1536 ], [ 1536, 1920 ], [ 1536, 2304 ], [ 1920, 384 ], [ 1920, 768 ], [ 1920, 1152 ], [ 1920, 1536 ], [ 1920, 1920 ], [ 1920, 2304 ], [ 2304, 384 ], [ 2304, 768 ], [ 2304, 1152 ], [ 2304, 1536 ], [ 2304, 1920 ], [ 2304, 2304 ] ], "image_split_resolution": null,