Open hskim98 opened 2 months ago
"image_aspect_ratio": "anyres_max_9", "image_crop_resolution": null, "image_grid_pinpoints": [ [ 384, 384 ], [ 384, 768 ], [ 384, 1152 ], [ 384, 1536 ], [ 384, 1920 ], [ 384, 2304 ], [ 768, 384 ], [ 768, 768 ], [ 768, 1152 ], [ 768, 1536 ], [ 768, 1920 ], [ 768, 2304 ], [ 1152, 384 ], [ 1152, 768 ], [ 1152, 1152 ], [ 1152, 1536 ], [ 1152, 1920 ], [ 1152, 2304 ], [ 1536, 384 ], [ 1536, 768 ], [ 1536, 1152 ], [ 1536, 1536 ], [ 1536, 1920 ], [ 1536, 2304 ], [ 1920, 384 ], [ 1920, 768 ], [ 1920, 1152 ], [ 1920, 1536 ], [ 1920, 1920 ], [ 1920, 2304 ], [ 2304, 384 ], [ 2304, 768 ], [ 2304, 1152 ], [ 2304, 1536 ], [ 2304, 1920 ], [ 2304, 2304 ] ], "image_split_resolution": null,
Hi there! 😊
First of all, thank you so much for your amazing work on LLaVA-NeXT! I was reading about the performance improvements and how it maintains the minimalist design and data efficiency of LLaVA-1.5, which is fantastic! 🎉
I was wondering, does LLaVA-NeXT also support 336x336 image inputs, similar to LLaVA-1.5?
Thanks again for all your hard work—looking forward to hearing from you! 😄