Closed kghandour closed 7 months ago
A user should have the ability to select a local photo, and insert a text prompt, and ask the multimodal model about it.
This would be spectacular. bakllava is a great little model.
A user should have the ability to select a local photo, and insert a text prompt, and ask the multimodal model about it.