Open mneedham opened 10 months ago
Thinking about this more - probably the best way would be to find some examples where LLaVA works well and then use LLaVA's output as the ground truth. We can then see how close BAKLLaVA gets.
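A rough sketch of what that comparison could look like, assuming both models are served by a local Ollama instance on its default port and that a crude word-overlap score is good enough for a first pass. The function names (`describe_image`, `closeness`, `compare_models`), the default prompt, and the choice of `difflib` as the similarity measure are all placeholders for illustration, not something decided in this issue.

```python
import base64
import json
import urllib.request
from difflib import SequenceMatcher

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def describe_image(model: str, image_path: str, prompt: str) -> str:
    """Send one image plus a prompt to an Ollama model and return its text reply."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("utf-8")
    payload = json.dumps({
        "model": model,
        "prompt": prompt,
        "images": [image_b64],
        "stream": False,  # ask for a single JSON object instead of a stream
    }).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


def closeness(reference: str, candidate: str) -> float:
    """Crude word-level similarity in [0, 1]; 1.0 means identical word sequences."""
    ref_words = reference.lower().split()
    cand_words = candidate.lower().split()
    return SequenceMatcher(None, ref_words, cand_words).ratio()


def compare_models(image_paths, prompt="Describe this image."):
    """Treat LLaVA's answer as the ground truth and score BakLLaVA against it."""
    scores = {}
    for path in image_paths:
        truth = describe_image("llava", path, prompt)
        answer = describe_image("bakllava", path, prompt)
        scores[path] = closeness(truth, answer)
    return scores
```

Calling `compare_models(["some_image.png"])` (hypothetical file name) would return one score per image. Word overlap will penalise answers that are correct but phrased differently, so this is probably only useful as a quick sanity check.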
We need to see how long the video gets before deciding whether to cover it all in one video.
Compare this with LLaVA - https://ollama.ai/library/bakllava