gcui-art / album-ai

AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery.
http://album.gcui.ai
Apache License 2.0
759 stars 74 forks source link

Can it be developed for local use? ollamam #4

Open bookbindinggithub opened 1 month ago

bookbindinggithub commented 1 month ago

Can it be developed for local use? ollamam

blueeon commented 1 month ago

In the roadmap, need to prioritize getting Vision running locally.

flyingfz commented 1 month ago

这个模型: llava-phi3:3.8b 可以实现图片的向量生成. 可以在 ollama 的模型里找到.

flyingfz commented 1 month ago

记错了, llava-phi3:3.8b 这个模型是生成图片的文本描述(英文), 还需要搭配一个文本转向量的模型。

gandli commented 1 month ago

Localization, this could be a good direction

blueeon commented 1 month ago

@flyingfz Okay, let’s study it.