Image Captioning Agent using Mistral 7B
12
stars
0
forks
source link
Multimodal Image Captioning Agent using Mistral 7B
- We use quantized Mistral-7B Instruct
- Salesforce BLIP for image captioning using HuggingFace transformers
- Langchain for building custom tools and agent