Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs
334
stars
65
forks
source link
Can he implement a similar function, such as uploading a document to a knowledge base containing an image. Then the user uploads an image, which can retrieve the image and know its location, such as indoor navigation, images of each room, and can upload one of the images for path planning and navigation #5
Can he implement a similar function, such as uploading a document to a knowledge base containing an image. Then the user uploads an image, which can retrieve the image and know its location, such as indoor navigation, images of each room, and can upload one of the images for path planning and navigation
Can he implement a similar function, such as uploading a document to a knowledge base containing an image. Then the user uploads an image, which can retrieve the image and know its location, such as indoor navigation, images of each room, and can upload one of the images for path planning and navigation