-
### Bug Description
I'm following this notebook demo: https://github.com/run-llama/llamacloud-demo/blob/main/examples/report_generation/report_generation.ipynb
But I got a validation error from Llama…
-
**Is your feature request related to a problem? Please describe.**
I'm frustrated when I can't use multimodal models like "gpt-4-vision-preview" in Cheshire-cat-ai to process and retrieve information…
-
### Project Name
VidSage
### Description
# VidSage: Video Insights using Graph RAG
https://www.youtube.com/watch?v=IUSCWtB9jWk
VidSage focuses on processing video data, storing it in Azur…
-
RAG is Retrieval Augmented Generation. For example, if I pass a picture, will it find a similar one?
-
This issue tracks efforts to parse PDFs, along with any articles/documents relating to this.
Currently 'marker' is used: https://github.com/VikParuchuri/marker
This requires a separate venv and I have do…
-
## List
- tutorials
- [ ] #4 - @seochan99
- [ ] #5 - @seochan99
- [ ] #6 - @seochan99
- [ ] #17 - @bananana0118
- [ ] graph.mdx
- [ ] index.mdx
- [ ] llm_chain.mdx
- [ ]…
-
Currently prompt2model is limited to text-input, text-output tasks. The underlying framework can certainly handle different modalities, and it would be great to see prompt2model be able to handle diffe…
-
Hi, I would like to train this model on some of **my own 3D models** for a **multi-modal 3D shape retrieval task**. What do I need to do with the original 3D data to provide the …
-
#### Introduction
Vector databases have gained significant importance due to the rise of AI, machine learning, and deep learning applications. These databases store high-dimensional vectors repre…
-
Hi! First of all, thank you for your work; it has been quite easy and performant to use so far.
I am currently confused looking at the forward() method of your visualization: https://github.com/salesf…