-
Thank you for the great model.
How can I get the multimodal embeddings of different inputs, such as an image and its caption, using ImageBind?
If I can get them, how can they be compared to CL…
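
Once an image embedding and a caption embedding are available, one common way to compare them is cosine similarity. The sketch below assumes both vectors come from the same model's joint embedding space (embeddings produced by different models live in different spaces and are not directly comparable). The vectors are made-up placeholders, not real ImageBind output, and `cosine_similarity` is a hypothetical helper written for illustration.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("a zero vector has no direction")
    return dot / (norm_a * norm_b)

# Placeholder vectors standing in for one image embedding and two caption embeddings.
image_emb = [0.2, 0.9, 0.1]
caption_match = [0.25, 0.85, 0.05]  # assumed to describe the image
caption_other = [0.9, 0.1, 0.3]     # assumed to describe something else

score_match = cosine_similarity(image_emb, caption_match)
score_other = cosine_similarity(image_emb, caption_other)
print(score_match, score_other)
```

A higher score means the two vectors point in a more similar direction; ranking captions by this score is the usual way to pick the best match for an image.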
-
I am trying to run the following code, but it raises an error. Please assist!
```
import mlx.core as mx
from mlx_vlm import load, generate
model_path = "google/paligemma-3b-mix-448"
model, pro…
-
When the smart connection plug-in is loaded, it always shows this and does not display correctly. Do I need to install additional software when I download this plug-in?
System: macOS
![image](https:…
-
**Please check the FAQ documentation before raising an issue**
**Describe the bug (__required__)**
Upon running the python script to create a knowledge graph, I keep getting an error which see…
-
I am using the pretrained weights to get embeddings and then calculating the difference between them for ReID of images. However, I am not getting the results I expected or those reported in the paper. Please let me know do I need…
-
I have the encoder image_embeddings in a text file in my root project directory. When I try to read the text file with the encoder embeddings, React Native is able to read the file. But if I pass the…
-
Your pretrained/docbank model doesn't have image embeddings, does it?
Only the contextualized embedding + bbox embedding?
-
I ran into a problem while running inference in step 6:
```
(clip)XXX@junqian-Tower-X:~/projects/python/CLIP-TPU$ python3 embeddings_bmcv.py --img_dir ./datasets/imagenet_val_1k --image_model ./models/BM1684X/clip_image_vitb32_bm1684x_f1…
-
Retrieval Augmented Generation (RAG) is a process by which relevant documentation is selected from a corpus and appended to the prompt. This enables specialised and highly focused context to be adde…
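
The select-then-append flow described above can be sketched in a few lines. This is a minimal illustration, not a production retriever: it scores documents by simple word overlap as a stand-in for a real embedding-based search, and the corpus, `retrieve`, and `build_prompt` are all hypothetical names invented for the example.

```python
def tokenize(text):
    """Lowercase, whitespace-split bag of words (punctuation is left attached)."""
    return set(text.lower().split())

def retrieve(query, corpus, k=1):
    """Return the k documents sharing the most words with the query."""
    query_words = tokenize(query)
    ranked = sorted(
        corpus,
        key=lambda doc: len(query_words & tokenize(doc)),
        reverse=True,
    )
    return ranked[:k]

def build_prompt(query, corpus, k=1):
    """Append the retrieved documents to the user's query."""
    context = "\n".join(retrieve(query, corpus, k))
    return f"{query}\n\nRelevant documentation:\n{context}"

# Made-up corpus for illustration only.
corpus = [
    "The billing API returns invoices as JSON.",
    "Password resets are handled by the auth service.",
    "Deployment uses a blue-green strategy.",
]

prompt = build_prompt("How are password resets handled?", corpus)
print(prompt)
```

In a real system the overlap score would typically be replaced by a similarity search over embeddings, but the shape of the pipeline (rank, select, append) stays the same.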
-
I attempted to set the shape of the encoder input boxes as (4, 10, 4), representing (bs, num_boxes, 2 box corners). However, during the operation:
def _embed_boxes(self, boxes: torch.Tensor) -> tor…
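
Because the snippet is cut off, the exact failing line is unknown. One way to reason about it is pure shape arithmetic. The sketch below assumes the method flattens the boxes into corner pairs with something like `boxes.reshape(-1, 2, 2)`; that call is an assumption, since the truncated code does not show it. `reshaped_shape` is a hypothetical helper that only computes the resulting shape and does not use torch.

```python
from math import prod

def reshaped_shape(shape, new_shape):
    """Compute the concrete shape a reshape would produce (at most one -1 allowed)."""
    if new_shape.count(-1) > 1:
        raise ValueError("only one dimension can be inferred")
    total = prod(shape)
    known = prod(d for d in new_shape if d != -1)
    if -1 in new_shape:
        if known == 0 or total % known != 0:
            raise ValueError(f"cannot reshape {shape} into {new_shape}")
        return tuple(total // known if d == -1 else d for d in new_shape)
    if total != known:
        raise ValueError(f"cannot reshape {shape} into {new_shape}")
    return tuple(new_shape)

# (bs, num_boxes, 4) input from the question.
input_shape = (4, 10, 4)

# ASSUMPTION: the method views each box as two (x, y) corners.
corner_shape = reshaped_shape(input_shape, (-1, 2, 2))
print(corner_shape)
```

Under that assumption, the batch and box dimensions merge into a single leading dimension of 40, so any later step that expects the leading dimension to equal the batch size of 4 would see a mismatch.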