-
As state in the latest paper, ' Trained on the LA task, the model exhibits exceptional scene comprehension, reasoning abilities, and multi-round conversation capabilities.'
I am very interested in …
-
So I am trying to Docker containerize sentence-transformers for CPU inference and I cannot figure this out. I have a Dockerfile which deploys perfectly using HuggingFace model checkpoints but for what…
-
**Description**
Our Electra-based model takes about 540 ms per inference on CPU with ONNX Runtime (via the mcr.microsoft.com/azureml/onnxruntime:v1.4.0 container). The same model run through Triton r…
-
### Problem Statement
Nowadays remote model servers like AWS SageMaker, BedRock, or OpenAI, Cohere, etc all support batch predict APIs, which allow users to send large amount of synchronous request…
-
Hi team
Great work on adding onnx and openvino backends, i was trying to test out the onnx backend models and just use a simple import
``` python
from sentence_transformers import SentenceTrans…
-
![image](https://github.com/AIAnytime/ChatCSV-Streamlit-App/assets/53250064/43bbbd7d-104d-4c9f-83fa-96622737984d)
-
Based on [this code](https://github.com/UKPLab/sentence-transformers/blob/master/sentence_transformers/cross_encoder/CrossEncoder.py#L68-L72) it seems that the intention is to support importing a `def…
-
[X] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I have a locally hosted LLM which I am intending to use as a jud…
-
**Describe the bug**
I installed cuml and found it throws error in running:
**Steps/Code to reproduce bug**
```
import os
import pandas as pd
path_dataset = 'Dataset'
df_all = pd.read_json(…
-
## Situation
Apple Silicon chips (M1 & M2) are based on the ARM64 (aka [AArch64](https://apple.stackexchange.com/questions/451238/is-m1-chip-aarch64-or-amd64), not to be confused with AMD64). There…