-
## Steps to reproduce the bug
```python
from datasets import load_dataset
dataset = load_dataset("sentence-transformers/parallel-sentences")
```
raises this:
```
/home/phmay/miniconda3/envs…
```
-
Hello @tomaarsen, I'm a student who loves using the sentence-transformers library.
While looking at the code, I thought that deduplication could be integrated in CachedGISTEmbedLoss (or just GISTEmbe…
-
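I downloaded the embedding model and the LLM locally and updated their paths and settings in model_config, but initializing the knowledge base raises an error.
The sentence_transformers package is also installed, yet the error persists.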
![20241114153229](https://github.com/user-attachments/assets/cf993e8c-54ac-40b4-b9ee-0b65214dae8…
-
Since OpenSearch 2.13, [**fixed token length algorithm**](https://opensearch.org/docs/latest/ingest-pipelines/processors/text-chunking/#fixed-token-length-algorithm) is available in text chunking proc…
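For readers unfamiliar with the idea, fixed token length chunking can be sketched roughly as follows. This is only an illustration, not OpenSearch's implementation: the function name `chunk_fixed_token_length` is hypothetical, whitespace splitting stands in for a real tokenizer, and the parameter names `token_limit` and `overlap_rate` are assumed here to mirror the processor's options.

```python
def chunk_fixed_token_length(text, token_limit=384, overlap_rate=0.0):
    """Split text into chunks of at most `token_limit` tokens.

    Hypothetical helper: whitespace tokens stand in for a real tokenizer.
    Consecutive chunks share int(token_limit * overlap_rate) tokens.
    """
    if token_limit <= 0:
        raise ValueError("token_limit must be positive")
    if not 0.0 <= overlap_rate <= 0.5:
        raise ValueError("overlap_rate must be between 0 and 0.5")

    tokens = text.split()
    overlap = int(token_limit * overlap_rate)
    step = token_limit - overlap  # always >= 1 given the checks above

    chunks = []
    start = 0
    while start < len(tokens):
        end = start + token_limit
        chunks.append(" ".join(tokens[start:end]))
        if end >= len(tokens):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks


chunks = chunk_fixed_token_length("a b c d e f g h i j", token_limit=4, overlap_rate=0.5)
```

With a limit of 4 tokens and an overlap rate of 0.5, each chunk repeats the last two tokens of the previous one.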
-
I installed the sentence-transformers package without any apparent problem, but when I run `from sentence_transformers import SentenceTransformer` I get the following error:
Modul…
-
### Feature request
The [Sentence Transformers](https://sbert.net/docs/pretrained_models.html) based mpnet models are pretty popular for fast and cheap embeddings. It would be really helpful to supp…
-
Hi SBert Team,
Have you released any models (encoder and decoder) that were pretrained or extensively fine-tuned using the TSDAE loss? I’m particularly interested in the decoder weights from the `D…
-
## 🐛 Bug
I would expect BERTScore to have the following properties:
1) Given a single list of sentences, with all pairs compared as preds and targets, BERTScore should be maximal when the same se…
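To make the first expectation concrete, here is a minimal sketch of the greedy-matching F1 that underlies BERTScore, computed on hand-made toy vectors rather than real BERT embeddings. It does not reproduce any library's behaviour (there is no IDF weighting and no baseline rescaling); the helper names `cosine` and `greedy_f1` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def greedy_f1(cand, ref):
    """Greedy-matching F1 over token vectors (toy stand-in for BERTScore)."""
    # Recall: each reference token is matched to its most similar candidate token.
    recall = sum(max(cosine(r, c) for c in cand) for r in ref) / len(ref)
    # Precision: each candidate token is matched to its most similar reference token.
    precision = sum(max(cosine(c, r) for r in ref) for c in cand) / len(cand)
    return 2 * precision * recall / (precision + recall)


# Toy "sentences": each is a list of 2-d token vectors (made-up values).
sent_a = [[1.0, 0.0], [0.0, 1.0]]
sent_b = [[1.0, 1.0], [1.0, 0.0]]

same = greedy_f1(sent_a, sent_a)
different = greedy_f1(sent_a, sent_b)
```

Under these assumptions, a sentence compared with itself scores higher than the same sentence compared with a different one, which is the behaviour the first property asks for.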
-
### System Info
- `transformers` version: 4.45.2
- Platform: Linux-5.14.0-427.37.1.el9_4.x86_64-x86_64-with-glibc2.34
- Python version: 3.9.19
- Huggingface_hub version: 0.23.5
- Safetensors ve…
-
Currently I need to load a tokenizer from HuggingFace and use it simply to encode and decode sentences. While doing that from the Transformers.jl interface is already awkward (I had to go `tok = Tra…