-
### 🐛 Describe the bug
When calling sdpa, I get the error message `RuntimeError: The size of tensor a (256) must match the size of tensor b (2) at non-singleton dimension 2` -- there are no parameter…
-
I am trying to follow your code for making custom longformer for XLM models (typically XLM-Roberta), however, I get NaN values as soon as I start training my models for a downstream classification. He…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
## Environment info
- `transformers` version: 4.11.3
- Platform: Linux-4.19.128-microsoft-standard-x86_64-with-glibc2.2.5
- Python version: 3.8.12
- PyTorch version (GPU?): 1.9.1+cu102 (False)…
-
### Issue Type
Bug
### Source
source
### Keras Version
3.3.3
### Custom Code
Yes
### OS Platform and Distribution
Ubuntu 20.04.6 LTS
### Python version
3.10
### GPU model and memory
_No r…
-
I've got checkpoints from HuggingFace and put them under the correct folders, but errors showed that tokenizer for "xlm-roberta-large" couldn't be loaded. Any other essential model files needed here f…
-
在Reranker文件下的README中您指出了微调BCERanker这样的多语言模型的时候,需要使用XLMroberta的配置文件,那BCEmbedding也是同样是多语言的,是否也应该使用XLMroberta配置呢?
-
**Describe the bug**
When evaluating during training using the ClassificationModel. model during .train gets stuck (the bar remains empty) during the validation set predictions, with no progress (eve…
-
# Bug Report
## Description
Failure to download the reranker model from the UI, UI hangs and healthcheck fails. Need to rebuild the container every time.
```
"GET /ws/socket.io/?EIO=4&transport=…
-
### 请提出你的问题
# 我查找了api文档,里面只有载入XLM的方法,请问是现在暂时没有实现XLMRobertaModel模型吗?
如果是和XLM库融合在一起了,那么我在蒸馏学习的时候是否能使用这样的代码进行实现呢
```
from paddlenlp.transformers import XLMModel, XLMTokenizer
teacher_tokenizer = X…