-
### Description
```shell
branch: v5.1
docker_image: nvidia/pytorch:21.11-py3
gpu: T4
Error:
Traceback (most recent call last):
  File "../examples/pytorch/t5/summarization.py", line 382, in <module>
…
-
---------------------------------------------------------------------------
RuntimeError Traceback (most recent call last)
/usr/local/lib/python3.6/dist-packages/tensorf…
-
### System Info
**Environment info:**
- transformers: 4.19.2
- Platform: Linux elementary OS 6.1 Jólnir
- Python version: 3.8.10
- PyTorch version: 1.12.1+cu113
- Using GPU in script?: No
- U…
-
### Description
```shell
Is it possible to support mT5 acceleration by changing mT5's activation function to ReLU? Is there anything else to pay attention to?
```
### Reproduced Steps
```shel…
-
In your paper (Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction), the classification F1 for en to ar is 44.8 (mT5-large). But in the GitHub readme.md, the clas…
-
### System Info
- `transformers` version: 4.21.3
- Platform: Linux-5.4.0-122-generic-x86_64-with-debian-buster-sid
- Python version: 3.7.12
- Huggingface_hub version: 0.10.0
- PyTorch version (…
-
### System Info
transformers : 4.18.0
torch: 1.12.0
Python 3.7.13
### Who can help?
@patrickvonplaten @patil-suraj
### Information
- [ ] The official example scripts
- [X] My own modified scri…
-
Hello, in https://github.com/IDEA-CCNL/Fengshenbang-LM/blob/main/fengshen/examples/pretrain_t5/pretrain_randeng_t5_large.sh
I found that the following parameter settings are used:
![image](https://user-images.githubusercontent.com/110814689/1988…
-
![image](https://user-images.githubusercontent.com/6379332/197462007-1d2e793d-2744-43ef-82c0-2bdff40a61ea.png)
-
```shell
Unsupported ops: Counter({'broadcast_matmul': 89, 'scalar_div': 4, 'gather': 4, 'elementwise_minimum': 3, 'fill_': 2, 'scalar_logical_less': 2, 'where': 2, 'scalar_logical_greater': 1})
```