-
## Description
**I'm trying to convert a mt5 model to TensorRT. I adapted the T5 demo notebook of the `main` branch, however, the outputs of the TensorRT model are not what they should be. (The ori…
-
Congratulations for the great work!
Is there a plan to release the 6 mT5_SEAHORSE models?
-
**Describe the bug**
Fine-tuning mt5 model, and trying to save the fine-tuned model is throwing a ValueError: You are trying to save a non contiguous tensor:
**To Reproduce**
Attaching a notebo…
-
Hi,
First of all, great work. I am big proponent of FLan-t5 and use it in my projects. For multilingual, mT5 and bigscience/mt0 models provide a good baseline and are truly multilingual. Does Flash…
-
Hi! Thank you for your outstanding work!
I have been working on improving the LangBridge approach, and I noticed your paper referenced it. As you discussed, LangBridge uses soft prompts generated b…
-
1.这个最好也放在项目目录比较好吧毕竟不一定所有人都是所有模型扔c
2.英文的因为报错了我没法试用,日文这个transformer模型是走cpu做推理的
3.这个模型我之前也用过,毕竟是haggingface上唯一一个jp2zh的,但这效果嘛。。。哎。。
你自己看吧。。。
测试用的faster whisper可能听写也有一些错误
我推荐那个什么什么sakura的模型
https://h…
-
Can you tell me how to use other models instead of t5-xxl? (e.g. mt5-base)
-
### Feature request
I am trying to train off-line RL using decision transformer, convert to .onnx.
```
from pathlib import Path
from transformers.onnx import FeaturesManager
feature = "seq…
-
### Your question
Say if I use checkpoint loader load a checkpoint which contains diffusion/clip/vae. And I port only the clip output to following node. Will the unused diffusion and vae part be lo…
-
Training using google/mt5-base as the base model with fp16 and the triplet loss on all-nli data (following the [ example with trainer](https://www.sbert.net/docs/sentence_transformer/training_overview…