-
### 🚀 The feature, motivation and pitch
Hi vLLM team,
As you already know, T5 is a perfect model and as mentioned in #7366, the project intends to add support for T5. I want to help the cause and co…
-
Unique ops in T5
- Eltwise
- multiply
- add
- sqrt
- subtract
- recip
- exp
- Reduce Avg
- Reduce Max
- Reduce Sum
- Reshape
- Matmul
- hslice
- hstack
- transpose
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [ ] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
Hello everyone,
First off, a big thanks to city96 for the awesome work they've been contributing to the community. It's been incredibly helpful!
Here are my system specs:
Processor: Intel i5-13…
-
I tried to load a T5 model but it seems not supported.
```
---------------------------------------------------------------------------
NotImplementedError Traceback (most re…
-
### System Info
Linux gaudi2-wsf-test 5.15.0-92-generic #102-Ubuntu SMP Wed Jan 10 09:33:48 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux
docker image: ghcr.io/huggingface/tgi-gaudi:2.0.0
###…
-
### System Info
L4 GPU (AWS G6.12xl) with TensorRTLLM 0.11.0, running with Tritonbackends
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified …
-
how to save "model.safertensor" which is download from huggingface T5-base and where should I put? Should I need to rename it as "t5-base. safertensor"?
another question
FileNotFoundError: [Errno…
-
I tried to visualize the attention maps for the T5 model but have encountered issues while getting the plots.
I would like to emphasize few points:
- I have used `model.generate` because I don't …
-
Hello!
Are there any plans on implementing quantized-t5 models on CUDA devices?
I'm looking for a couple of days to find the solution or implement a CUDA support for https://github.com/huggingface/c…