-
I'm tried to convert model to coreML
firs tI tried it in terminal using this code
and generate two encoder and decoder files
```
import torch
from PIL import Image
from torchvision import tran…
-
I am working with [madlad400 ](https://huggingface.co/google/madlad400-3b-mt) which is a encoder decoder model based on T5 architecture. I am able to load it in TensorRT LLM in the bfloat16 type . I w…
-
**Description**
I am using the Sagemaker Triton Inference Server containers to run a MultiModel endpoint. One of the models is a MT5 model. I am trying to optimise for the latency and think I am losi…
-
### System Info
Python version 3.11
- `transformers` version: 4.42.3
- Platform: Windows-10-10.0.22631-SP0
- Python version: 3.11.0
- Huggingface_hub version: 0.23.4
- Safetensors version: 0.4.2…
-
Develop NN model. Use input from medical jargon lookup tables as test data
-
Hi @csukuangfj,
With https://github.com/k2-fsa/sherpa-onnx/pull/992, config for backends are handled as arguments and it is done.
But there is an additional issue with arguments and models, as t…
-
I cannot _sheeprl-eval_ my trained model, since the keys in the world model's state_dict have different names:
Stacktrace
Error executing job with overrides: ['checkpoint_path=/home/drt/Deskto…
-
create an encoder-decoder model:
```Python
def get_encoder(input_shape):
input_tensor = keras.Input(input_shape, dtype='float32')
x1 = keras.layers.Conv2D(8, 3, padding='same')(input_ten…
-
**Describe the bug**
I am trying to run Whisper on an AMD Radeon 780M Graphics using DirectML EP but it is showing the Not Implemented error below.
**To Reproduce**
python -m pip install onnxrunt…
WA225 updated
14 hours ago
-
Hello! I'm trying to load a pre-trained model but I got a lot of missing keys: