-
System information
Linux 20.04
pip Tensorflow==2.12.0
using tranformers WhisperForConditionalgeneration
I'm trying to convert from TF to tflite and quantized to int8 Whisper, using the whisper …
-
I want to add a easy to medium question on NLP category related to Transformers positional encoding upon the input embeddings. The question will be done using only `tensorflow as tf, tf.cast(), tf.con…
-
Hello!你们有在EVA-CLIP上验证过论文中的方法吗,根据你们论文中提供的思路(在tranformer的最后一个block 1. 去掉残差 2.attn计算改为qq 3.去掉FFN)在EVA-CLIP上进行可视化attn,看起来噪声还是比较大的
-
Hi!
Let's bring the documentation to all the Italian-speaking community :)
Who would want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/transformers/blob/m…
-
### What happened?
I hope I'm doing something wrong than there is a problem with Kustomize
I'm using this file
```yaml
apiVersion: builtin
kind: PatchTransformer
metadata:
name: example
…
-
Hi, I am a big fan of MeZO. In your paper, you mentioned that gradient checkpointing was not used. However, the following code in trainer.py seems to enable it. I am a bit confused about whether the 4…
-
### Describe the issue
When I quantize the SAM model by **onnxruntime.quantization.quantize_static interface**, the program crashed.
In wondows11 pycharm, got this message: Process finished with e…
-
### System Info
tranformers 4.47.0, python 3.11
### Who can help?
@ArthurZucker (I think)
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An off…
-
Hello.
I downloaded AI Playground v1.22.1 for desktop GPUs which has a built-in "LLM picker" but unfortunately the dGPU version does not provide Mistral model- as mentioned in the release notes (it h…
-
Carnegie Melon Architecture