-
### System Info
- `transformers` version: 4.46.2
- Platform: Linux-5.10.226-214.880.amzn2.x86_64-x86_64-with-glibc2.26
- Python version: 3.10.14
- Huggingface_hub version: 0.26.2
- Safetensors ve…
-
As reported by @ArthurZucker:
> Quick question, I am seeing this in peft: https://github.com/huggingface/peft/blob/f2b6d13f1dbc971c7653aa65e82822ea2d84bb38/src/peft/peft_model.py#L1665 where there …
-
https://www.etched.com/announcing-etched
-
On the website https://www.sbert.net/docs/sentence_transformer/pretrained_models.html, we can see that the function "model.encode" uses the parameter "prompt". However, I didn't see that the "prompt" parameter was me…
-
### Problem Description
I often get these errors from [various applications](https://github.com/pytorch/pytorch/issues/134208); this one is from ComfyUI.
Is scaled_dot_product_attention part of fl…
-
### 🐛 Describe the bug
I'm encountering a ValueError when trying to load the Qwen2-VL model using the AutoLigerKernelForCausalLM class from the Liger Kernel. The error message indicates an unrecogn…
-
### 🚀 The feature, motivation and pitch
Hello
We are currently using a number of different transformer models (plain BERT encoders with attached classification head) on Android. In order to increa…
-
It seems that the code repository’s `hypes_yaml` folder does not provide the configuration file for where2comm.
-
**Describe the bug**
I am using the `train_gpt3_175b_distributed.sh` script to launch training on a single node with 4 A100 80GB GPUs. Training goes well if I use tensor parallel or pipeline parallel,…
-
When using: **Mistral 7b Text Completion - Raw Text training full example.ipynb**
**Last block errors with:**
`Exception in thread Thread-17 (generate):
Traceback (most recent call last):
File…