-
### Question
I've tried multiple models from MTEB dashboard (e.g. `jinaai/jina-embeddings-v3`, `jinaai/jina-embeddings-v2`, `dunzhang/stella_en_400M_v5`), but none of them work.
It's not clear whi…
-
Randomly running into this error on an A100 SXM4 80GB, wonder if you have faced it and whether I am doing something wrong.
Above is my GPU setup
```
File "/root/src/battle_ax/models/multimodal_…
-
### System Info
```
@xenova/transformers": "^2.17.2
@huggingface/transformers:"^3.0.2"
```
### Environment/Platform
- [ ] Website/web-app
- [ ] Browser extension
- [ ] Server-side (e.g…
-
## 論文タイトル(原文まま)
Scalable Diffusion Models with Transformers
## 一言でいうと
従来のU-Netを代替するトランスフォーマーベースの新しい拡散モデル「Diffusion Transformers (DiTs)」を導入し、計算効率を高めつつ画像生成の品質を向上させる手法を提案。
### 論文リンク
[https://www…
-
Recent changes in Huggingface Transformers (https://github.com/huggingface/transformers/commit/cdee5285cade176631f4f2ed3193a0ff57132d8b and https://github.com/huggingface/transformers/commit/4a3f1a686…
-
I tried to transcribe an hour-long audio, but I got this error. I had good results with a two-minute task attempt, so I wanted to try the long audio. Is there any way to fix it? Thank you.
```pytho…
-
Is there a strict requirement for GPUs that support flash_attention? I tried to test on V100, but this GPU does not support flash_attention, resulting in an error with the Runtime Error: No available …
-
### Describe the bug
Unable to use flux fp8 model from `Kijai/flux-fp8` while having transformer_flux.py file in local. I have modified the scripts to remove any import error. I put some print stat…
-
### Feature request
Implementation of:
PatchTSTModel, PatchTSTConfig, & Trainer
Source:
https://github.com/huggingface/transformers/blob/v4.46.3/src/transformers/models/patchtst/modeling_patchts…
-
Following instructions in [HyperPod EKS workshop](https://catalog.workshops.aws/sagemaker-hyperpod-eks/en-US/02-fsdp/02-train), trying to run FSDP EKS example on 2 p5 nodes is failing with the followi…