-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
v1.1,加载时报错 Unable to load weights from pytorch checkpoint file for './model/chatglm-6b/pytorc…
-
within the docker (IMAGE: nvidia/cuda:12.1.0-devel-ubuntu22.04)
GPU: A100 40GB
TensorRT-LLM version: 0.10.0
flash-attn 2.5.9.post1
I quantize the phi3 model(phi-3-medium-128k-instrcut/), wi…
-
### Describe the bug
Trying to load local CKPT file using the "from_single_file()" method fails. Works fine with .safetensors file from same repo (Runway ML SD).
### Reproduction
```import to…
-
### System Info
- `transformers` version: 4.45.1
- Platform: Linux-5.15.154+-x86_64-with-glibc2.35
- Python version: 3.10.14
- Huggingface_hub version: 0.25.0
- Safetensors version: 0.4.5
- Acce…
-
Hi,
Does Unsloth support AMD GPUs?
Thank you!
-
### System Info
- `transformers` version: 4.44.2
- Platform: macOS-14.4-arm64-arm-64bit
- Python version: 3.12.2
- Huggingface_hub version: 0.24.5
- Safetensors version: 0.4.3
- Accelerate versi…
-
[instagirl_config.json](https://github.com/user-attachments/files/16643076/instagirl_config.json)
### What happened?
I've been experiencing the same error for the last few months every time I tr…
-
_Downstream PyTorch issue:_
https://github.com/pytorch/pytorch/issues/133780
I'm trying to do attention on a batch-of-zero, because my program uses a static graph and I rely on zero-batching (in…
-
Validation sanity check: 0it [00:00, ?it/s]/home/wyd21/miniconda3/envs/t5/lib/python3.9/site-packages/pytorch_lightning/trainer/data_loading.py:105: UserWarning: The dataloader, val dataloader 0, does…
-
### 🚀 The feature, motivation and pitch
Sparse Causal Flash Attention as implmented [here](https://github.com/epfml/dynamic-sparse-flash-attention) and described in [this paper](https://arxiv.org/abs…