-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.1.dev0
- Platform: Linux-5.15.0-125-generic-x86_64-with-glibc2.31
-…
-
### 🐛 Describe the bug
The CI keeps running out of memory (OOM) for some reason, but the tests work fine locally. I will try a different GPU vendor.
### Reproduce
_No response_
### Versions
na
-
### 🐛 Describe the bug
1. Remove tests that use overly large tensors
2. Merge similar tests together
3. Remove unnecessary tests
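
As an illustration of step 2, similar tests can be folded into one table-driven test. This is only a minimal sketch using the standard-library `unittest` module; `scaled_sum`, `ScaledSumTest`, and `run_suite` are hypothetical names invented for the example and are not taken from the actual test suite.

```python
import unittest


def scaled_sum(values, scale):
    """Hypothetical function under test (stands in for a real kernel)."""
    return sum(v * scale for v in values)


class ScaledSumTest(unittest.TestCase):
    # One table of small inputs replaces several near-identical test methods.
    CASES = [
        ([1, 2, 3], 1, 6),
        ([1, 2, 3], 2, 12),
        ([], 5, 0),
    ]

    def test_scaled_sum(self):
        for values, scale, expected in self.CASES:
            # subTest reports each failing case separately instead of
            # stopping at the first failure.
            with self.subTest(values=values, scale=scale):
                self.assertEqual(scaled_sum(values, scale), expected)


def run_suite():
    """Load and run the merged test case, returning the TestResult."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(ScaledSumTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

Keeping the inputs in one small table also makes it easy to spot and drop cases whose tensors are larger than the CI GPU can hold.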
### Reproduce
_No response_
### Versions
na
-
### 🐛 Describe the bug
When I load the model with `AutoLigerKernelForCausalLM`, I get `ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)`
when load model Apply Model…
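
The error message hints that a tensor passed to a Triton kernel lives on the CPU. One way to check this, assuming that is indeed the cause, is to list which named tensors report a CPU device. The helper `cpu_tensor_names` below is hypothetical and duck-typed: it only relies on each object having a `.device.type` attribute, which PyTorch tensors and parameters provide.

```python
from types import SimpleNamespace


def cpu_tensor_names(named_tensors):
    """Return the names of tensors whose device type is 'cpu'.

    Accepts any iterable of (name, obj) pairs where obj has a
    `.device.type` attribute, e.g. the pairs yielded by a PyTorch
    module's `named_parameters()`.
    """
    return [name for name, t in named_tensors if t.device.type == "cpu"]


# Stand-ins for tensors so the sketch runs without PyTorch or a GPU.
fake_params = [
    ("embed.weight", SimpleNamespace(device=SimpleNamespace(type="cuda"))),
    ("lm_head.weight", SimpleNamespace(device=SimpleNamespace(type="cpu"))),
]
print(cpu_tensor_names(fake_params))
```

With a real model, `cpu_tensor_names(model.named_parameters())` would show which weights were never moved to the GPU; if the list is non-empty, calling `model.to("cuda")` before the forward pass may resolve the error. The input tensors need to be on the same device as well.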
-
I have two datasets, one from human and one from mouse, that I want to integrate. There is a batch effect within each dataset. Could someone tell me how to use liger to integrate these two datasets? Thanks!
-
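From the fine-tuning section of the official documentation: if you want to run QLoRA fine-tuning on an AWQ- or GPTQ-quantized model, you need to quantize the model in advance. For example, you can quantize the original model with `swift export`. Then fine-tune with the following command; you need to pass `--quant_method` to specify the quantization method that was used: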
CUDA_VISIBLE_DEVICES=0 swift sft \
--model_type qwen1half-7b-chat \
…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.8.4.dev0
- Platform: Linux-5.15.0-94-generic-x86_64-with-glibc2.35
- Python…
-
### Problem Description
During [axolotl](https://github.com/axolotl-ai-cloud/axolotl/) runs, I get intermittent GPU hangs:
```
{'loss': 0.4589, 'grad_norm': 1.0493940198290594, 'learning_…
-
### System Info
- Platform: Linux-4.18.0-477.15.1.el8_8.x86_64-x86_64-with-glibc2.28
- Python version: 3.10.15
- PyTorch version: 2.4.1
- CUDA device(s): NVIDIA A40, NVIDIA A40, NVIDIA A40, NVIDIA…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) and didn't find any similar reports.
###…