liger Search Results - Githubissues

872 results
for liger

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

hiyouga/LLaMA-Factory #6143

两台机器全参数微调Qwen2.5-14B-Instruct挂起不动

### Reminder - [X] I have read the README and searched the existing issues. ### System Info - `llamafactory` version: 0.9.1.dev0 - Platform: Linux-5.15.0-125-generic-x86_64-with-glibc2.31 -…

zhaoxjmail updated 6 days ago
2
linkedin/Liger-Kernel #335

Broken CI

### 🐛 Describe the bug The CI keeps OOM for some reasons but works fine locally. I will try a different GPU vendor ### Reproduce _No response_ ### Versions na

ByronHsu updated 4 weeks ago
1
linkedin/Liger-Kernel #334

Unit test time is too long

### 🐛 Describe the bug 1. Remove test with too large tensors 2. Merge similar tests together 3. Remove unnecessary tests ### Reproduce _No response_ ### Versions na

ByronHsu updated 4 weeks ago
1
linkedin/Liger-Kernel #268

inference qwen2 model ,The reasoning is garbled and ValueE…

### 🐛 Describe the bug when I load model with AutoLigerKernelForCausalLM ,I get ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?) when load mdoel Apply Model…

Dujianhua1008 updated 1 month ago
1
welch-lab/liger #322

how to deal with batch in cross species comparison?

I have 2 datasets, from human and mouse, to be integrated. There is a batch effect within each dataset. Could some one tell me how to use liger to integrate these two datasets? thanks!

Li-ZhiD updated 1 month ago
4
modelscope/ms-swift #2124

RuntimeError: self and mat2 must have the same dtype, but go…

官方文档里的微调内容：如果想要对awq、gptq量化的模型进行qlora微调，你需要进行提前量化。例如可以对原始模型使用swift export进行量化。然后使用以下命令进行微调，你需要指定--quant_method来指定对应量化的方式： CUDA_VISIBLE_DEVICES=0 swift sft \ --model_type qwen1half-7b-chat \ …

LIUKAI0815 updated 1 month ago
4
hiyouga/LLaMA-Factory #5449

qwen2-vl双卡全量微调OOM

### Reminder - [X] I have read the README and searched the existing issues. ### System Info - `llamafactory` version: 0.8.4.dev0 - Platform: Linux-5.15.0-94-generic-x86_64-with-glibc2.35 - Python…

hitsz-zxw updated 1 month ago
6
ROCm/ROCm #4021

[Issue]: Intermittent GPU Hang HW Exception by GPU on MI300X…

### Problem Description When running [axolotl](https://github.com/axolotl-ai-cloud/axolotl/) runs, I get intermittent GPU hangs: ``` {'loss': 0.4589, 'grad_norm': 1.0493940198290594, 'learning_…

lhl updated 1 day ago
28
huggingface/trl #2382

DPO does not work for FIM task with non-instruct model

### System Info - Platform: Linux-4.18.0-477.15.1.el8_8.x86_64-x86_64-with-glibc2.28 - Python version: 3.10.15 - PyTorch version: 2.4.1 - CUDA device(s): NVIDIA A40, NVIDIA A40, NVIDIA A40, NVIDIA…

AML14 updated 1 week ago
6
axolotl-ai-cloud/axolotl #1991

RuntimeError: CUDA error: an illegal memory access was encou…

### Please check that this issue hasn't been reported before. - [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) didn't find any similar reports. ###…

Malikeh97 updated 2 weeks ago
11

上一页 1...15 16 17 18 19 20 21...88 下一页

872 results for liger

872 results
for liger