hiyouga LLaMA-Factory issues

hiyouga / LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

https://arxiv.org/abs/2403.13372

Apache License 2.0

31.67k stars 3.9k forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

[Draft] Add AutoRound support

#5486 wenhuach21 opened 1 week ago
2
启动 webui失败

#5485 ClementeGao opened 1 week ago
0
请问DPO训练的时候有什么注意事项吗？我训练出来效果很差。

#5484 zlh-source opened 1 week ago
4
fix: 修复function call数据集如果 function_call 值的为不合法json，异常提示且中断训练。

#5483 whybeyoung closed 1 week ago
0
Qlora 报错

#5482 dayuyang1999 closed 1 week ago
1
如何加快微调qwen2-vl-7b后合并的模型在视频上的推理速度？

#5481 J0eky closed 1 week ago
1
fix ppo_freeze mat1 mat2 should have the same dtype

#5480 ex-yanminmin001 opened 1 week ago
4
qwen2vl-sft后如何将adapter_model.safetensors和模型原始参数合并使用

#5479 YajieW99 closed 1 week ago
1
Can we set default_system in yaml file when training?

#5478 Huarong closed 1 week ago
1
qwen2vl训练需要修改position_ids问题吗

#5477 sunzjz closed 1 week ago
2
qwen2-1.5微调训练后tokenizer_config.json中的chat_template值被改了

#5476 czhcc closed 1 week ago
2
Fix phi-3-small issues

#5475 menibrief opened 1 week ago
1
训练时template设为empty时，label开头会加上<|EOT|>，之前的版本好像不会这样

#5474 haoranjun opened 1 week ago
0
Support Mistral format tools

#5473 AlongWY opened 1 week ago
0
只全参数微调Qwen2-VL-7B-Instruct的visual.merger部分，冻结其他模型参数，训练过程报错

#5472 wjx-sudo opened 1 week ago
5
多卡微调时报错

#5471 Maydaytyh closed 1 week ago
2
如何自己编写代码加载合并后的模型推理视频？

#5469 J0eky closed 1 week ago
1
请问支持多图Qwen2-VL-7B-Instruct微调吗？数据格式有示例嚒？

#5468 WorldHellooo closed 1 week ago
6
如何自动保存checkpoint?

#5467 dayuyang1999 closed 1 week ago
2
webui启动之后框内元素无法渲染

#5466 DSW2001 closed 1 week ago
1
sft do_predict, 生成的json 文件的 label 都是空

#5465 dayuyang1999 opened 1 week ago
0
请问SFT之后的模型在推理的时候，是否可以返回多个response？

#5464 zlh-source closed 1 week ago
1
依赖项安装不了，cuda已安装

#5463 DSW2001 closed 1 week ago
2
qwen2_vl模型训练异常

#5462 will-wiki opened 1 week ago
2
AttributeError: 'Qwen2Attention' object has no attribute 'max_position_embeddings'

#5461 chengchengpei opened 1 week ago
1
Tips for implementing LlaMa-Factory for new Hardwares

#5460 EtashGuha opened 1 week ago
0
Do you support for full parameters pre-training?

#5459 lingchensanwen closed 1 week ago
1
Flatting Packing / maybe fix #5443 and #5426

#5458 AlongWY opened 1 week ago
8
no such a file or directory of data

#5457 Esmail-ibraheem opened 1 week ago
0
max pixels argument

#5456 sharonsalabiglossai opened 1 week ago
1
"Cannot find valid samples" when running DPO on llama3-8b

#5455 zky-kf closed 1 week ago
3
多卡制定HF_DATASETS_CACHE会报错

#5454 Fu-Dayuan closed 1 week ago
2
ValueError: Template qwen2 does not exist.

#5453 Oyounger closed 1 week ago
1
Correctly pass gen_kwarg to eval during model runs

#5451 aliencaocao opened 1 week ago
0
多机多卡运行报错

#5450 hecheng64 opened 1 week ago
0
qwen2-vl双卡全量微调OOM

#5449 hitsz-zxw closed 1 week ago
4
对微调后的GLM-4-9B-Chat运行examples/train_lora/llama3_lora_predict.yaml出错

#5447 Twilightsh opened 2 weeks ago
1
设置随机数种子后，相同数据集和配置的每次训练loss还是不一样

#5446 andy7002 closed 2 weeks ago
2
qizhen

#5445 A-magic closed 2 weeks ago
0
model.generate的参数在yaml中设定无效，我设了do_sample: false，使用profiler查看实际还是true 此问题只在训练中途的eval发生，训练结束的最后一次eval正常

#5444 aliencaocao opened 2 weeks ago
0
Running tokenizer on dataset 速度逐渐变慢

#5443 xuyue1112 opened 2 weeks ago
1
bitsandbytes qlora微调模型推理

#5442 oulin1031esti opened 2 weeks ago
0
help on understanding the implementation of FSDP.

#5441 jq-wei opened 2 weeks ago
0
如何在使用 openai 风格部署时，使用 beam search

#5440 cat-knight opened 2 weeks ago
0
Llama-factory使用错误

#5439 lifelsl closed 1 week ago
0
Add qwen_vl to liger kernel supported list

#5438 aliencaocao closed 2 weeks ago
0
请问使用qlora微调后生成的模型中哪里体现了量化的配置参数

#5437 yangxue-1 closed 2 weeks ago
1
微调后词表长度不一致怎么办

#5436 topology1 opened 2 weeks ago
0
Gemma 2 + unsloth + fa2 full SFT RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn

#5435 hengdos opened 2 weeks ago
0
请问，llamafactory现在支持在昇腾910上进行模型评估嘛？

#5434 yiyayieryo opened 2 weeks ago
3

Previous Next