hiyouga/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Apache License 2.0
25.1k stars
3.11k forks
issues
Question about data preprocess
#4570
HackGiter
opened
1 hour ago
0
KTO training requires more than one response (feedback_dataset function)
#4564
tcxia
closed
3 hours ago
4
[Help] Yi-34B + Simpo + Full + Novel writing task, fine-tuning results are unsatisfactory!
#4563
onlyfish79
closed
3 hours ago
0
Out of Memory Error on Sagemaker while training LLava on 93000 images
#4562
Hassaan68
opened
3 hours ago
0
fix torch-npu dependency
#4561
hashstone
opened
4 hours ago
1
Does unsloth only support LoRA? Is pretraining not supported?
#4560
af-74413592
closed
4 hours ago
1
CUDA out of memory | QLORA | Llama 3 70B | 4 * NVIDIA A10G 24 Gb
#4559
russellorv
closed
4 hours ago
5
Poor model results when evaluating with LLaMA Board
#4558
xiao-liya
closed
4 hours ago
0
Get "RuntimeError: 'weight' must be 2-D" Error when finetuning llama3-8b using ZeRO3 and customised dataset
#4557
NeWive
closed
4 hours ago
2
[bug] PiSSA init script raises an error
#4556
letterk
closed
4 hours ago
1
API_PORT=8000 llamafactory-cli api examples/inference/qwen2_vllm.yaml raises an error
#4555
frostjsy
closed
4 hours ago
1
eval_loss keeps rising when fine-tuning llama3-8b; I tried mixing multiple datasets, but it did not help. How can this be solved?
#4554
MemoryOldTime
closed
3 hours ago
0
Does llamafactory-cli support outputting vectors?
#4553
frostjsy
closed
7 hours ago
1
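Help! After training, I loaded the model and asked it questions from the training dataset, but its answers do not match the answers in the dataset. Training took less than 2 minutes, so I suspect it failed. Please help diagnose!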
#4552
muliu
closed
8 hours ago
3
llamafactory-cli api examples/inference/qwen2_vllm.yaml raises an error
#4551
frostjsy
closed
8 hours ago
0
Dataset format for fine-tuning Llama3 on text classification, and the fine-tuned model outputs non-label content
#4550
git3210
closed
7 hours ago
1
[Question] Why are trainable parameters forcibly cast to full precision?
#4549
LaniakeaS
closed
3 hours ago
1
Question about the data format for multi-turn tool calling
#4548
lwj2001
closed
3 hours ago
0
Error when loading the model after LoRA fine-tuning Qwen-14b-chat
#4547
ntz2000
closed
8 hours ago
1
Request: provide a Chinese-language user manual
#4546
countree
closed
8 hours ago
1
Error when deploying freeze-tuned qwen2-57b-instruct with vLLM
#4545
hexixiang
closed
8 hours ago
1
fix docker-compose path
#4544
MengqingCao
closed
13 hours ago
1
how to authenticate to access huggingface
#4543
ares0027
closed
13 hours ago
1
When will fine-tuning of multimodal models such as glm4v be supported?
#4542
yang182
closed
14 hours ago
1
scripts/pissa_init.py to initialize PiSSA for a quantized model.
#4463
Katehuuh
closed
1 day ago
1
After packing data in the SFT stage, do the instructions within the same training sample affect each other?
#4462
Zheng-Jay
closed
1 day ago
0
support flash-attn in Dockerfile
#4461
hzhaoy
opened
1 day ago
1
## feature request ## Support ZeRO3 infinity
#4460
Xingxiangrui
closed
1 day ago
1
training_eval_loss plot is blank with DeepSpeed ZeRO3
#4459
ycjcl868
closed
1 day ago
1
Error when running the PPO example: value should be one of int, float, str, bool, or torch.Tensor
#4458
xudong2019
closed
4 hours ago
1
Question about base models and chat models
#4457
VirgilG72
closed
3 hours ago
0
No example and data files inside the Docker container
#4456
yaya159456
closed
1 day ago
0
llamafactory-cli is missing from the latest code; how do I merge weights?
#4455
Liufeiran123
closed
1 day ago
2
glm4 model does not generate answers after LoRA fine-tuning
#4454
RyanCcc114
closed
3 hours ago
0
Abnormal memory usage when LoRA training qwen2-72B on 8*A800 80G
#4453
999wwx
opened
1 day ago
2
Overfitting always occurs with both LoRA and full-parameter fine-tuning; how can this be solved when the dataset size cannot be increased?
#4452
MemoryOldTime
closed
1 day ago
2
How do I specify pre-split training and validation sets?
#4451
Owen1u
closed
1 day ago
1
Memory Error during tokenization while fine-tuning LLava1.5-7B-Chat on more than 8000 images
#4450
Hassaan68
closed
2 days ago
2
QLoRA fine-tuning Qwen2-57B: a single A6000 uses 40G of GPU memory, but with two A6000s each card also uses 40G. What is the reason?
#4447
PhysicianHOYA
closed
2 days ago
0
Bug Fix: `off` is parsed as `False` in yaml file
#4446
stceum
closed
2 days ago
0
auto-label npu issue
#4445
MengqingCao
closed
2 days ago
0
stop word of template of qwen
#4444
HackGiter
closed
2 days ago
1
Is early stopping supported?
#4443
laozhai507
closed
2 days ago
0
Can this project be used for continued pre-training of glm4?
#4442
gyh123wqe
closed
2 days ago
1
Error when training glm4: RuntimeError when using flash attention with 8-bit quantization; the same parameters work fine for llama3
#4441
fst813
closed
2 days ago
1
How to pre-train Llava1.5 from vicuna1.5?
#4440
yuzhms
closed
2 days ago
1
Is a template needed when LoRA fine-tuning the Qwen2 base model in pre-training mode?
#4439
Shame-fight
closed
2 days ago
1
Gradient explosion still occurs when fine-tuning glm4-9B on a single node with multiple GPUs, despite setting max_grad_norm=1
#4438
KATHERINE0511
closed
2 days ago
1
Is FSDP + fp16 full-parameter fine-tuning supported?
#4437
zhangfan-algo
closed
2 days ago
1
Low MMLU of llama2
#4436
Zkli-hub
closed
2 days ago
1