ymcui Chinese-LLaMA-Alpaca issues


Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment

https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki

Apache License 2.0

18.23k stars, 1.86k forks


Data padding issue when fine-tuning the model

#768 xzqxnet0990 closed 1 year ago
4
v5.0: Release Pro model series and Plus-33B models

#766 ymcui closed 1 year ago
0
How should the parameters be set when the instruction fine-tuning data is very long?

#764 TheLolita closed 1 year ago
2
Can API mode provide streaming responses?

#762 guyuexue closed 1 year ago
1
Inference keeps raising errors

#761 DRZJ1 closed 1 year ago
3
Update inference_hf.py for repetition_penalty

#760 ymcui closed 1 year ago
0
Error during training: Unable to concatenate an empty list of datasets.main()

#759 DRZJ1 closed 1 year ago
6
OPEN_AI API calls are very slow

#758 Cliffe97 closed 1 year ago
2
About the Plus version of the 33B model

#757 nuoma closed 1 year ago
2
Adding load_in_8bit to inference_hf.py

#756 airaria closed 1 year ago
0
Adding --load_in_8bit option to inference_hf.py

#755 airaria closed 1 year ago
0
Error when fine-tuning Alpaca-plus-7b

#754 TCHSDUFH closed 1 year ago
2
LoRA weight loading issue

#751 Yuang-Deng closed 1 year ago
2
Chinese-Alpaca-Plus-7B pre-training error

#750 mazhai closed 1 year ago
2
Why does build_dataset.py combine source = "instruction + input + output"? This leads to immediate overfitting.

#749 thusinh1969 closed 1 year ago
3
The tokenizer vocab size in chinese_llama_plus_lora_7b is incorrect.

#748 yeli7068 closed 1 year ago
2
Missing-file message when running inference_hf.py

#747 XXXG00W0 closed 1 year ago
12
Error when running ingest.py

#746 handsomexiaoyi closed 1 year ago
2
About training models of other parameter sizes

#744 wuhuanon closed 1 year ago
2
Add patches for memory_efficient_attention and NTK scaling

#743 airaria closed 1 year ago
1
run_pt incremental pre-training fails when restarted after an interruption

#742 smartparrot closed 1 year ago
1
Chinese-Alpaca-Plus-7B gives unsatisfactory answers after pre-training

#739 zhaomeng0113 closed 1 year ago
7
OSError: ziqingyang/chinese-llama-plus-lora-7b does not appear to have a file named config.json.

#738 tanshuai closed 1 year ago
2
Vocabulary size mismatch when loading the model after pre-training and merging the LoRA

#737 Double-bear closed 1 year ago
2
Inference with Transformers: the latest Transformers no longer has the scripts/inference_hf.py script

#736 aawuj closed 1 year ago
3
Does the run_clm_sft_with_peft.py script not support training on shareGPT-style multi-turn data?

#735 xyfZzz closed 1 year ago
4
RuntimeError: Function MmBackward0 returned an invalid gradient at index 1 - expected device meta but got cuda:0

#734 wangjvjie closed 1 year ago
0
ValueError The vocab size of the tokenizer must be 49954, but found 49953

#733 wangjvjie closed 1 year ago
1
When continuing to train the LoRA weights of the Chinese-Alpaca model, which model should the new LoRA be merged with?

#731 Agreewithu closed 1 year ago
7
How much GPU memory does LoRA pre-training of the 13B model need? A single machine with two cards (2*24GB) runs out of GPU memory

#730 wangxigui closed 1 year ago
11
What does this mean? I really can't keep a straight face at this image

#729 CXLiang123 closed 1 year ago
4
prepare_model_for_kbit_training is not called beforehand when adding LoRA

#728 guijuzhejiang closed 1 year ago
1
Loss diverges during continued pre-training

#727 HalcyonLiang closed 1 year ago
9
Add news of Visual-Chinese-LLaMA-Alpaca

#726 airaria closed 1 year ago
0
About the base model used for training

#725 wuhuanon closed 1 year ago
3
If I merge the model on a Mac and then copy it to Win/Linux for use, will performance be affected?

#723 stoneLee81 closed 1 year ago
1
Error in single-machine multi-GPU training: works when torchrun's --nproc_per_node is set to `2`, but fails when set to a value greater than `2`

#722 shibingli closed 1 year ago
5
UserWarning: None of the inputs have requires_grad=True. Gradients will be None

#721 ljch2018 closed 1 year ago
3
Error in full-parameter pre-training

#720 Double-bear closed 1 year ago
0
Update banner path, change default decoding values for Gradio demo

#719 ymcui closed 1 year ago
0
About the prompt format in gradio_demo

#716 zeng9t closed 1 year ago
5
How was chinese_sp.model trained? Could you provide detailed steps and the code?

#713 zemu121 closed 1 year ago
6
langchain example does not produce correct output

#712 cyc00518 closed 1 year ago
2
The adapter_model.bin saved by LoRA training is very small, only 443 bytes

#711 guijuzhejiang closed 1 year ago
16
Progress bar hangs during model SFT training, with no error reported

#710 Qmymy closed 1 year ago
2
SHA256 mismatch after merging the 13B model; error during instruction fine-tuning

#709 slxy-hub closed 1 year ago
2
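How much data was used for SFT on Chinese LLaMA? I would like to reproduce it. The data should be a JSON file of 50k entries, but how many epochs or tokens was it trained for? The description says 4M for instructions, and I don't quite understand how that figure is calculated.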

#708 sixgold993 closed 1 year ago
6
Fix unexpected slow down in gradio web demo

#707 GoGoJoestar closed 1 year ago
2
Colab: error in the final 4-bit quantization step, 4096*49954 is not divisible by 256

#706 Ziffer-byakuya closed 1 year ago
2
Extend context size without fine-tuning

#705 airaria closed 1 year ago
4
