Facico Chinese-Vicuna issues

Facico / Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案，结构参考alpaca

https://github.com/Facico/Chinese-Vicuna

Apache License 2.0

4.14k stars 421 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

用finetune_others_continue.sh自己训练的数据，行4-bit量化后在llama回答不了，其他的问题也比较乱

#157 qqshuqain closed 1 year ago
3
RuntimeError: Error(s) in loading state_dict for PeftModelForCausalLM: size mismatch for base_model.model.model.layers.0

#156 zhiyixu closed 1 year ago
3
requeirements中给的https翻墙也会出现404问题

#155 153zml closed 1 year ago
5
构造古诗词（65263首）单指令语料直接使用finetune从头开始训练

#154 grantchenhuarong closed 1 year ago
3
使用需要遵循的协议

#153 yuanjing-jane closed 1 year ago
1
关于古诗词（347首）训练的后继问题

#152 grantchenhuarong closed 1 year ago
3
使用定时器方式监测断点执行finetune

#150 grantchenhuarong closed 1 year ago
2
checkpoint-11600这个在哪里下载

#149 jialongguo closed 1 year ago
0
instruct_chat_50k.jsonl 数据

#148 ethan-funny closed 1 year ago
0
cuda out of memory in saving steps.

#147 jzsbioinfo closed 1 year ago
2
请问用4090单卡是否可以lora微调13b模型？

#146 mircop1t closed 1 year ago
7
finetune.py中关于选择lora模型的代码段逻辑有些问题

#145 grantchenhuarong closed 1 year ago
0
finetune_others_continue.sh 报AttributeError: 'NoneType' object has no attribute 'print_trainable_parameters'

#144 qqshuqain closed 1 year ago
4
使用您提供的simple_test.py脚本测试，发现token第一个起始字符id是0，识别也是乱码

#143 leizhu1989 closed 1 year ago
15
我用这个模型发现已经支持中文，请问是他自己实现的还是组合了本模型

#142 mushuanli closed 1 year ago
1
python finetune.py --data_path ./sample/merge_sample.json --test_size 9 训练报错

#141 jackywei1228 closed 1 year ago
5
微调之后加载权重发现输出停不下来

#140 wilson9x1 closed 1 year ago
11
使用Lora和不使用Lora速度会差很多

#139 cxj01 closed 1 year ago
3
效果炸裂: Langchain + vicuna-13b + 本地知识库

#138 csunny opened 1 year ago
2
guanaco_belle_merge_v1.0 直接merge可能重复的情况？

#137 wilson9x1 closed 1 year ago
1
训练的3.0轮是够的么？

#136 wilson9x1 closed 1 year ago
1
提示：ERROR: Could not find a version that satisfies the requirement triton==2.0.0 (from versions: none) 是什么情况？

#135 davedat closed 1 year ago
2
README.md mis-inforamtion: `LLAMA13B is numerically superior to 175B GPT3）` -> 17.5B

#134 huide9 closed 1 year ago
2
希望作者提供 Chinese-Vicuna/Chinese-Vicuna-lora-13b-belle-and-guanaco 的训练过程的优化器参数

#133 greatewei closed 1 year ago
7
interaction.py 第158行，返回了两次history

#132 Rogerspy closed 1 year ago
5
If you tried to load a PyTorch model from a TF 2.0 checkpoint, please set from_tf=True

#131 longkeyy closed 1 year ago
3
流式输出结果和最终结果长度/内容相差较大

#130 Aiemu closed 1 year ago
1
How to run vicuna on a CPU with 16GB of RAM?

#129 DimIsaev closed 1 year ago
0
if it is possible to fine tune your model

#128 TY-hue123 closed 1 year ago
3
我构建了一个在共享GPU机器上也能提供公开服务测试的接口

#127 youkpan closed 1 year ago
1
bash generate.sh FAILED with AttributeError: 'NoneType' object has no attribute 'eval'

#126 SeekPoint closed 1 year ago
1
what the pytorch version you used ?

#125 SeekPoint closed 1 year ago
1
OSError: Can't load tokenizer for './llama-hf/llama-7b/'

#124 adaaaaaa closed 1 year ago
2
bash finetune_continue.sh failed with 'RuntimeError: Trainer requires either a model or model_init argument'

#123 SeekPoint opened 1 year ago
2
用7B的模型，以及官方checkpoint-final运行generate.sh，效果很差

#122 dizhenx closed 1 year ago
4
llama 7b lora微调效果比展示的差很多。https://github.com/Facico/Chinese-Vicuna/blob/master/docs/performance.md

#121 jzsbioinfo closed 1 year ago
4
两张3090多卡训练，运行finetune.sh后无任何报错，但是程序直接中断

#120 JupyterChu closed 1 year ago
2
这是什么情况，后面的回答都是回答的第一个问题？

#119 lichao4Java closed 1 year ago
2
请问13b模型具体是怎么微调的？用的什么参数和数据？还是直接用的finetune.sh么？

#118 jzsbioinfo closed 1 year ago
1
The tokenizer class you load from this checkpoint is not the same type as the class this function is called from. It may result in unexpected tokenization.

#117 dizhenx closed 1 year ago
1
May I ask how much training data you have for each language?

#116 god-ts closed 1 year ago
1
使用checkpoint-11600训练效果出不来，求指教。

#115 grantchenhuarong closed 1 year ago
31
纯CPP 推理，make chat 失败

#114 shikeno closed 1 year ago
3
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:

#113 SHITIANYU-hue closed 1 year ago
1
更新代码后，重新执行finetune.sh出错， TypeError: init_process_group() got multiple values for keyword argument 'backend'

#112 alisyzhu opened 1 year ago
7
运行generate.sh后一直报model为NoneType

#111 JupyterChu closed 1 year ago
2
Something went wrong Expecting value: line 1 column 1 (char 0)

#110 FantDing closed 1 year ago
2
The server console prints an error message when I submit a request from the browser

#109 ImGoodBai closed 1 year ago
2
请问这个的效果和vicuna相比怎么样

#108 rayvzn119 closed 1 year ago
6
需要最低配置是什么？

#107 dravinbox opened 1 year ago
4

Previous Next