issues
search
baichuan-inc
/
Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
https://huggingface.co/baichuan-inc
Apache License 2.0
4.03k
stars
286
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
13B-chat微调训练每一步训练时长很长
#362
KevinFan0
opened
5 months ago
2
baichuan2-13B-chat,生成速度慢,输出时是乱码,十几个字符后程序就蹦了,求助原因
#361
blueskyban
closed
5 months ago
0
合并后的模型chat时报错:generation_utils.py unsupported operand type(s) for -: 'int' and 'NoneType
#360
growmuye
opened
5 months ago
0
运行需要什么样的python环境?提示xFormers版本问题
#359
qianma819
opened
5 months ago
1
Baichuan 2 支持昇腾 NPU 推理
#358
ssm0808
opened
5 months ago
2
使用LLAMA 自定义数据集训练Baichuan2-7B-Chat 回答语无伦次到底是什么问题?
#357
TzyTman
opened
5 months ago
1
When loading the tokenizer: ModuleNotFoundError: No module named 'transformers_modules.baichuan_0'
#356
zouyingcao
closed
5 months ago
0
model.chat()流式输出,如何捕捉oom异常?
#355
xieyongshuai
opened
5 months ago
0
各位大佬 GPU功耗很低 但是GPU利用率满载 是什么情况
#354
oho-work
closed
5 months ago
1
baichuan2-13b-chat微调错误:Expecting property name enclosed in double quotes
#353
jiaweiLL
opened
5 months ago
1
什么时候提供更大长度的模型,和支持agents的模型啊?
#352
ljwps
closed
5 months ago
1
请问百川官网的百川大模型部署的是多大参数的模型呢?
#351
fxb392
closed
5 months ago
1
what is z_loss_weight?
#350
AICHENaxx
closed
5 months ago
1
请问Baichuan2-13B-Chat是否支持按照openai的格式输出
#349
zhangfuliang66
closed
5 months ago
1
问下百川模型有做过算法备案么?
#348
ljwps
closed
5 months ago
5
翻译的测评任务中opencompass的prompt设计样例?
#347
hanjr92
opened
5 months ago
0
20核cpu,32G内存,用cpu推理,加载Baichuan-7B-Chat的模型内存溢出
#346
one-farmer
opened
5 months ago
1
ModuleNotFoundError: No module named 'transformers_modules.Baichuan2-13B-Chat-v2
#345
xealml
opened
5 months ago
0
13B-chat量化之后推理报错
#344
LinXin04
opened
5 months ago
1
13B-chat-v2版本长度怎么设置为8192
#343
Dusangrm
closed
5 months ago
1
运行web_demo.py报错
#342
fxb392
closed
6 months ago
1
Hi, have a question : Baichuan-Text-Embedding can be open for open source or have api to use or pay for use? thanks
#341
Yazooliu
closed
5 months ago
3
ValueError: Target module NormHead() is not supported.
#340
AlexJJJChen
opened
6 months ago
0
求助:A10的推理速度比3090慢一倍
#339
aiaiyueq11
opened
6 months ago
0
A
#338
aiaiyueq11
closed
6 months ago
0
V2版本中模型处理长度提升到8192
#337
hediyuan
closed
5 months ago
1
全参数继续预训练与lora微调时,应该怎么样设置学习率呢?
#336
Jay931003
opened
6 months ago
0
调用微调模型进行推理,报错OSError: ./models/augment-step2-8e-4 does not appear to have a file named config.json.
#335
LuckyGlass
closed
6 months ago
1
请问在baichuan2-13b-chat中,如何修改wen-demo.py,使得询问模拟你是谁或你来自哪里,回答自定义的答案。类似ChatGLM中的System_prompt
#334
jiaweiLL
closed
6 months ago
1
训练Baichuan2-7B-Base报OOM异常
#333
guoyjalihy
opened
6 months ago
3
两块4090显卡跑baichuan2 -13b-chat 报错
#332
phphappy
opened
6 months ago
1
内置的role只有,system, user,assistant三种,是否支持新增role呢?
#331
minmie
opened
6 months ago
0
运行cli_demo.py显示下面的问题
#330
Damonpkl
closed
6 months ago
2
第一轮训练正常,多轮训练OOM
#329
liunian-Jay
opened
6 months ago
3
加载8bit量化/离线量化模型报错:RuntimeError: probability tensor contains either inf, nan or element < 0
#328
jiaweiLL
opened
6 months ago
0
求教:模型输出结果不完整问题。
#327
empty2enrich
opened
6 months ago
3
请问目前模型经过微调后,可以把每个token的embedding向量dump下来吗?
#326
wanghao19970205
opened
6 months ago
1
Max-z loss
#325
bpwl0121
closed
6 months ago
2
API 调用怎么分析PDF文件
#324
ydh10002023
opened
6 months ago
9
Baichuan2-13B-Base全量微调问题
#323
xiaocangsheng
opened
6 months ago
1
对于特定输入baichuan2-7b-base模型会输出为空
#322
guankaisi
opened
6 months ago
3
求解惑,使用示例的quantize量化方式与使用BitsAndBytesConfig量化有什么区别?
#321
Songjw133
closed
6 months ago
2
请问全参数微调所需要的最低配置是多少?
#320
richey07
closed
6 months ago
1
百川预训练时对表格数据的处理
#319
sunshineflg
closed
6 months ago
1
商用授权
#318
DSXiangLi
closed
6 months ago
2
在线量化之后保存模型遇到问题
#317
MurraryZhao
opened
6 months ago
1
压缩率是如何计算的
#316
sunshineflg
opened
6 months ago
0
RuntimeError: probability tensor contains either inf, nan or element < 0
#315
HiXiaochen
opened
6 months ago
2
微调报错
#314
Dmm2584v
opened
6 months ago
1
loss 全是0
#313
whk6688
opened
6 months ago
8
Previous
Next