baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
https://huggingface.co/baichuan-inc/baichuan-7B
Apache License 2.0
5.66k
stars
504
forks
issues
Add a web application
#147
huia711
opened
1 week ago
0
[Question] Terminal error when installing dependencies (deepspeed)
#146
duolaBmeng673
opened
2 months ago
0
[Question] The WeChat group QR code has expired
#145
yzhao-2023
opened
2 months ago
0
[Question] Cannot install xformers
#144
Acid-uncoin
opened
2 months ago
1
[BUG] I downloaded the baichuan7b model from Hugging Face, and running the bundled test program produces a CUDA error
#143
QIANXUNZDL123
opened
3 months ago
1
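[Question] Is there anything to watch out for after merging parameters? After merging the 7B weights with the fine-tuned weights and loading the new model, GPU memory usage exceeds 24 GB, far more than the original 7B model needs. What could cause this?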
#142
Micla-SHL
opened
5 months ago
0
What is the difference between the baichuan2 and baichuan2-7B repositories?
#141
fxb392
opened
5 months ago
0
[Question] Will Baichuan-Text-Embedding be open-sourced, or offered through a free or paid API? Thanks
#140
Yazooliu
opened
6 months ago
0
[Question] I want to use Baichuan-7B to build a Chinese text correction feature, mainly for wrongly written characters. Is this feasible?
#139
suchstar
opened
7 months ago
0
What does the throughput measured on the A800 translate to as inference speed, in tokens/s?
#138
HJT9328
opened
8 months ago
0
[Typo]
#137
Chandler-Bing
opened
9 months ago
0
[Question] The RoPE implementation is inconsistent with the paper
#136
zehmaaa
opened
9 months ago
1
add code: sequence classification
#135
don-tpanic
closed
10 months ago
0
[Question] Can you provide a download mirror for the model hosted in mainland China?
#134
liulfy
opened
10 months ago
0
[BUG] CUDA out of memory when evaluating the model
#133
Crystalxd
opened
10 months ago
5
[Question] DeepSpeed ZeRO-3 save_checkpoint() produces empty model_states files
#132
mynewstart
opened
10 months ago
3
Could you provide a file similar to open_api.py so that we can test through an API?
#131
mawenju203
opened
10 months ago
0
[Question] Does the 7B model not use FlashAttention?
#130
nezhazheng
opened
10 months ago
1
Add OpenCompass badge in README
#129
vansin
opened
10 months ago
0
[Evaluation] Provide evaluation results for Baichuan models on OpenCompass
#128
Leymore
opened
10 months ago
0
[Question] Baichuan-7B native multi-GPU deployment, and int8 and int4 quantized deployment
#127
potong
opened
11 months ago
0
[Question] Methods for Baichuan-7B native multi-GPU deployment, and int8 and int4 quantized deployment
#126
potong
closed
11 months ago
0
[Question] How to deploy Baichuan-7B on multiple GPUs
#125
potong
closed
11 months ago
0
[Question] A question about data processing
#124
mynewstart
opened
11 months ago
0
Update QRCode
#123
baichuan-assistant
closed
11 months ago
0
Update Wechat QRCode
#122
baichuan-assistant
closed
11 months ago
0
I want to pretrain a general-purpose model. Could you provide demo data for the sample data loading step?
#121
wangweihua11
opened
11 months ago
0
How should I write the prompt to have the model supply the preceding or following line of a classical Chinese poem?
#120
goog
opened
11 months ago
0
Is the pretraining learning rate 1e-8?
#119
hegang1-tal
opened
11 months ago
0
After deployment, how do I call the model through an API?
#118
lemon-simple
opened
11 months ago
0
[Question] Hello, could you share the code for training the tokenizer, or point to a reference?
#117
StarrySeas1
opened
11 months ago
0
[Question]
#116
wqmoran
closed
12 months ago
0
[Question] What loss level counts as acceptable for continued pretraining?
#115
parkLGW
opened
1 year ago
0
Can I use baichuan 7b for reading comprehension?
#114
powerpistn
opened
1 year ago
0
Can the 7B train.py be used for full-parameter fine-tuning and full-parameter instruction fine-tuning of the 13B model? [Question]
#113
quzx
opened
1 year ago
0
[Question] When training a domain-specific model, how many tokens of incremental pretraining are needed to get good results?
#112
parkLGW
opened
1 year ago
3
[Question] Why does the Attention module in the Baichuan model not use attention_mask during training?
#111
sigmundchen
opened
1 year ago
1
What is the minimum GPU memory needed to deploy the model for inference, and how much system RAM is needed? [Question]
#110
ArlanCooper
opened
1 year ago
1
[Question] Single-machine, single-GPU training fails with an error: gradients cannot be initialized
#109
xkjcf
opened
1 year ago
7
[Question] Is there word segmentation example code for the Tasks.word_segmentation task?
#108
luxiaobai007
opened
1 year ago
0
The email address is no longer valid
#107
zhpmatrix
opened
1 year ago
0
[Question] During continued pretraining, the loss stays around 2.2. Was this also the case during the authors' pretraining stage?
#106
chenglu66
opened
1 year ago
2
[Question]
#105
felixdae
opened
1 year ago
0
[Question] Why does the output include the input?
#104
ghost
opened
1 year ago
0
[Question] A question about model parameters
#103
L-hongbin
opened
1 year ago
0
[Question] Are there plans to release smaller versions, such as 3B or 1B?
#102
tingxinli1
opened
1 year ago
0
What package is categories in evaluate_mmlu.py? Is it the pycategories package?
#101
kunzeng-ch
opened
1 year ago
1
[Question] How to test output that reaches the Max_token limit
#100
cason0126
opened
1 year ago
0
Dependency conflicts; hoping the maintainers can resolve them
#99
gsy44355
opened
1 year ago
0
[Question] Evaluation details for the model on agi-eval
#98
yangkexin
opened
1 year ago
1