issues
search
liangwq
/
Chatglm_lora_multi-gpu
chatglm多gpu用deepspeed和
404
stars
61
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Chatglm_lora_multi-gpu/APP_example/real_time_draw/realtime_draw_01.py
#52
hanryqiang2
opened
7 months ago
2
clip retrieval 成功运行app.py,但是不显示streamlit界面
#51
HuangSiYuan00
closed
8 months ago
2
看下我的搜索结果对不对
#50
jie1126
closed
1 year ago
0
解析相应报错
#49
jie1126
closed
11 months ago
3
huggingface_hub.utils._validators.HFValidationError
#48
jie1126
opened
1 year ago
3
langchain版本是多少
#47
jie1126
closed
11 months ago
1
推理问题
#46
UncleFB
opened
1 year ago
17
Deepspeed并未生效
#45
Mrjude
closed
11 months ago
1
多卡deepspeed模式
#44
Tongjilibo
closed
11 months ago
2
chatglm做图应用怎么使用
#43
yuxuan2015
closed
11 months ago
1
chatglm用deepspeed多卡推理问题
#42
flyme2023
opened
1 year ago
1
多卡并行问题
#41
awelldone
closed
11 months ago
1
alps import error
#40
fyj3266098
closed
11 months ago
1
模型是否存在信息泄露
#39
980202006
closed
11 months ago
4
deepspeed和lora
#38
kevinuserdd
opened
1 year ago
4
运行 web_feadback.py 报错
#37
Cola-Ice
opened
1 year ago
6
运行web_ui.py,报错:NameError: name 'LoraConfig' is not defined
#36
Cola-Ice
closed
1 year ago
1
你的README.md与Chatglm_lora_multi-gpu/data
#35
molyswu
opened
1 year ago
1
显存占用问题
#33
lelegogo26
opened
1 year ago
2
如何训练自己数据集
#32
lelegogo26
opened
1 year ago
1
运行deepspeed --num_gpus 2 chatglm_deepspeed_inference.py进行推理,没有生成yitu_output.csv是什么原因呢?
#31
algorithmconquer
opened
1 year ago
3
推理阶段deepspeed --num_gpus 2 chatglm_multi_gpu_inference.py报错
#30
algorithmconquer
opened
1 year ago
4
保存的模型只有adapter_model.bin,没有adapter_config.json是什么原因呢?
#29
algorithmconquer
opened
1 year ago
2
> @dsh54054 这个问题您解决了吗,遇到同样的问题了
#28
algorithmconquer
closed
1 year ago
0
NotADirectoryError: [Errno 20] Not a directory: 'hipconfig'
#27
algorithmconquer
opened
1 year ago
3
title拼错了
#26
huangxd-
closed
1 year ago
1
模型检查到一半就报错,大佬能帮我看看吗
#25
WXD7
opened
1 year ago
2
关于多GPU训练
#24
z1968357787
closed
11 months ago
1
数据集是不是失效了
#23
starhui70520
opened
1 year ago
3
ValueError: 150004 is not in list
#22
dsh54054
opened
1 year ago
8
Readme链接有错
#21
TheReluctantHeroes
opened
1 year ago
1
chatglm 分片模型不适合deepspeed
#20
kevinuserdd
opened
1 year ago
4
Belle数据集更新
#19
Mr-lonely0
closed
1 year ago
3
RuntimeError: expected scalar type Half but found Float
#18
lmx760581375
opened
1 year ago
12
deepspeed推理多进程问题
#17
kevinuserdd
opened
1 year ago
6
请问ddp模式的如何分布式导入模型?
#16
bai1451746927
closed
1 year ago
1
out of memory 显存溢出
#15
2023March
opened
1 year ago
1
这个好像也有 训练完之后lora_B.weight全都是0 的问题
#14
llplay
closed
1 year ago
5
可以增加个finetune之后的推断脚本吗
#13
llplay
closed
1 year ago
3
那个方法比较容易跑起来
#12
Chenzongchao
closed
1 year ago
1
ValueError: 150004 is not in list
#11
littlerookie
closed
1 year ago
6
数据集过大导致,服务器内存溢出 被kill
#10
2023March
opened
1 year ago
3
一些问题
#9
firslov
closed
1 year ago
3
初始化的时候报错
#8
aiaiyueq11
opened
1 year ago
0
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0! (when checking argument for argument index in method wrapper__index_select)
#7
xiaoweiweixiao
opened
1 year ago
4
防止训练显存超出,增加清理显存
#6
Flat-Chen
closed
1 year ago
0
偶现 ModuleNotFoundError: No module named 'transformers_modules.chatglm-6b.tokenization_chatglm'
#5
llplay
closed
1 year ago
3
报错求助
#4
Traceve
opened
1 year ago
1
一张卡能运行,两张卡报错
#3
Flat-Chen
closed
1 year ago
5
请问可能提供data文件夹中的数据吗?
#2
robin087
closed
1 year ago
1
Next