lich99
/
ChatGLM-finetune-LoRA
Code for fine-tuning ChatGLM-6b using low-rank adaptation (LoRA)
Apache License 2.0
726
stars
64
forks
issues
CUDA error: device-side assert triggered
#52
488283943
opened
8 months ago
0
Error
#49
zjbian
opened
1 year ago
0
Error
#48
fxb392
opened
1 year ago
0
Setting aside GLM's bidirectional attention part, isn't the attention matrix lower triangular?
#47
fxb392
opened
1 year ago
0
GPU memory issue
#46
zhangsanjava
opened
1 year ago
0
Positional encoding
#45
fxb392
closed
1 year ago
3
Hello author, does this project have any Python version requirements? Would 3.7 work?
#44
tjulh
opened
1 year ago
1
while cnt < retry_cnt:
#43
fxb392
opened
1 year ago
0
How to build a Q&A dataset
#42
Godlikemandyy
opened
1 year ago
0
Is there a discussion group?
#41
roki1031
opened
1 year ago
0
Has anyone tried training on 8 GPUs?
#40
tu2022
closed
1 year ago
0
How do I load the LoRA model parameters into the original model?
#39
ZeyuBa
closed
1 year ago
0
How to set adapter_name in peft 0.3.0
#38
moseshu
opened
1 year ago
8
context_length = obj['prompt'].index(130004)
#37
moseshu
closed
1 year ago
2
Roughly how long does LoRA training take?
#36
realcarlos
opened
1 year ago
3
Fine-tuning has no effect
#34
ChenBinfighting1
opened
1 year ago
4
A question about ZeRO
#33
MAxx8371
opened
1 year ago
3
Loss is NaN when running the training test in example.ipynb
#32
SilentMoebuta
opened
1 year ago
1
Fix some minor typos
#31
l0rinc
closed
1 year ago
0
[deepspeed] OVERFLOW!
#30
JingerAI
opened
1 year ago
1
No such file or directory: '/root/.cache/huggingface/modules/transformers_modules/chatglm-6b/tokenization_chatglm.py'
#29
Data2Me
opened
1 year ago
8
What causes "ValueError: 150004 is not in list"?
#28
z1968357787
closed
1 year ago
3
torch.distributed.elastic.multiprocessing.errors.ChildFailedError when running train_new.py
#27
Skywalker-Harrison
opened
1 year ago
2
Does the GPU have to support bfloat16?
#26
z1968357787
opened
1 year ago
3
About distributed GPU training
#25
z1968357787
opened
1 year ago
1
example_simple raises an error
#24
qishisurenhhh
opened
1 year ago
1
I have more than 4 GPUs for running train.py, but it still runs out of memory. Checking memory usage shows that one GPU overflows and triggers the error. How can I solve this?
#23
z1968357787
opened
1 year ago
0
LoRA's A matrix never updates
#22
qz701731tby
opened
1 year ago
1
Hello, are there any test results after training that show a comparison with the original base model?
#21
kunshou123
opened
1 year ago
0
Error with single-node multi-GPU training
#20
ForgetThatNight
opened
1 year ago
0
Training loss becomes NaN
#19
qz701731tby
closed
1 year ago
2
Already obtained
#18
lbxcfx
closed
1 year ago
0
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
#17
xiamaozi11
opened
1 year ago
2
RuntimeError: CUDA error: invalid device ordinal
#16
yanqiangmiffy
closed
1 year ago
1
Which GPUs are people training on? A V100 can't handle it at all
#15
huzaizi2023
opened
1 year ago
11
LORAConfig error: ValueError: Target modules ['q', 'k', 'v'] not found in the base model. Please check the target modules and try again.
#14
nameless0704
closed
1 year ago
6
What is the command line to launch train.py?
#13
Data2Me
closed
1 year ago
1
How to set batch_size? Changing it causes a dimension error during training
#12
GUORUIWANG
closed
1 year ago
1
Could you provide the specific environment requirements?
#11
Data2Me
closed
1 year ago
1
Is DeepSpeed tied to multi-GPU? How much improvement does using DeepSpeed bring?
#10
nameless0704
closed
1 year ago
2
About multi-GPU
#9
zhongtao93
closed
1 year ago
6
After fine-tuning, can it serve as a custom enterprise FAQ? There might be about a hundred Q&A pairs
#8
terryops
opened
1 year ago
8
Can the quantized chatGLM-6b-int4 small model be used for fine-tuning?
#7
valkryhx
closed
1 year ago
4
The loss does not decrease between two epochs
#6
aizpy
closed
1 year ago
1
NameError: name 'train_dataloader' is not defined
#5
wccccp
closed
1 year ago
1
Results after training do not match
#4
zhangyanbo2007
closed
1 year ago
1
Training runs out of GPU memory
#3
GaoPengGit
closed
1 year ago
5
Some questions about the dataset and the fine-tuned model
#2
SarmonFish
closed
1 year ago
1
Are there any details about the dataset?
#1
980202006
closed
1 year ago
2