THUDM / ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Other
15.68k stars 1.85k forks source link

微调 #576

Open ysq873 opened 11 months ago

ysq873 commented 11 months ago

Is there an existing issue for this?

Current Behavior

大佬们,如果一个序列里有多个[MASK],chatGLM的微调能在一次训练中同时生成多个[MASK]吗?还是说需要先生成第一个[MASK],然后生成下一个[MASK],我看GLM论文好像是直接生成所有MASK??

Expected Behavior

No response

Steps To Reproduce

Environment

Anything else?

No response