THUDM / GLM

GLM (General Language Model)
MIT License
3.2k stars, 327 forks

chatglm-6b #134

Open mx8435 opened 1 year ago

mx8435 commented 1 year ago

Hi, was the GLM model in chatglm-6b pretrained with this repo, with only a few modifications (such as adding RoPE embeddings and using a new tokenizer), or with the GLM-130B-based repo, which supports Megatron model parallelism? Thanks.

Martin-WMM commented 1 year ago

I would also like to know the answer to this question.