THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Apache License 2.0
40.47k stars 5.19k forks source link

Masked language modeling? #1386

Open serendipity800 opened 1 year ago

serendipity800 commented 1 year ago

Is there an existing issue for this?

Current Behavior

I can't find anything about this.

Expected Behavior

Can we use the ChatGLM to train on the masked language modeling task, just like BERT? Like "[MASK] is the Capital of U.S.", given the correct answer Washington, can we use this to train ChatGLM?

Steps To Reproduce

  1. I don't know

Environment

- OS:
- Python:
- Transformers:
- PyTorch:
- CUDA Support (`python -c "import torch; print(torch.cuda.is_available())"`) :

Anything else?

No response