NVIDIA / Megatron-LM

Ongoing research training transformer models at scale
https://docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start
Other
10.42k stars 2.33k forks source link

about the "Multi-Stage Prompting for Knowledgeable Dialogue Generation" #230

Closed bingfeiz closed 3 months ago

bingfeiz commented 2 years ago

First of all thank you very much for your work. When I tried to reproduce the model code of MSDP I found that the checkpoint mentioned in the paper was missing. I found that only checkpoints for Bert and GPT are provided in the readme. So I would like to ask if you can provide me with the checkpoints used in the model.

ZHUANG-jt commented 2 years ago

I have the same problem. I tried to reproduce this work, but couldn't find the model or code here. It would be appreciated if you could provide it

github-actions[bot] commented 1 year ago

Marking as stale. No activity in 60 days. Remove stale label or comment or this will be closed in 7 days.