XiangLi1999 / Diffusion-LM

Diffusion-LM
Apache License 2.0
1.02k stars 133 forks source link

Separate weights for word embedding and lm-head? #54

Open zeyun-zhong opened 1 year ago

zeyun-zhong commented 1 year ago

Hi, thanks for providing the code.

I have a question regarding the word embedding and lm-head. In your code, both functions shard the same weight. I wonder if they can have separate weights?

Thanks for your help!