Hzfinfdu / Diffusion-BERT

ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Apache License 2.0
296 stars, 25 forks

Here are some of my problems, please advise #15

Closed by CCCCCCCABC 3 months ago

CCCCCCCABC commented 1 year ago

Hi, dear author:

  1. In the paper, q(Xt | X0) appears as a denominator. Is that term left out of the calculation in the code?
  2. Why is the predicted x0 kept as floating-point values rather than mapped to a one-hot vector?
  3. step = t - 1 and step = t + 1 appear frequently in your code. Do they have a specific meaning? Thank you.