The paper says you use the full Transformer architecture (Vaswani et al., 2017), but there are many Transformer-based architectures now (GPT-2, BERT, RoBERTa, ...), and each architecture is suited to its own tasks. Which one did you actually use?
Yes, for generation we used the original Transformer architecture from Vaswani et al. (2017), as opposed to those other, newer architectures.
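For reference, here is a minimal sketch of what that choice means in code, using PyTorch's `nn.Transformer` (this is an illustration, not the authors' actual implementation). The original model is a full encoder-decoder stack, unlike the decoder-only GPT-2 or encoder-only BERT/RoBERTa; the hyperparameters below are the "base" values from Vaswani et al. (2017), assumed here for concreteness:

```python
# Sketch only: the original encoder-decoder Transformer of Vaswani et al. (2017),
# as opposed to decoder-only (GPT-2) or encoder-only (BERT/RoBERTa) variants.
# Hyperparameters are the paper's "base" configuration, assumed for illustration.
import torch
import torch.nn as nn

model = nn.Transformer(
    d_model=512,           # model/embedding dimension ("base" config)
    nhead=8,               # attention heads
    num_encoder_layers=6,  # full encoder stack
    num_decoder_layers=6,  # full decoder stack
    dim_feedforward=2048,
    dropout=0.1,
)

# Dummy inputs, shaped (sequence_length, batch_size, d_model).
src = torch.rand(10, 2, 512)  # source sequence (encoder input)
tgt = torch.rand(7, 2, 512)   # target sequence (decoder input)

# Causal mask so each target position attends only to earlier positions,
# which is what lets the decoder be used autoregressively for generation.
tgt_mask = model.generate_square_subsequent_mask(tgt.size(0))

out = model(src, tgt, tgt_mask=tgt_mask)
print(out.shape)  # torch.Size([7, 2, 512])
```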