benchmarking with GPT-2

salesforce / ctrl

Conditional Transformer Language Model for Controllable Generation

https://arxiv.org/abs/1909.05858

BSD 3-Clause "New" or "Revised" License

1.87k stars 208 forks source link

Open leejason opened 5 years ago

leejason commented 5 years ago

Any suggestion for benchmarking CTRL with GPT-2? Say, loss value, PPL, or any metric to measure text generation quality?

julien-c commented 5 years ago

Not a direct answer to your question, but this (timely) article by @chiphuyen is really good

leejason commented 5 years ago

very helpful & thanks