Open leejason opened 5 years ago
Any suggestions for benchmarking CTRL against GPT-2? For example, loss value, perplexity (PPL), or any other metric for measuring text generation quality?
Not a direct answer to your question, but this (timely) article by @chiphuyen is really good
https://thegradient.pub/understanding-evaluation-metrics-for-language-models/
Very helpful, thanks!
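For anyone landing here later: one caveat when comparing loss or PPL across CTRL and GPT-2 is that the two models use different tokenizers, so per-token numbers are not directly comparable. Below is a minimal sketch of how per-token loss relates to perplexity, and of a tokenizer-independent normalization (bits per character). The function names and the toy loss values are hypothetical and only for illustration; in practice the per-token negative log-likelihoods would come from each model's own forward pass over the same text.

```python
import math


def perplexity(token_nlls):
    """Token-level perplexity: exp of the mean per-token NLL (natural log)."""
    return math.exp(sum(token_nlls) / len(token_nlls))


def bits_per_character(token_nlls, text):
    """Total NLL converted to bits, divided by the number of characters.

    Normalizing by characters instead of tokens removes the dependence
    on how a given tokenizer splits the text.
    """
    total_bits = sum(token_nlls) / math.log(2)
    return total_bits / len(text)


# Toy example (made-up numbers): the same text scored by two hypothetical
# models whose tokenizers split it into 3 and 6 tokens respectively.
text = "the cat sat"
model_a_nlls = [math.log(4)] * 3  # 3 tokens, each with probability 1/4
model_b_nlls = [math.log(2)] * 6  # 6 tokens, each with probability 1/2

ppl_a = perplexity(model_a_nlls)
ppl_b = perplexity(model_b_nlls)
bpc_a = bits_per_character(model_a_nlls, text)
bpc_b = bits_per_character(model_b_nlls, text)

print(f"token PPL: A={ppl_a:.2f}, B={ppl_b:.2f}")
print(f"bits/char: A={bpc_a:.4f}, B={bpc_b:.4f}")
```

In this toy case the token-level perplexities differ even though both models assign the same total probability to the text, while bits per character comes out the same for both. Note that none of these numbers measure the quality of sampled text directly; they only measure how well each model predicts held-out text.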