The metric of GPT-4 Eval

jongwooko / distillm

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

https://arxiv.org/abs/2402.03898

145 stars 21 forks source link

Closed liuxy1103 closed 8 months ago

liuxy1103 commented 8 months ago

How is GPT-4 Eval calculated? Can you provide the following assessment codes in addition to rouge-L?

jongwooko commented 8 months ago

Hi. Thank you for showing interest in our work.

We will make our customized evaluation code available as soon as possible.