mtkresearch / TCEval

How can I use OpenAI GPT to judge MT-Bench-TW #3

Open ZoneTwelve opened 5 months ago

ZoneTwelve commented 5 months ago

Recently I've been trying to use MT-Bench-TW to evaluate someone else's model. However, mt_bench_tw does not provide a GPT-4 reference answer.

adamlin120 commented 4 months ago

+1

ftmtk commented 4 months ago

See the other issue #4