24/10/20 - Githubissues

junhwi / next-gen-ai

0 stars 0 forks source link

Open junhwi opened 1 month ago

junhwi commented 1 month ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

shylee2021 commented 1 month ago

Thinking LLMs: General Instruction Following with Thought Generation https://arxiv.org/abs//2410.10630v1