instadeepai / DebateLLM

Benchmarking Multi-Agent Debate between Language Models for Truthfulness in Q&A.
Apache License 2.0

Feature: Update the README to include GPT-4 results #4

Closed by DriesSmit 6 months ago

DriesSmit commented 6 months ago

Why?

Update the README with the latest GPT-4 results.

What?

Updated the README file.