instadeepai / DebateLLM

Benchmarking Multi-Agent Debate between Language Models for Truthfulness in Q&A.
Apache License 2.0
12 stars, 2 forks