harness: --log_samples - Githubissues

logikon-ai / cot-eval

A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.

MIT License

12 stars 2 forks source link

Open ggbetz opened 4 months ago

ggbetz commented 4 months ago

Use --log_samples when calling harness and upload them in separate repo for later diagnostics: