iggyray / llms-planning

A benchmark for evaluating large language models in planning

Compare zero-shot vs one-shot prompting #21

Closed by iggyray 3 weeks ago