iggyray / llms-planning

A benchmark for evaluating large language models in planning
0 stars 0 forks source link

Test baseline pipeline with llama3:80b #25

Closed iggyray closed 4 weeks ago