iggyray / llms-planning

A benchmark for evaluating large language models in planning
0 stars 0 forks source link

feat: setup base pipeline #26

Closed iggyray closed 4 weeks ago

iggyray commented 4 weeks ago

This PR sets up a simple pipeline to test the base plan-bench pipeline with llama3:80b

resolves #25