iggyray / llms-planning

A benchmark for evaluating large language models in planning
0 stars 0 forks source link

feat: validation experiment results and compiled results #20

Closed iggyray closed 1 month ago

iggyray commented 1 month ago

This PR contains the results from the validation experiment as well as the compiled results.

resolves #19