feat: validation experiment results and compiled results

iggyray / llms-planning

A benchmark for evaluating large language models in planning

0 stars 0 forks source link

Closed iggyray closed 1 month ago

iggyray commented 1 month ago

This PR contains the results from the validation experiment as well as the compiled results.

resolves #19