Open iggyray opened 1 month ago
Based on experiment results in ./plan-bench/results/blocksworld_3/compiled_report_no_delimiters_1.json
:
Organising the prompt with delimiters seems to have little effect on validation accuracy. If anything, a slight decrease in accuracy was observed, particularly for shorter plans.
llama3:80b
is not able to consistently validate a valid plan.