gsm8k cleanup - Githubissues

microsoft / promptbase

All things prompt engineering

MIT License

5.44k stars 302 forks source link

gsm8k cleanup #6

Closed nking-1 closed 11 months ago

nking-1 commented 11 months ago

Fixes needed to make the gsm8k benchmark run against Azure OpenAI GPT-4 chat model.

Also contains some additional small changes:

Removes the global prompt list in the gsm8k module, making it a scoped variable instead. We'll be doing something similar for the other scripts that followed this pattern.
Makes generations/ directory which will be used to store the intermediate outputs moving forward
Fixes a log file encoding issue
Adds "bigbench" into the valid datasets list