Adding code run failure handling process & Update to align with the newest OpenAI package formats

Description

Adding code run failure handling process:

This is done by query back to the API with the exception message and reward function code. There is a new parameter added in the config.yaml called max_retries that limits the attempts to fix the buggy reward code and prevents infinite loop. The current code can now at least generate 1 legit reward functions and, in most cases, generate all the num_samples to be executable.

Update to align with the newest OpenAI package formats:

According to https://github.com/openai/openai-python/discussions/742 OpenAI has released a new major version of its SDK, and they recommend upgrading promptly. I reimplemented the queries and handling of responses to align with the new requirements.

Type of change

[ ] Bug fix
[x] Feature enhancement/new feature
[ ] Documentation change
[ ] Style/UI change
[ ] Performance optimization
[x] Refactoring/code optimization
[ ] Other changes

https://github.com/eureka-research/Eureka/issues/4#issuecomment-1774193819

Specific changes

Changed the eureka/eureka.py file's query method
Modulized the reward code processing by encapsulating it as a function for code reuse
eureka/cfg/config.yaml Changed by adding parameter max_retries and updating the model to gpt-4-1103-preview

Screenshots

Example of failed run and automatic code fix:

samples

eureka-research / Eureka