This PR allows you to create an assistant and run inference against a row from the benchmark.
For https://github.com/raymyers/swe-bench-util/pull/6 to work we needed to be able to get back what files are being used under the hood by Assistants inside a run. OpenAI's Assistants API has a placeholder for this but they don't use it.
Good stuff, merging. Fixed a small merge conflict that was due to formatting, in the future ruff format should help avoid that. If there's a way that makes it harder to forget to run that would be neat.
This PR allows you to create an
assistant
and run inference against a row from the benchmark.For https://github.com/raymyers/swe-bench-util/pull/6 to work we needed to be able to get back what files are being used under the hood by Assistants inside a
run
. OpenAI's Assistants API has a placeholder for this but they don't use it.From the OpenAI docs:
I implemented this functionality in astra-assitants (v0.1.9) and released it https://github.com/datastax/astra-assistants-api/pull/15
To test just run: