Closed guillemram97 closed 9 months ago
Right now, the raw runs JSON is the best way to get this data. You can access it through "Full JSON" (the scenario_state.json
file). The schema for this file is defined in this Python dataclass; you could write a Python script to drill down to the specific data you need.
I want to store the answers from multiple models on multiple tasks to do Knowledge Distillation, but I'm broke and I can't afford to run them. I was thinking of using data from this project. Can we access full json files via an API/dataset? E.g., if we wanted to get the responses of openai/text-d avinci-003 on babi_qa, is there a way to do it other than looking for the run on the project site and then downloading the full json?