I would appreciate it if you could provide me with more about the details when you use APE as the baseline.
I think you use chatgpt to generate the prompts. But what is your score function when you use APE? What are your criteria for choosing prompts in the first part of APE. (I have seen the FQA, you don't use log because of API.)
And what prompts (system_content) you use to instruct Chatgpt?
Nice work!
I would appreciate it if you could provide me with more about the details when you use APE as the baseline.
I think you use chatgpt to generate the prompts. But what is your score function when you use APE? What are your criteria for choosing prompts in the first part of APE. (I have seen the FQA, you don't use log because of API.)
And what prompts (system_content) you use to instruct Chatgpt?
thanks