Hi, thanks for your great work! I notice that you used ChatGPT for scorer but it seems that there is no place for us to insert our own token. Does this mean we cannot use this scorer for arbitary model?
Moreover, do you think it can be used for a evaluation metric of llm output? Thanks.
Hi, thanks for your great work! I notice that you used ChatGPT for scorer but it seems that there is no place for us to insert our own token. Does this mean we cannot use this scorer for arbitary model?
Moreover, do you think it can be used for a evaluation metric of llm output? Thanks.