Ber666 / RAP

Reasoning with Language Model is Planning with World Model
https://arxiv.org/abs/2305.14992
MIT License
137 stars 15 forks source link

GPT? #6

Closed shuqike closed 1 year ago

shuqike commented 1 year ago

Have you guys tried mcts with gpt-3.5-turbo or gpt-4? I know openai api does not provide tokenizer access so we cannot have accurate log probs of action phrases.

Ber666 commented 1 year ago

Thanks for your question. We didn't try GPT in our experiment and we agree on your points. It seems impossible to get log probs with OpenAI API currently. However, we want to note that RAP is a framework compatible with any reward design, and there are other rewards you can get with GPT, e.g., confidence, self-evaluation, etc.