AIDC-AI / Marco-o1

An Open Large Reasoning Model for Real-World Solutions
Other
960 stars 48 forks source link

About "Solution Space Expansion via MCTS" #17

Closed Arcmoon-Hu closed 1 day ago

Arcmoon-Hu commented 2 days ago

Hello, I'm not too familiar with MCTS. Can I analogize it to beamsearch. In the decoding stage, top_5 is selected each time the next token is decoded. After decoding is completed, the path with the largest score is selected according to the formula in your paper.