-
### System Info
i do some experiments on vanila_mcts, i did not see some backprop in the mcts tree Search process, more like best of n , every step choose the best step to search
### Who can help?…
-
mcts to low ent / high vent branching.
-
-
Currently takin ~20 seconds for 1000 step simulation
-
### Feature request
Support LLM-guided Self-Refinement MCTS inference method. It has the following features:
- LLM-as-Judge to provide review
- Proposer LLM generates rewriting of the answer, taki…
-
### System Info
python==3.10.15
cuda==11.8-8.8.1
torch==2.4.0
The latest version of code
GPU A100_40G * 8
### Who can help?
@ziyuwan @Gebro13 @
### Information
- [X] The official example scri…
-
"Figure 6: Optimal Action Granularity Depends on Problem Complexity. Comparison between Marco-o1-MCTS (mini-step of 64 tokens) (left) and Marco-o1-MCTS (step) (right) on the MGSM dataset. The model wi…
-
-
How is the MCTS simulation process reflected in the code?
-
This is starting to get somewhat difficult compare to what I first envisioned this to be but thats ok
Need to implement node terminality with self.is_terminal.
This is gained though getting the legal …