-
This is still pending my completion of the python->c++ port of the MCTS logic, of course. This is taking me longer than one might expect because I am building in all the parallelization niceties that …
-
MCTS will allow the agent to "playout" a game from the current state to generate a distribution over action-values. This will be used to generate a policy: state -> action.
- Will need to be able t…
-
Hi,
in LibreChat connected to optillm proxy served by docker I get following error:
2024-10-21 06:56:48 error: [handleAbortError] AI response error; aborting request: 500 "list index out of rang…
-
### Describe the issue
It may be premature to draw definitive conclusions at this stage, as 100 games may not provide a sufficiently large sample size. However, it appears that Version 36 has shown…
-
-
Hi
I have ran the search, rollout and select_next_move methods but im still not sure how to code the adapt method:
1. In line 52 and 53, what does possible move mean? Is it the move a stop could ha…
-
Hi NeverOnTimeSdnBhd,
There are 2 questions I am unsure of and require your explanation,
1. Is it okay to use level = 3, iterations = 100 in each sample test case?
2. What does the 3D array of poli…
-
1) 3d array initialization
My code is almost the same with Issue #11 (Sir said it was correct), but we got a ArrayIndexOutOfBoundsException error.
I have checked and tried for many times and the er…
-
Smooth UCB:
[Self-Play Monte-Carlo Tree Search in Computer Poker]
https://pdfs.semanticscholar.org/7b68/7599b4425aa959036071030e1212a3b359c7.pdf
-
Hey all,
I played around with a few games that interested me, for maybe doing some research on explainability.
I wanted to collect some game traces for Terraforming Mars, but couldn't get a full…