-
Hello KataGo Author ,
I am an amateur go player , on the Internet I am 9D go player. There is a suggestion I hope you to modify.
All the human race rules of the go are not allow multiStone Suicide…
-
Ideas:
# Comparing by win/loss against other agents
- Compare against an agent that selects actions randomly
- Compare against our previous best baseline agent
- Compare against stockfish (of va…
-
Could someone write a quick documentation of the input planes?
Here's what I think it is:
The last 8 board positions. each one 8x8x12
Current state, also 8x8x12
Side to move, 8x8 constant
Move nu…
-
## 一言でいうと
環境の動作と戦略を同時かつEnd-to-Endに学習する手法の提案。モンテカルロ木探索がベースだが、シミュレーションは実環境でなくモデルベースで行う。実際の行動軌跡はReplay Bufferに格納し、そこからサンプルした軌跡(実行動)から学習を行う。囲碁・チェス・将棋でAlphaZero、AtariでR2D2を上回る。
![image](https://user-…
-
```
Internal error: encountered unexpected error in runtime:
ReadOnlyMemoryError()
Please submit a bug report with steps to reproduce this fault, and any error messages that follow (in their enti…
-
I'm running Ubuntu 22.04 WSL2, and I've tried running this with both `tensorflow==2.14.0` and `tf-nightly==2.15.0.dev20231010`. I am using `Python 3.11.5`, which is supported by the latest version of …
-
Hi!
Thank you for your project, it is the best checkers RL repo I've seen so far. You've done truly great work!
I've got several questions regarding APIs and maybe future development:
1. Have you…
-
Any suggestions for changing the code such that we can adopt it for single player game in which the rules are available and the goal is to get the highest score? For example, `snake eating egg` game a…
-
Implement the paper:
Exploiting Variance Information in Monte-Carlo Tree Search
Robert Lieck, Vien Ngo, Marc Toussaint
AlphaZero etc. do not use either of the classic definitions of U, but use …
-
as we know, leelazero use 400 games for a match and 0.55 gate to pass, sometime it will pass earlier when the winrate is high at less games, such as 360 games and 0.58. they should be similar in proba…