alphazero Search Results

947 results
for alphazero

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

lightvector/KataGo #612

multiStoneSuicideLegal config of KataGo

Hello KataGo Author , 　I am an amateur go player , on the Internet I am 9D go player. There is a suggestion I hope you to modify. 　All the human race rules of the go are not allow multiStone Suicide…

wukevinboy updated 2 years ago
3
ucfai/knightros-gambit #110

Develop way to evaluate performance of our model

Ideas: # Comparing by win/loss against other agents - Compare against an agent that selects actions randomly - Compare against our previous best baseline agent - Compare against stockfish (of va…

nashirj updated 2 years ago
4
Zeta36/chess-alpha-zero #26

Data format?

Could someone write a quick documentation of the input planes? Here's what I think it is: The last 8 board positions. each one 8x8x12 Current state, also 8x8x12 Side to move, 8x8 constant Move nu…

Akababa updated 6 years ago
3
arXivTimes/arXivTimes #1477

Mastering Atari, Go, Chess and Shogi by Planning with a Lear…

## 一言でいうと環境の動作と戦略を同時かつEnd-to-Endに学習する手法の提案。モンテカルロ木探索がベースだが、シミュレーションは実環境でなくモデルベースで行う。実際の行動軌跡はReplay Bufferに格納し、そこからサンプルした軌跡(実行動)から学習を行う。囲碁・チェス・将棋でAlphaZero、AtariでR2D2を上回る。 ![image](https://user-…

icoxfog417 updated 3 years ago
2
jonathan-laurent/AlphaZero.jl #175

Internal error: encountered unexpected error in runtime: Rea…

``` Internal error: encountered unexpected error in runtime: ReadOnlyMemoryError() Please submit a bug report with steps to reproduce this fault, and any error messages that follow (in their enti…

Snimm updated 1 year ago
1
google-deepmind/open_spiel #1122

python/examples/alpha_zero.py crashes with `CUDA_ERROR_NOT_I…

I'm running Ubuntu 22.04 WSL2, and I've tried running this with both `tensorflow==2.14.0` and `tf-nightly==2.15.0.dev20231010`. I am using `Python 3.11.5`, which is supported by the latest version of …

jthemphill updated 2 months ago
5
AlexMGitHub/Checkers-MCTS #1

APIs and GUI

Hi! Thank you for your project, it is the best checkers RL repo I've seen so far. You've done truly great work! I've got several questions regarding APIs and maybe future development: 1. Have you…

whatevernevermindbro updated 1 year ago
1
suragnair/alpha-zero-general #247

Adoption for single player game

Any suggestions for changing the code such that we can adopt it for single player game in which the rules are available and the goal is to get the highest score? For example, `snake eating egg` game a…

vsahil updated 4 months ago
4
kmcrage/leela_lite #1

Exploiting Variance

Implement the paper: Exploiting Variance Information in Monte-Carlo Tree Search Robert Lieck, Vien Ngo, Marc Toussaint AlphaZero etc. do not use either of the classic definitions of U, but use …

kmcrage updated 5 years ago
1
CuriosAI/sai #72

the relation of winrate and game amount

as we know, leelazero use 400 games for a match and 0.55 gate to pass, sometime it will pass earlier when the winrate is high at less games, such as 360 games and 0.58. they should be similar in proba…

l1t1 updated 4 years ago
1

上一页 1...12 13 14 15 16 17 18...95 下一页

947 results for alphazero

947 results
for alphazero