-
```
Internal error: encountered unexpected error in runtime:
ReadOnlyMemoryError()
Please submit a bug report with steps to reproduce this fault, and any error messages that follow (in their enti…
```
-
Some configs I am able to run in version 0.0.3 (https://github.com/opendilab/LightZero/commit/3cb7fff41f65bb21463418f8a161818ed6a33f93) cannot be run in the latest main branch.
For example, `zoo/bo…
-
Implement the paper:
Exploiting Variance Information in Monte-Carlo Tree Search
Robert Lieck, Vien Ngo, Marc Toussaint
AlphaZero etc. do not use either of the classic definitions of U, but use …
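To make the contrast concrete, the two families of selection rules can be sketched side by side: a variance-aware bound in the UCB1-Tuned style (Auer et al.) versus AlphaZero's prior-driven PUCT. This is a rough illustration with made-up constants, not the exact formula from the Lieck et al. paper:

```python
import math

def ucb1_tuned(mean, var, n_parent, n_child, c=1.0):
    """Variance-aware UCB in the style of UCB1-Tuned: the exploration
    width shrinks when the observed reward variance is low."""
    log_term = math.log(n_parent) / n_child
    v = var + math.sqrt(2 * log_term)
    # 1/4 is the maximum variance of a reward bounded in [0, 1].
    return mean + c * math.sqrt(log_term * min(0.25, v))

def puct(mean, prior, n_parent, n_child, c=1.5):
    """AlphaZero-style PUCT: exploration is driven by the network prior,
    not by any variance statistic of the returns."""
    return mean + c * prior * math.sqrt(n_parent) / (1 + n_child)
```

The point of the comparison: in the first rule, extra information about the spread of returns tightens or widens exploration per child, while PUCT only ever consults visit counts and the learned prior.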
-
## In a nutshell
An attempt to compose DNN execution out of reusable modules. The overall framework is reinforcement learning (AlphaZero): in addition to the environment state, the model receives a program embedding, processes it with an LSTM, and outputs an Action/Value. The LSTM acts as the program's execution environment and keeps running until a STOP action is emitted.
![image](https://u…
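The run-until-STOP control flow described above can be sketched as follows. The recurrent core is stubbed out here (the real model is an LSTM over the observation and program embedding); all names and the STOP action id are illustrative:

```python
import random

STOP = 0  # illustrative id for the terminating action

def lstm_step(obs, program_embedding, hidden):
    """Placeholder for the recurrent core: in the described model an LSTM
    consumes the observation plus the program embedding and emits an
    action distribution and a value. Stubbed with random actions here."""
    hidden = hidden + 1  # stand-in for the recurrent state update
    action = random.randrange(4)
    value = 0.0
    return action, value, hidden

def run_program(obs, program_embedding, max_steps=100):
    """Execute the 'program' until a STOP action is emitted (or a step
    budget runs out), mirroring the loop described above."""
    hidden, trace = 0, []
    for _ in range(max_steps):
        action, value, hidden = lstm_step(obs, program_embedding, hidden)
        trace.append(action)
        if action == STOP:
            break
    return trace
```

The step budget is a practical guard that the one-paragraph summary does not mention; without it, a policy that never emits STOP would loop forever.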
-
As we know, Leela Zero uses 400 games for a match and a 0.55 gate to pass; sometimes a candidate passes earlier when the win rate is already high after fewer games, such as 360 games at 0.58. They should be similar in proba…
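One way to compare the two gates is the chance of clearing each by luck alone, under a null hypothesis that the candidate is exactly as strong as the incumbent. A quick binomial tail computation (a sketch with a fixed-sample model; Leela Zero's real match logic may stop early, which this ignores):

```python
from math import ceil, comb

def pass_prob(games, gate, p=0.5):
    """Chance of clearing a win-rate gate when the true win rate is p:
    P(wins >= ceil(games * gate)) under a Binomial(games, p) model."""
    # Small epsilon guards against float rounding (e.g. 400 * 0.55
    # evaluating to a hair above 220.0).
    need = ceil(games * gate - 1e-9)
    return sum(comb(games, k) * p**k * (1 - p) ** (games - k)
               for k in range(need, games + 1))

# Compare the standard gate (220/400) with the early one (209/360).
p_400 = pass_prob(400, 0.55)
p_360 = pass_prob(360, 0.58)
```

Under this model the two thresholds are not identical false-pass rates; the fewer-games gate demands a higher observed win rate, which more than compensates for the smaller sample.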
-
# Trie Code Template

```python
class Trie(object):
    def __init__(self):
        self.root = {}
        self.end_of_word = "#"

    def insert(self, word):
        node = self.root
        for char in word:
            node = node.s…
```
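Since the snippet above is cut off mid-line, a complete version of the same dict-based trie might look like the following (`search` is a companion method the truncated template presumably also carried):

```python
class Trie(object):
    def __init__(self):
        self.root = {}          # each node is a plain dict: char -> child node
        self.end_of_word = "#"  # sentinel key marking a complete word

    def insert(self, word):
        node = self.root
        for char in word:
            node = node.setdefault(char, {})  # descend, creating nodes as needed
        node[self.end_of_word] = True

    def search(self, word):
        node = self.root
        for char in word:
            if char not in node:
                return False
            node = node[char]
        return self.end_of_word in node
```

The `"#"` sentinel is what distinguishes a stored word from a mere prefix: `search("app")` fails after `insert("apple")` unless `"app"` was inserted too.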
-
I saw a YouTube video suggesting that this was difficult in principle due to the possibility of the agent forming cartels (i.e., it learns that it is always best to cooperate with position 2 if it finds it…
-
Hello, I have implemented a [Chinese chess AI](https://github.com/NeymarL/ChineseChess-AlphaZero) based on this repo, but my training results are really bad. After supervised learning on 10K games, it b…
-
I'm new to Deep Reinforcement Learning and my supervisor has shown great interest in this work. I have successfully run the code here, but I don't know how to connect this work to the games; I want to since…
-
Would it be possible to have multiple machines all generate training games over the same network and send the generated games to a "master" machine, which would use the training data to train a new ver…
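This master/worker pattern is easy to prototype on one box before distributing it. A minimal sketch using Python's `multiprocessing` (the game payload and all names are illustrative; across real machines the queue would be replaced by sockets or a message broker):

```python
import multiprocessing as mp

def selfplay_worker(game_queue, n_games):
    """Stand-in for a worker machine: generate games and ship them to the master."""
    for _ in range(n_games):
        game = {"moves": ["e2e4", "e7e5"], "result": 1}  # placeholder self-play record
        game_queue.put(game)
    game_queue.put(None)  # sentinel: this worker is done

def master(n_workers, n_games_each):
    """Stand-in for the master: collect games from every worker, then train."""
    queue = mp.Queue()
    workers = [mp.Process(target=selfplay_worker, args=(queue, n_games_each))
               for _ in range(n_workers)]
    for w in workers:
        w.start()
    games, done = [], 0
    while done < n_workers:
        item = queue.get()
        if item is None:
            done += 1
        else:
            games.append(item)
    for w in workers:
        w.join()
    # Here the master would run a training step on `games` and broadcast
    # the new network weights back to the workers.
    return games

if __name__ == "__main__":
    collected = master(n_workers=2, n_games_each=3)
    print(len(collected))
```

The sentinel-per-worker scheme lets the master know when every producer has finished without any shared counters; weight broadcast back to the workers is the part this sketch leaves out.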