-
Hi,
I was wondering whether I can apply the baselines algorithm to chess, using either https://github.com/genyrosk/gym-chess or a custom chess environment based on python-chess.
I was thinking of an 18x8x8 input and 6…
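As a sketch of one possible 18x8x8 board encoding with python-chess — note the exact plane layout here (12 piece planes, side to move, 4 castling-rights planes, en passant) is an assumption for illustration, not what the poster had in mind:

```python
import numpy as np
import chess

def encode_board(board: chess.Board) -> np.ndarray:
    """Encode a python-chess board as an 18x8x8 plane stack (hypothetical layout):
    planes 0-11: piece placement (6 piece types x 2 colors),
    plane 12: side to move, planes 13-16: castling rights, plane 17: en passant."""
    planes = np.zeros((18, 8, 8), dtype=np.float32)
    for square, piece in board.piece_map().items():
        rank, file = divmod(square, 8)
        offset = 0 if piece.color == chess.WHITE else 6
        planes[offset + piece.piece_type - 1, rank, file] = 1.0
    planes[12, :, :] = 1.0 if board.turn == chess.WHITE else 0.0
    rights = [board.has_kingside_castling_rights(chess.WHITE),
              board.has_queenside_castling_rights(chess.WHITE),
              board.has_kingside_castling_rights(chess.BLACK),
              board.has_queenside_castling_rights(chess.BLACK)]
    for i, right in enumerate(rights):
        planes[13 + i, :, :] = float(right)
    if board.ep_square is not None:
        rank, file = divmod(board.ep_square, 8)
        planes[17, rank, file] = 1.0
    return planes
```

A real MuZero-style input would typically also stack a history of past positions; this shows only a single-position encoding.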
-
I tried to run the code for Atari Freeway using the following command with the default settings in the code:
```bash
python main.py --env FreewayNoFrameskip-v4 \
    --case atari \
    --opr train \
    --am…
```
-
Hi @enpasos,
Any idea how to implement single-player games, e.g. CartPole, Lunar Lander, or Breakout?
And how to print out the current max score achieved?
Thanks in advance.
-
If I have access to the environment model, is it faster/better to train AlphaZero instead?
Thanks
-
`Agent`s are entities that, at minimum, expose a `sample_action` and an `update` method.
We exclude exploration strategies and curricula from the list.
_Implement_ means either producing new code from the pape…
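As a sketch, the two-method interface above could be expressed as a Python Protocol (the names `sample_action` and `update` come from the text; the `Agent` Protocol and the `RandomAgent` example are hypothetical illustrations):

```python
import random
from typing import Any, Protocol, runtime_checkable

@runtime_checkable
class Agent(Protocol):
    """Minimal agent interface: any entity with these two methods qualifies."""
    def sample_action(self, observation: Any) -> Any: ...
    def update(self, transition: Any) -> None: ...

class RandomAgent:
    """Toy agent satisfying the interface: samples uniformly over n discrete actions."""
    def __init__(self, n_actions: int, seed: int = 0):
        self._n = n_actions
        self._rng = random.Random(seed)

    def sample_action(self, observation: Any) -> int:
        return self._rng.randrange(self._n)

    def update(self, transition: Any) -> None:
        pass  # a learning agent would adjust its parameters here
```

Because the Protocol is structural, any existing class with matching method names satisfies it without explicit inheritance.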
-
### Search before asking
- [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests.
### Description
Hi,
I will appr…
-
Can AI training be added?
We could train the AI ourselves.
We would need up-to-date AI game records.
https://katagotraining.org/ is a very successful example of AI training you could look at for reference. Although it is for Go, it now has 20 million game records and is still growing.
Many people still study xiangqi (Chinese chess) from the old classical game collections, which are outdated. Xiangqi has its own advantages over Go: it puts players into deep calculation right away.
Fairy-Stockfish, a xiangqi AI engine:
https://gith…
-
Self-play, and multi-LM-agent settings in general, are something we are very interested in exploring. What would it take to support this? Does it already work without significant overhead?
-
Hi there,
First of all, great work, and thank you for open-sourcing your code!
I have a question regarding reanalyze: you chose to reanalyze 99% of policy targets and 100% of value targets. I am j…
-
The paper mentions a target network — is it missing from this implementation?