-
### Scenario
I'd like to reuse the replay_buffer from an earlier run, but on a slightly different model. Is there any way to do this correctly? I've tried copying just the replay_buffer.pkl file to …
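A minimal sketch of the kind of round trip involved, assuming the buffer is a pickled Python object (the dict layout below is a made-up placeholder, not the repo's guaranteed on-disk format):

```python
import pickle

# Hypothetical buffer state; the real replay_buffer.pkl layout depends on the
# repo version, so treat these keys as illustrative only.
buffer_state = {"buffer": {0: "game_history_0"}, "num_played_games": 1}

# Save, then load it back as a new run would.
with open("replay_buffer.pkl", "wb") as f:
    pickle.dump(buffer_state, f)

with open("replay_buffer.pkl", "rb") as f:
    restored = pickle.load(f)
```

One caveat worth noting: the stored game histories carry value/policy targets produced by the old model, so reusing them with a different model can bias training unless those targets are recomputed.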
-
Recently while training I started getting the following errors.
```
2021-01-11 16:34:00,823 ERROR worker.py:980 -- Possible unhandled error from worker: ray::ReplayBuffer.get_batch() (pid=31490…
```
-
While attempting to train with resnet, I'm always getting ridiculously high losses. Eventually they climb so high that the update_weights process goes out of bounds and crashes. What factors can be con…
-
I saw your muzero-general repo. It had an open_spiel game wrapper. Can you please add it here too?
Keras is kinda slow.
Also, all games result in a draw, so no new model is selected.
-
I've performed several tests (different games) using the initial configuration, and I always get mean values (Total reward / mean value [plot 2 / tictactoe]) alternating around 0. So there seems to be …
-
## Question or maybe Enhancement
I'm missing a feature to scale the gradient in the backward pass (as used in MuZero, for example) ... something like
`tensor * scale + stop_gradient(tensor) * (1 - scale)`
I'm…
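The expression above translates directly to PyTorch; a minimal sketch, where `detach()` plays the role of `stop_gradient` (the helper name `scale_gradient` is just illustrative):

```python
import torch

def scale_gradient(tensor, scale):
    """Forward pass returns `tensor` unchanged; backward pass multiplies
    the gradient flowing into `tensor` by `scale` (detach() blocks the
    gradient through the second term)."""
    return tensor * scale + tensor.detach() * (1 - scale)

x = torch.ones(3, requires_grad=True)
y = scale_gradient(x, 0.5).sum()
y.backward()
# Forward value is unchanged (y == 3.0), but x.grad is 0.5 per element
# instead of 1.0.
```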
-
Hello,
I just tried the colab notebook (copied from https://github.com/werner-duvaud/muzero-general/blob/master/notebook.ipynb). After 13 minutes it still got a low score (see below). Other framew…
-
Dear Mr. Duvaud,
Upon cloning your repository to ".../Documents/AI", adding swig-4.0.2 (https://sourceforge.net/projects/swig/files/swigwin/swigwin-4.0.2/swigwin-4.0.2.zip/download?use_mirror=pho…
-
Mr. Varty,
Would you be willing to provide scripts or ideas for self-play, console-I/O play, and/or 2D games?
Tom Lever
-
Maybe change
`game_history.action_history.append(numpy.random.choice(self.game.legal_actions()))`
?
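As a standalone sketch of what that line does, assuming `self.game.legal_actions()` returns a list of legal action ids (the ids below are hypothetical), it samples one legal move uniformly at random:

```python
import numpy

# Hypothetical legal action ids for the current state.
legal_actions = [0, 4, 7]

# numpy.random.choice picks one entry uniformly at random; cast back to a
# plain int since the game expects an action id, not a numpy scalar.
action = int(numpy.random.choice(legal_actions))
assert action in legal_actions
```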