pytorch / ELF

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
Other
3.37k stars 567 forks source link

[elfgames::go::ThreadedWriterCtrl-1] [info] no message, sleeping for 10s #69

Closed alatyshe closed 6 years ago

alatyshe commented 6 years ago

help, i got this error when try to run on train:

In game start
No previous model loaded, loading form ./myserver
[2018-07-18 09:09:18.853] [rlpytorch.model_loader.ModelLoader-0-model_index0] [info] Loading model from ./myserver/save-0.bin
[2018-07-18 09:09:18.853] [rlpytorch.model_loader.ModelLoader-0-model_index0] [info] replace_prefix for state dict: [['resnet.module', 'resnet']]
[2018-07-18 09:09:18.918] [rlpytorch.model_loader.ModelLoader-0-model_index0] [info] Finished loading model from ./myserver/save-0.bin
[2018-07-18 09:10:18.530] [elfgames::go::ThreadedWriterCtrl-1] [info] no message, sleeping for 10s
[2018-07-18 09:10:28.530] [elfgames::go::ThreadedWriterCtrl-1] [info] received message
[2018-07-18 09:11:28.531] [elfgames::go::ThreadedWriterCtrl-1] [info] no message, sleeping for 10s
[2018-07-18 09:11:38.531] [elfgames::go::ThreadedWriterCtrl-1] [info] received message
[2018-07-18 09:12:38.531] [elfgames::go::ThreadedWriterCtrl-1] [info] no message, sleeping for 10s
[2018-07-18 09:12:48.531] [elfgames::go::ThreadedWriterCtrl-1] [info] received message

i think i had problem with _serveraddrs.py. In my case i got some server wich i connect via ssh. For geting myserver_addrs i type:

ifconfig -a

and get something like this: inet addr:127.0.0.1, so i type it in _serveraddrs.py maybe i do it wrong, so please help me.

qucheng commented 6 years ago

I think this is expected. How long have you run the client?

alatyshe commented 6 years ago

I run it once(after start_server.sh) and immediately get that message. I waited a few minutes for some changes, but don't get it and terminate script.

qucheng commented 6 years ago

After start_server.sh, you also need start_client.sh in another process for them to connect

qucheng commented 6 years ago

closing. Please reopen with additional questions.