Closed philipstephens closed 6 years ago
Here's the error I get when I run it from Arena:
2018-02-13 17:38:02.150 Arena 3.5
2018-02-13 17:38:02.157
2018-02-13 17:38:02.373----------New game---2018-02-13 17:38:02,373 Tue -------------
2018-02-13 17:38:02.415screen: 1920x1080
2018-02-13 17:38:02.415Monitors: 1
2018-02-13 17:38:02.415Monitor0: 1920x1080
2018-02-13 17:38:02.415FormMonitor: 0
2018-02-13 17:38:02.471Loading 1
2018-02-13 17:39:36.7881--------------------------Starting engine 1 C0uci---------------------------
2018-02-13 17:39:36.7891Configured Engine 1 Type: Auto
2018-02-13 17:39:36.7891Engine 1 dir: E:\programming\pyproj\chess\chess-alpha-zero
2018-02-13 17:39:36.7891Engine 1 commandline: E:\programming\pyproj\chess\chess-alpha-zero\C0uci.bat
2018-02-13 17:39:36.9001Child Process Prio Adj: PID 10000 conhost.exe
2018-02-13 17:39:36.9011Child Process Prio Adj: PID 8604 python.exe
2018-02-13 17:39:36.9011Engine 1 ProcessID: 3620
2018-02-13 17:39:36.9011Engine 1 Prio:32 ThreadPrio:0
2018-02-13 17:39:36.901-->1:xboard
2018-02-13 17:39:36.922<--1:E:\programming\pyproj\chess\chess-alpha-zero>python src/chess_zero/run.py uci
2018-02-13 17:39:36.922-->1:uci
2018-02-13 17:39:37.136<--1:id name ChessZero
2018-02-13 17:39:37.136<--1:id author ChessZero
2018-02-13 17:39:37.136<--1:uciok
2018-02-13 17:39:37.1421Child Process Prio Adj: PID 10000 conhost.exe
2018-02-13 17:39:37.1421Child Process Prio Adj: PID 8604 python.exe
2018-02-13 17:39:37.142-->1:isready
2018-02-13 17:39:42.753<--1:Using TensorFlow backend.
2018-02-13 17:39:42.756<--1:Traceback (most recent call last):
2018-02-13 17:39:42.756<--1: File "src/chess_zero/run.py", line 20, in
Philip
It looks like it is trying to load the weights file. What happens when you just run C0uci.bat from a terminal or cmd (no Arena, no --distributed)?
When I enter C0ucu.bat it just sits there and does nothing until I type ctrl-c . It looks as if it is waiting for keyboard input, but I don't know what parameters it is looking for.
Traceback (most recent call last):
File "src/chess_zero/run.py", line 20, in
E:\programming\pyproj\chess\chess-alpha-zero>
You may not be familiar with the chess UCI protocol and commands. First, you should type 'uci' and the engine will respond with its name and author. Then, 'isready' and wait for the 'readyok' response. Next, 'go depth 1' and wait. A move like d2d4 should be output.
E:\programming\pyproj\chess\chess-alpha-zero>python src/chess_zero/run.py uci isready Using TensorFlow backend. 2018-02-14 14:29:39.396510: I C:\tf_jenkins\workspace\rel-win\M\windows-gpu\PY\36\tensorflow\core\platform\cpu_feature_guard.cc:137] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX AVX2 2018-02-14 14:29:39.867269: I C:\tf_jenkins\workspace\rel-win\M\windows-gpu\PY\36\tensorflow\core\common_runtime\gpu\gpu_device.cc:1105] Found device 0 with properties: name: GeForce GTX 1050 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.392 pciBusID: 0000:22:00.0 totalMemory: 4.00GiB freeMemory: 3.30GiB 2018-02-14 14:29:39.873405: I C:\tf_jenkins\workspace\rel-win\M\windows-gpu\PY\36\tensorflow\core\common_runtime\gpu\gpu_device.cc:1195] Creating TensorFlow device (/device:GPU:0) -> (device: 0, name: GeForce GTX 1050 Ti, pci bus id: 0000:22:00.0, compute capability: 6.1) readyok go depth 1 bestmove e2e4 engine.name go depth 1 bestmove e2e4 bestmove e7e5 go bestmove e2e4 quit
Oops, was working again but then I typed: python src/chess_zero/run.py self --type distributed with errors and now
E:\programming\pyproj\chess\chess-alpha-zero>python src\chess_zero\run.py uci
isready
Using TensorFlow backend.
2018-02-14 15:16:33.145562: I C:\tf_jenkins\workspace\rel-win\M\windows-gpu\PY\36\tensorflow\core\platform\cpu_feature_guard.cc:137] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX AVX2
2018-02-14 15:16:33.623236: I C:\tf_jenkins\workspace\rel-win\M\windows-gpu\PY\36\tensorflow\core\common_runtime\gpu\gpu_device.cc:1105] Found device 0 with properties:
name: GeForce GTX 1050 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.392
pciBusID: 0000:22:00.0
totalMemory: 4.00GiB freeMemory: 3.30GiB
2018-02-14 15:16:33.631631: I C:\tf_jenkins\workspace\rel-win\M\windows-gpu\PY\36\tensorflow\core\common_runtime\gpu\gpu_device.cc:1195] Creating TensorFlow device (/device:GPU:0) -> (device: 0, name: GeForce GTX 1050 Ti, pci bus id: 0000:22:00.0, compute capability: 6.1)
Traceback (most recent call last):
File "src\chess_zero\run.py", line 20, in
E:\programming\pyproj\chess\chess-alpha-zero>
It's working with self play now (without the distributed option). I am waiting for it to finish playing a series of games before I try the distributed version. I copied the model_best_weight.h5 and the model_best and the model_best_config.json from a copy of chess zero and then ran src\chess_zero\run.py self.
I've been able to reproduce the problem. Whenever I run
E:\programming\pyproj\chess\chess-alpha-zero>python src\chess_zero\run.py self --type distributed
I get
2018-02-19 09:55:53,511@chess_zero.manager INFO # config type: distributed ... File "h5py\h5f.pyx", line 78, in h5py.h5f.open OSError: Unable to open file (File signature not found)
and I then get the same error when I run E:\programming\pyproj\chess\chess-alpha-zero>python src\chess_zero\run.py self
When I look at the model directory I get 2018-02-19 09:55 AM
but when I delete model_best_config.json and model_best_weight.h5 with a previously saved copy,
E:\programming\pyproj\chess\chess-alpha-zero>python src\chess_zero\run.py self works again, but E:\programming\pyproj\chess\chess-alpha-zero>python src\chess_zero\run.py self --type distributed does not and never seems to work. Do you need any other info to debug this problem?
Thanks, Philip
This might not be an active project, so the distributed option may not apply. I suggest just running without it.
Thanks.
Whenever I run AlphaZero chess for the second time after reinstalling python I get the error
File "h5py\h5f.pyx", line 78, in h5py.h5f.open OSError: Unable to open file (File signature not found)
How do I avoid getting that error. When I run the command src\ chess_zero\run.py self --distributed and want to stop execution I type ctrl-c. How else do I stop the command without getting the above error? Thanks.
Philip