lightvector / KataGo

GTP engine and self-play learning in Go
https://katagotraining.org/
Other
3.47k stars 563 forks source link

Sabaki, gogui error is coming out. #732

Open mc-mong opened 1 year ago

mc-mong commented 1 year ago

Opencl ..

Do not opencl operation.

Sabaki, gogui error is coming out.

version : 1.20.0 opencl

60block : kata1-b60c320-s6794945792-d3074237803.bin.gz kata-genmove_analyze W 50 = Got nonfinite for policy sum HASH: B36725E12621D50CABD1F49562C7CDC4 A B C D E F G H J K L M N O P Q R S T 19 . . . . . . . . . . . . . . . . . . . 18 . . . . . . . . . . . . . . . . . . . 17 . . . . . . . . . . . . . . . . . . . 16 . . . X . . . . . . . . . . . . . . . 15 . . . . . . . . . . . . . . . . . . . 14 . . . . . . . . . . . . . . . . . . . 13 . . . . . . . . . . . . . . . . . . . 12 . . . . . . . . . . . . . . . . . . . 11 . . . . . . . . . . . . . . . . . . . 10 . . . . . . . . . . . . . . . . . . . 9 . . . . . . . . . . . . . . . . . . . 8 . . . . . . . . . . . . . . . . . . . 7 . . . . . . . . . . . . . . . . . . . 6 . . . . . . . . . . . . . . . . . . . 5 . . . . . . . . . . . . . . . . . . . 4 . . . . . . . . . . . . . . . . . . . 3 . . . . . . . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . .

lightvector commented 1 year ago

Thanks for reporting the error. Does it happen for smaller nets than 60b?

Also, what is the output of "./katago tuner" using the 60b net on v1.11 versus v1.12? Perhaps the two ended up making very different choices of parameters to use, and one set of chosen parameters doesn't work?

mc-mong commented 1 year ago

Delete the tuning file and re-tune it again. This is normal.

You must delete the existing tuning file and re-create the tuning file.

Thank you.

lightvector commented 1 year ago

As posted in this comment https://github.com/lightvector/KataGo/issues/733#issuecomment-1374964495 this version of KataGo may be more consistent in the OpenCL tuning. If you and other people can confirm that it works well, then I will release it as another bugfix release. Otherwise I will keep iterating on it. Thanks!