Aunsiels / alphazero_janggi

4 stars 0 forks source link

Good morning? #1

Closed cjssh1002 closed 3 years ago

Aunsiels commented 3 years ago

Hi!

It is nice to see you are interested in this project! Indeed, I am not Korean, but I am working with a friend who is a professional Janggi player: https://www.youtube.com/channel/UCksfz2MkgxAKNJDvzemO3iA .

I am currently training neural networks on GPUs. I did the first try with 20 residual blocks, and I am now trying with 40, like in the original paper. I will put on Github the best network I got so far, but it did not reach a level good enough. It still knows some openings.

On the first try, I pretrained the network with existing games (supervised learning), but I had the feeling that it makes the network learn only one good move, and thus the exploration is not that good. Now I am learning without existing games.

The main problem is that I do not have enough computational power, so it takes time and I cannot really finetune the learning parameters.

We are also developing other tools for Janggi if you are interested!

Aunsiels commented 3 years ago

For your questions:

Real games are useful in the case of supervised learning. I already have some https://github.com/Aunsiels/alphazero_janggi/tree/main/data, but I did not find many. So, if you have more, it would be great to add them and see if it helps. As we do not speak Korean, it is hard for us to find resources online.

For Windows 10, I have not tried yet. I will see if I can program a small GUI to test the program.