Open shindavid opened 1 year ago
Note: it is likely that this comment from the KataGo paper applies to this idea:
Except for introducing a minimum necessary amount of entropy, the above settings very likely have only a limited effect on overall learning efficiency and strength. They were used primarily so that KataGo would have experience with alternate rules, komi values, handicap openings, and positions where both sides have played highly suboptimally in ways that would never normally occur in high-level play, making it more effective as a tool for human amateur game analysis.
Implement the following idea from Appendix D of the KataGo paper:
Some thought is needed on how to generalize this for games besides go. The komi-adjustment in particular has no clear analog in other games. It might be the case that there is no good way to generalize this.