instadeepai / jumanji

🕹️ A diverse suite of scalable reinforcement learning environments in JAX
https://instadeepai.github.io/jumanji
Apache License 2.0
584 stars 71 forks source link

feat(sudoku): data augmentation #153

Open Egiob opened 1 year ago

Egiob commented 1 year ago

The Sudoku problem has many symmetries and permutations to which it is invariant or equivariant. The currently used generators works by sampling existing instances from a fixed database. To improve both the diversity of the data and the generalization capabilities of the agents it could be beneficial to apply data augmentation to the existing instances in the database.

One strategy could be to draw a random data augmentation each time an instance is sampled from the database.

Possible permutations include:

See this for a more comprehensive view.