ValueError: Number of groups (2) must be a multiple of the number of channels (13).
This is due to tf-tabnet implementing group normalisation with a default of num_groups = 2, and the input dimension not being even and for this dataset it would be a prime number.
Due to mlr3keras doing the work to one hot features behind the scenes, it's not immediately obvious whether the input dimension will be odd or even.
Consequently I would argue we should set it to default at 1L rather than 2L so that the learner will train with default settings even though the upstream package defaults at 2, or to use batch_norm as per the original TabNet paper.
Minimal example:
This is due to tf-tabnet implementing group normalisation with a default of num_groups = 2, and the input dimension not being even and for this dataset it would be a prime number.
Due to
mlr3keras
doing the work to one hot features behind the scenes, it's not immediately obvious whether the input dimension will be odd or even.Consequently I would argue we should set it to default at 1L rather than 2L so that the learner will train with default settings even though the upstream package defaults at 2, or to use batch_norm as per the original TabNet paper.