Closed: MLuchmann closed this issue 6 months ago
One thing to add: when the data range in the example provided in basics.ipynb is changed to
torch.linspace(-L*1.5, L, ...)
for both x and y, the kernel starts learning something useful.
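For reference, the widened range can be sketched as follows (`L` and the grid resolution are placeholder values here; in basics.ipynb they come from the notebook setup):

```python
import torch

L = 1.0          # placeholder domain half-length
n_points = 100   # placeholder grid resolution

# Widened range (vs. the original torch.linspace(-L, L, ...)) for both x and y:
x = torch.linspace(-L * 1.5, L, n_points)
y = torch.linspace(-L * 1.5, L, n_points)
```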
In addition, when the example is changed from sine and cosine functions to an asymmetric mapping, such as u(x) = x**3 to v(y) = y, the kernel operator again learns the mapping successfully.
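A minimal sketch of such an asymmetric data pair (grid size and domain are placeholders; the notebook's actual setup may differ):

```python
import torch

n = 64
x = torch.linspace(-1.0, 1.0, n)
y = torch.linspace(-1.0, 1.0, n)

u = x ** 3  # asymmetric input function u(x) = x**3
v = y       # target function v(y) = y
```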
Description
The current implementation of the Integral Kernel layer only learns the target function when the target space is discretized with a single value (shape(y) == (batch-size, 1, y-dim)), as is the case in examples/selfsupervised.ipynb. However, when the target space is discretized with more than one value, the operator fails to learn the target function.
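To make the shape distinction concrete (a sketch with placeholder sizes; the actual tensors come from the notebooks):

```python
import torch

batch_size, y_dim = 32, 1

# Working case (examples/selfsupervised.ipynb): a single discretization point
y_single = torch.rand(batch_size, 1, y_dim)   # shape (batch-size, 1, y-dim)

# Failing case: more than one discretization point, e.g. 50
n_y = 50
y_multi = torch.rand(batch_size, n_y, y_dim)  # shape (batch-size, 50, y-dim)
```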
To reproduce the problem, see basics.ipynb in the linked branch. I replaced the DeepONet with a single Integral Kernel layer (the problem persists when more than one layer is used, as in the current NeuralOperator implementation). The trained model converges to a constant function that coincides with the average of the target function, mean(v(y)), independent of model complexity and the number of epochs.
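One way to confirm the collapse-to-mean behaviour is to check that the model's predictions are (nearly) constant and equal to the mean of the targets. This is a hypothetical diagnostic with stand-in tensors; in the notebook, `v_y` and `pred` would be the actual targets and the trained layer's outputs:

```python
import torch

# Stand-ins for the notebook's targets and the trained model's predictions
v_y = torch.sin(torch.linspace(0.0, 3.14, 50)).expand(8, 50)  # targets v(y)
pred = torch.full_like(v_y, v_y.mean().item())                # collapsed output

# Symptom described above: output is constant and equals mean(v(y))
is_constant = pred.std().item() < 1e-6
matches_mean = abs(pred.mean().item() - v_y.mean().item()) < 1e-5
```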