Open mw66 opened 2 months ago
I'm not sure what the cause is, but I have a guess: were you running in a distributed context or with multiple processes? I fixed a serialization issue in TKAN that was related to how the backend affects an initializer in keras_efficient_kan (it worked with JAX but not with Torch or TensorFlow). Try installing the latest tkan version (0.4.3) and running again; it may be as simple as that. If it still doesn't work, could you share your Python dependency setup and a reproducible example (using just a random data generator) so I can track it down?
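In case it helps with the dependency setup, here is a minimal sketch that prints the versions that seem relevant. It uses only the standard library; the package list is my assumption about what matters here (edit it freely), and `KERAS_BACKEND` is the environment variable Keras 3 reads to select its backend.

```python
import os
import platform
import sys
from importlib import metadata

# Assumed list of relevant distributions; adjust to match your environment.
PACKAGES = ["tkan", "keras", "keras_efficient_kan", "tensorflow", "torch", "jax", "numpy"]


def collect_versions(packages=PACKAGES):
    """Return a dict mapping each package name to its installed version."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = "not installed"
    return versions


def format_report(versions):
    """Build a plain-text report that can be pasted into the issue."""
    lines = [
        f"python: {sys.version.split()[0]}",
        f"platform: {platform.platform()}",
        f"KERAS_BACKEND: {os.environ.get('KERAS_BACKEND', 'not set')}",
    ]
    lines += [f"{name}: {version}" for name, version in versions.items()]
    return "\n".join(lines)


if __name__ == "__main__":
    print(format_report(collect_versions()))
```

Pasting the output together with the script that triggers the error should be enough to try reproducing it on each backend.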
Encountered a strange error: in the middle of the training,
With the following model shape: