Closed gabrielvc closed 4 months ago
Hi again, Without the traceback, I can't see exactly what is going on to provide some support. But it seems that your problems are coming from the way your model has been saved. During training, in the config file, you should adjust the s4_lmax hyperparameter as the length of the time series (I left that by default value for PTB-XL, so you shouldn't change it for it, but if is for another dataset, you might change it depend on the length), otherwise, the s4 model will iteratively learn this parameter during training which won't let you load the model later for inference. Hope this helps Best Juan
Hi,
Thank you for you answer. This explains why I needed the batch size.
Best, Gabriel
Hello.
I am currently having trouble running the experiments. The
inference.py
script does not run with models fromtrain.py
. It seems it's a problem with the batch size being used in training and the variablesz
andomega
that are created in one of the S4 inner layers. I can't make it work by creating a new net, then using a dummy input of the same batch size and then loading the model. Is this normal?