Closed BramVanroy closed 1 year ago
I have never seen this problem, but I think it probably comes from mip, yes. I find this issue there on a similar problem https://github.com/coin-or/python-mip/issues/254
I see you also filed an issue there, that's good.
For now, to not risk any stoppage while performing training, you can use
solver = solvers.HillClimber()
which is also the default. For training, the feedback from the hill-climber may be sufficient. Only for final evaluation optimal solving should be applied.
PS: which version of mip are you running? I see that they're now at 1.15.0. Maybe there was a fix for this bug.
Thank you for the quick response! The error occurred with 1.15.0. I now downgraded to 1.13.0 as per the recommendation in your README but I have no results yet.
It is a good idea to use the hill climber for training and ILP for final evaluation. That should be less problematic in my use case.
Because you suggest it is an mip
specific problem and not caused by smatchpp, I'll close this and hope that the people at mip
can find a solution but it seems that it is not that easy...
Great, feel free to re-open it anytime.
In case you manage/happen to reproduce this bug, I'd be very interested to see what's going on, might also be itneresting for the folks that develop mip
.
Hello
I've just reimplemented my neural network training pipeline and instead of
smatch
I am usingsmatchpp
. Overall this works great, so thank you for your work!Unfortunately however I sometimes get a terminal error that disrupts the whole training loop and it cannot be recovered. I have also reported this here. I do not know how to debug this so I am wondering/hoping that you have experience with a similar issue when using
mip
during testing your library.This is the error trace, but I can't figure out how to read it. Is
mip
the trigger, or istorch
the trigger? Does it have to do with distributed training? How can I debug this? A lot of questions... So if you have any insights, they are very welcome because this is stopping me from using it in my code as it completely destroys the training progress.smatch
does not rely on mip as far as I know.