andrewaf1 closed this issue 2 years ago
Hi @andrewaf1! Yes, this is a bug on our side, sorry for the inconvenience. I have already opened a PR to fix it, and we will merge it soon.
@andrewaf1 please try v0.1.29
@joein Thanks! It now trains without error, although when I load my model to do inference I get vastly reduced accuracy. I will examine this further tomorrow or Monday to determine whether I am doing something wrong.
@andrewaf1 The problem may lie in the amount of data on which you calculate your metrics. If you look at the results of AttachedMetric and then compare them to the metrics you get across the entire dataset, there is likely to be a significant difference. This is because AttachedMetric only sees a fixed-size batch, whereas inference runs against the entire dataset, which also contains more difficult examples. I'd recommend using Evaluator to get a better sense of the metrics you will obtain at inference. An example of Evaluator usage can be found in our Q&A demo.
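To illustrate why a per-batch metric tends to look better than the same metric over the whole dataset, here is a minimal sketch in plain Python. It does not use Quaterion at all: the 2-D "embeddings" are synthetic, and the helper names (make_pairs, precision_at_1, batched_precision_at_1) are hypothetical, chosen only for this example. Each anchor has exactly one matching positive, and a query counts as a hit only if its own positive is the nearest candidate in the pool.

```python
import random


def make_pairs(n, noise, rng):
    """Build n synthetic (anchor, positive) pairs of 2-D points.

    Each positive is a noisy copy of its anchor. This is toy data,
    not the output of a real model.
    """
    pairs = []
    for _ in range(n):
        ax, ay = rng.uniform(0, 1), rng.uniform(0, 1)
        positive = (ax + rng.gauss(0, noise), ay + rng.gauss(0, noise))
        pairs.append(((ax, ay), positive))
    return pairs


def sq_dist(a, b):
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def precision_at_1(pairs):
    """Fraction of anchors whose own positive is strictly the nearest
    candidate among all positives in the given pool."""
    hits = 0
    for i, (anchor, positive) in enumerate(pairs):
        own = sq_dist(anchor, positive)
        others = (
            sq_dist(anchor, other)
            for j, (_, other) in enumerate(pairs)
            if j != i
        )
        if all(own < d for d in others):
            hits += 1
    return hits / len(pairs)


def batched_precision_at_1(pairs, batch_size):
    """Same metric, but each anchor only competes against the
    candidates that share its batch (the per-batch view)."""
    batches = [
        pairs[i:i + batch_size] for i in range(0, len(pairs), batch_size)
    ]
    total_hits = sum(precision_at_1(b) * len(b) for b in batches)
    return total_hits / len(pairs)


rng = random.Random(0)
pairs = make_pairs(n=512, noise=0.05, rng=rng)

batch_score = batched_precision_at_1(pairs, batch_size=32)
full_score = precision_at_1(pairs)

print(f"precision@1 within batches of 32: {batch_score:.3f}")
print(f"precision@1 over all 512 pairs:   {full_score:.3f}")
```

In this setup the per-batch score can never be lower than the full-dataset score: any anchor that finds its positive among all 512 candidates also finds it among the 32 in its batch, while the larger pool adds distractors that can only turn hits into misses. A real model and a different metric will behave differently in detail, but the direction of the gap is the same reason a batch-level number can be optimistic.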
This issue seems to be fixed, so I am going to close it soon. Please feel free to open a new issue or a discussion if you find anything odd or need help.
When trying to use SWA to improve generalization for my triplet network, I came across the following error. I believe SWA creates copies of the model, and I could see how this might trigger something like this. I have recently used SWA on a non-Quaterion network, so I am pretty sure this is not a Lightning issue.