Closed weinman closed 2 years ago
I believe this and issue #15 are corrected in the branch update_210714 which has been pushed today. Please note that the branch update_210714 has a major change in the interface, and may break the scripts and hyperparameters that worked on the master branch.
I think there is a bug in the combination of the `forward` method of class `decolle.LenetDECOLLE` with the `decolle.utils.test` calculation. Specifically, when iterating over the layers, the model determines whether it is at the final layer and, if so, applies a sigmoid to the membrane potential rather than the surrogate gradient function.
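For reference, my reading of that loop is roughly the following sketch (paraphrased; the layer internals and names here are stand-ins, not the actual DECOLLE source):

```python
import torch
import torch.nn as nn

def forward_sketch(layers, x):
    """Paraphrase of the behavior described above: every layer but the
    last applies the surrogate-gradient spike function to its membrane
    potential, while the final (output) layer applies a sigmoid."""
    s_out, r_out, u_out = [], [], []
    for i, layer in enumerate(layers):
        u = layer(x)  # membrane potential (stand-in for the LIF dynamics)
        if i + 1 == len(layers):
            s = torch.sigmoid(u)   # final layer: sigmoid, not spike fn
        else:
            s = (u > 0).float()    # stand-in for the surrogate spike fn
        # dropout and readout are the identity in the final layer, so its
        # readout entry is the sigmoid output itself
        s_out.append(s); r_out.append(s); u_out.append(u)
        x = s
    return s_out, r_out, u_out

torch.manual_seed(0)
layers = [nn.Linear(4, 4), nn.Linear(4, 3)]
_, r_out, _ = forward_sketch(layers, torch.randn(1, 4))
```

The point is only that `r_out[-1]` leaves `forward` already squashed into (0, 1).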
Because the dropout and readout functions of the final layer are the identity, the final readout output `r_out[-1]` of `forward` has already been passed through the sigmoid. When the `decolle.utils.test` method calculates the accuracies, it passes all of the final read-out values through a sigmoid, including that last-layer output, which has already been passed through one. This seems to have a detrimental effect on the reported accuracies of the final layer.
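A toy illustration of what the double sigmoid does in single precision (the values are chosen arbitrarily; nothing here is DECOLLE code):

```python
import torch

# Two clearly distinct membrane potentials. After one float32 sigmoid
# both saturate to exactly 1.0, so a second sigmoid maps them to the
# same value and any subsequent argmax must break the tie arbitrarily.
u = torch.tensor([18.0, 20.0], dtype=torch.float32)
once = torch.sigmoid(u)      # both round to 1.0000 in single precision
twice = torch.sigmoid(once)  # identical values: the ordering is lost
```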
(In theory, the sigmoid is monotonic, and since the follow-up call to `decolle.utils.prediction_mostcommon` uses the `argmax`, it shouldn't matter; but since the sigmoid saturates for single-precision floating-point values, the argmax is likely making arbitrary choices between ties at 1.00000.)

Assuming my interpretation is correct, I would gladly submit a PR to patch this, but it seems there are several larger pieces at play that may preclude an obvious fix. Options include:
- Changing the `sigmoid` (as shown above) in the `decolle.LenetDECOLLE.forward` method to the identity; this would break the use of `cross_entropy_one_hot` in the output-layer training loss function.
- Checking for `n+1 == len(net)` in `decolle.utils.test` to avoid putting the readout through a second sigmoid in the last layer; I think this would "break" the behavior for nets where `with_output_layer` is set to `False`, because of this test.

Thoughts?
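For concreteness, the second option could look something like the sketch below; the function name, loop structure, and flag handling are my guesses, not the actual `decolle.utils.test` code:

```python
import torch

def squash_readouts(r_out, with_output_layer=True):
    """Hypothetical stand-in for the accuracy bookkeeping in
    decolle.utils.test: sigmoid each layer's readout, but skip the
    last layer when forward already applied a sigmoid to it."""
    outs = []
    for n, r in enumerate(r_out):
        if with_output_layer and n + 1 == len(r_out):
            outs.append(r)  # already sigmoided in forward; pass through
        else:
            outs.append(torch.sigmoid(r))
    return outs

r_out = [torch.tensor([2.0, -1.0]), torch.tensor([0.9, 0.1])]
outs = squash_readouts(r_out)
```

With `with_output_layer=False` the guard becomes a no-op and every readout is squashed exactly once, which is the case the second caveat above worries about.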