mpezeshki / pytorch_forward_forward

Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation
MIT License
1.44k stars 139 forks

Why use Softplus function in loss? #10

Open FengChendian opened 1 year ago

FengChendian commented 1 year ago

In the train function, your loss is a softplus function: $$loss = \ln(1 + e^x)$$ But in *The Forward-Forward Algorithm: Some Preliminary Investigations*, Hinton uses the logistic function: $$p = \sigma\left(\sum_j y_j^2 - \theta\right)$$ where $$\sigma(x) = \frac{1}{1 + e^{-x}}$$

Is this a mistake or a better choice?

    def train(self, x_pos, x_neg):
        for i in tqdm(range(self.num_epochs)):
            g_pos = self.forward(x_pos).pow(2).mean(1)
            g_neg = self.forward(x_neg).pow(2).mean(1)
            # The following loss pushes pos (neg) samples to
            # values larger (smaller) than the self.threshold.
            loss = torch.log(1 + torch.exp(torch.cat([
                -g_pos + self.threshold,
                g_neg - self.threshold]))).mean()
            self.opt.zero_grad()
            # this backward just computes the derivative and hence
            # is not considered backpropagation.
            loss.backward()
            self.opt.step()
        return self.forward(x_pos).detach(), self.forward(x_neg).detach()
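One way to see the connection (my own check, not stated in the thread): the softplus terms in the code are exactly the negative log-likelihood of Hinton's logistic probability $p = \sigma(g - \theta)$, since $-\log \sigma(g - \theta) = \log(1 + e^{-(g - \theta)})$, which matches the `-g_pos + self.threshold` term above. A small numerical sketch:

```python
import torch
import torch.nn.functional as F

# Sketch: the repo's softplus loss term equals the negative log-likelihood
# of Hinton's logistic probability p = sigmoid(g - theta).
# For a positive sample:
#   -log sigmoid(g - theta) = log(1 + exp(theta - g)) = softplus(theta - g)
g = torch.linspace(-5.0, 5.0, steps=11)  # hypothetical goodness values
theta = 2.0                              # hypothetical threshold

nll_pos = -torch.log(torch.sigmoid(g - theta))  # logistic / BCE view
softplus_pos = F.softplus(-(g - theta))         # repo's loss term

# The two are identical up to floating-point error.
assert torch.allclose(nll_pos, softplus_pos, atol=1e-6)
```

Under this reading, the two formulations are not competing choices: the softplus loss is what you get by training the logistic probability with log-loss.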
taomanwai commented 5 months ago

I observed the following:

  1. Both serve the same purpose: maximize g_pos and minimize g_neg.
  2. The original FF loss (based on the logistic) is bounded in (0, 1), with near-zero gradients when it approaches either 0 or 1. The softplus loss in the current mpezeshki implementation has range (0, +inf), with a near-zero gradient around 0 but a roughly constant gradient across the whole positive range, no matter how large the loss grows.

I believe the roughly constant gradient provides a stable and sufficient gradient signal, facilitating stable and relatively fast learning of the weights, which is why mpezeshki uses SoftPlus instead.
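The gradient claim is easy to check numerically (my own sketch, not from the repo). Treating `x = g - theta` as the loss argument, the derivative of softplus(x) is sigmoid(x), which approaches 1 for badly-misclassified samples, whereas a raw logistic-output loss sigma(x) has derivative sigma(x)·(1 − sigma(x)), which vanishes at both extremes:

```python
import torch

# Compare gradients of the two loss shapes at extreme and moderate inputs.
x = torch.tensor([-10.0, 0.0, 10.0], requires_grad=True)

# Softplus loss (repo's choice): d/dx softplus(x) = sigmoid(x),
# which saturates at 1 (not 0) for large positive x.
torch.nn.functional.softplus(x).sum().backward()
grad_softplus = x.grad.clone()

x.grad = None
# Raw logistic loss sigma(x): d/dx = sigmoid(x) * (1 - sigmoid(x)),
# which vanishes at BOTH extremes.
torch.sigmoid(x).sum().backward()
grad_sigmoid = x.grad.clone()

print(grad_softplus)  # ~[0.0000, 0.5000, 1.0000]
print(grad_sigmoid)   # ~[0.0000, 0.2500, 0.0000]
```

So with softplus, a sample that is far on the wrong side of the threshold still receives a full-strength gradient, while the raw logistic form would give it almost none.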

@mpezeshki, can you verify and confirm this explanation?