Closed lwzhaojun closed 4 years ago
The training objective in the original paper is theta, should the loss value be theta? Why is loss in your program a cross entropy loss?
The training objective in the original paper is theta, should the loss value be theta? Why is loss in your program a cross entropy loss?