🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
1.51k
stars
285
forks
source link
why it didn't use softmax in computing multichoice loss #51
Closed
eveliao closed 5 years ago
the input are just logits, not normalized by softmax, why can we directly compute the cross entropy loss with it and y?
https://github.com/huggingface/pytorch-openai-transformer-lm/blob/bfd8e0989c684b79b800a49f8d9b74e559298ec2/loss.py#L22