Word embeddings input size

Hi and thanks for the great series about transformers!

I noticed that you initialize the nn.embeddings layer for the word embeddings with an input size that is equal to the vocabulary size. As we would like to add the positional encodings with dimensions max_seq_length X 512 on top of the embeddings, the dimensions of the words embeddings should be the same as the ones for the positional embeddings (max_seq_length X 512).

So the corrected code would look something like:

class SentenceEmbedding(nn.Module):
    "For a given sentence, create an embedding"
    def __init__(self, max_sequence_length, d_model, language_to_index, START_TOKEN, END_TOKEN, PADDING_TOKEN):
        super().__init__()
        self.vocab_size = len(language_to_index)
        self.max_sequence_length = max_sequence_length
        self.embedding = nn.Embedding(self.max_sequence_length, d_model) #CORRECTED LINE....
        ...

Regards,

ajhalthor / Transformer-Neural-Network

Word embeddings input size #6