Open BasselAli1 opened 5 years ago
I know that there is supposed to be an `'<unk>'` token in the vocabulary, so that during testing or evaluation on validation data the `'<unk>'` token is mapped to a vector of zeros (in the GloVe vectors). But I don't see that in this project. Am I right? I am still new to deep learning, so maybe I am wrong 😅

Also, in the `create_dictionary()` function you are using all the datasets to build the vocabulary. Shouldn't we only use the train dataset? Please correct me if I am wrong 😅
I also encountered the same problem. Have you solved it?
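For anyone landing here with the same question, below is a minimal sketch of the setup the issue describes: a vocabulary built from the training split only, with a reserved `'<unk>'` entry whose embedding row stays at zeros. It is not this project's code. The names `build_vocab`, `encode`, and `init_embeddings`, the choice of index 0 for `'<unk>'`, and the toy sentences are all assumptions made for illustration, and `pretrained` stands in for a word-to-vector dict loaded from GloVe.

```python
from collections import Counter

UNK = "<unk>"  # assumed name for the reserved out-of-vocabulary token


def build_vocab(train_sentences, min_count=1):
    """Build word -> index from training sentences only; index 0 is UNK."""
    counts = Counter(w for s in train_sentences for w in s.lower().split())
    word2idx = {UNK: 0}
    for word, n in counts.items():
        if n >= min_count:
            word2idx[word] = len(word2idx)
    return word2idx


def encode(sentence, word2idx):
    """Replace words never seen in training with the UNK index."""
    return [word2idx.get(w, word2idx[UNK]) for w in sentence.lower().split()]


def init_embeddings(word2idx, pretrained, dim):
    """Every row starts as zeros, so UNK (and any word missing from
    `pretrained`) keeps a zero vector."""
    weights = [[0.0] * dim for _ in word2idx]
    for word, idx in word2idx.items():
        if word in pretrained:
            weights[idx] = list(pretrained[word])
    return weights


# Toy data (hypothetical): the vocabulary sees the training split only.
train = ["what color is the cat", "is the dog brown"]
word2idx = build_vocab(train)

# "zebra" only appears at validation time, so it falls back to UNK.
ids = encode("what color is the zebra", word2idx)

# Stand-in for GloVe vectors; only "cat" has a pretrained vector here.
weights = init_embeddings(word2idx, {"cat": [1.0, 2.0]}, dim=2)
```

With this layout, validation or test words that never occurred in training share the `'<unk>'` index, so evaluation does not benefit from vocabulary entries the model could not have seen during training.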