Open akhileshkumargangwar opened 6 years ago
Each line of vocab.txt contains (1) the lowercased word and (2) its frequency, i.e., the number of times it appears in the training text. The second column is therefore a count, not a word ID or a learned weight, which is why it is an integer and why many words share the same value. The words are sorted by frequency, so common words are at the front and rare words are at the back.
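To make this concrete, here is a minimal sketch of how a file in this format could be produced. It is not the repository's actual script: the whitespace tokenization, the space separator between word and count, the alphabetical tie-break, and the names `build_vocab` and `format_vocab` are all assumptions made for illustration.

```python
from collections import Counter


def build_vocab(lines):
    """Count lowercased, whitespace-separated words (assumed tokenization)."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    # Most frequent first; ties broken alphabetically (an assumption).
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


def format_vocab(vocab):
    """Render one 'word count' pair per line (separator is an assumption)."""
    return "\n".join(f"{word} {count}" for word, count in vocab)


# Toy training text, purely illustrative.
corpus = ["The cat sat", "the dog sat", "A bird"]
vocab = build_vocab(corpus)
print(format_vocab(vocab))
```

In this toy corpus, "cat", "dog", "a", and "bird" each occur once, so they all get the same integer. The same thing happens at scale in a real corpus, where most rare words occur only a handful of times.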
Best regards
Thank You
I am unable to understand how vocab.txt is generated, and why many words are assigned the same integer value. Why not a real value?