Open abdullahkhilji opened 4 years ago
I created the dictionary from the GloVe vocabulary. It works, but it takes a lot of time. Is it necessary to cap the number of words in the dictionary? Otherwise it consumes a lot of time.
As the error message indicates, you should keep the dictionary in the following format:
A 10000
B 10000
The value of cnt does not matter, but it must be provided.
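To check a dictionary file against this format before training, something like the following may help. It is only a sketch: it assumes each line is a token and an integer count separated by a single space, as in the sample above, and `find_malformed_lines` is a hypothetical helper, not part of MASS or XLM.

```python
def find_malformed_lines(lines):
    """Return (line_number, text) for every line that is not '<token> <count>'."""
    bad = []
    for lineno, line in enumerate(lines, start=1):
        text = line.rstrip("\n")
        # Split on the last space so the count is always the final field.
        parts = text.rsplit(" ", 1)
        # Assumption: the count must be a non-negative decimal integer.
        if len(parts) != 2 or not parts[0] or not parts[1].isdecimal():
            bad.append((lineno, text))
    return bad


if __name__ == "__main__":
    sample = ["A 10000\n", "B 10000\n", "C\n"]
    for lineno, text in find_malformed_lines(sample):
        print(f"line {lineno} is malformed: {text!r}")
```

For a real file, pass the open file object, for example `find_malformed_lines(open("dict.en.txt", encoding="utf-8"))`.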
I was following the same format.
The error was fixed after I reduced the size of dict.en.txt, which was around 800MB. Reducing the file to below 10MB by considering only the fine-tuning data worked. I will have to set a threshold for a better solution.
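One possible way to apply such a threshold is sketched below. It assumes the fine-tuning data is already tokenized with tokens separated by whitespace, and that the dictionary uses the `word count` format discussed earlier in this thread. `build_dictionary`, `min_count`, and `max_size` are hypothetical names chosen for illustration, not options of MASS or XLM.

```python
from collections import Counter


def build_dictionary(sentences, min_count=1, max_size=None):
    """Count tokens and return 'word count' lines, most frequent first."""
    counts = Counter()
    for sentence in sentences:
        # Assumption: tokens are separated by whitespace.
        counts.update(sentence.split())
    # Drop rare tokens, then optionally cap the dictionary size.
    kept = [(word, cnt) for word, cnt in counts.most_common() if cnt >= min_count]
    if max_size is not None:
        kept = kept[:max_size]
    return [f"{word} {cnt}" for word, cnt in kept]


if __name__ == "__main__":
    corpus = ["a b a", "b a c"]
    for line in build_dictionary(corpus, min_count=2):
        print(line)
```

Raising `min_count` or lowering `max_size` shrinks the dictionary; which values are appropriate depends on the fine-tuning data.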
I compared the dictionary generated by the XLM code with the sample given here at MASS. Although the format matches, it still gives an error.