First, thanks so much for posting your code!
When counter-fitting GloVe vectors with a much larger vocab (~2 million words), the number of dot products required to compute the VSP pairs explodes (2M^2 vs. 50k^2). The included pruned vocab of 50k takes only ~1 min on a 4-core machine. How long did the run take on your full GloVe vocab? I feel like I must be missing something.
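For a sense of scale, here is a rough back-of-the-envelope sketch. It assumes a brute-force all-pairs comparison and that runtime grows linearly with the number of pairs from the ~1 min baseline; both are my assumptions, not something I confirmed in the code, and the function and variable names are just illustrative.

```python
def pairwise_dot_products(vocab_size: int) -> int:
    """Number of unordered word pairs, i.e. dot products in an all-pairs comparison."""
    return vocab_size * (vocab_size - 1) // 2


pruned_vocab = 50_000     # pruned vocab included with the code
full_vocab = 2_000_000    # approximate size of the full GloVe vocab

pruned_pairs = pairwise_dot_products(pruned_vocab)
full_pairs = pairwise_dot_products(full_vocab)

# How many times more dot products the full vocab needs.
ratio = full_pairs / pruned_pairs

# Assumption: runtime scales linearly with pair count, starting from the
# ~1 min observed for the pruned vocab on a 4-core machine.
baseline_minutes = 1.0
estimated_hours = ratio * baseline_minutes / 60

print(f"pruned pairs: {pruned_pairs:,}")
print(f"full pairs:   {full_pairs:,}")
print(f"ratio:        {ratio:.0f}x")
print(f"naive estimate for full vocab: {estimated_hours:.1f} hours")
```

Under those assumptions the full vocab needs roughly 1600 times as many dot products, which would put a naive run at over a day rather than a minute, before even considering the memory needed to hold the results.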
Thanks again Nikola!