turbomaze / word2vecjson

This project represents the 300-dimensional word vectors from word2vec as JSON.
117 stars 34 forks source link

Word vectors source (Question) #2

Open shiffman opened 6 years ago

shiffman commented 6 years ago

Hello!

I'm using your code to teach about word vectors at http://itp.nyu.edu and make some examples with http://p5js.org. Just curious about the original 1.6 GB bin file you downloaded with word vectors -- what is its source?

Great work and thank you!

turbomaze commented 6 years ago

Hi there,

Glad to hear you're finding this useful! The vectors come from this link https://code.google.com/archive/p/word2vec/, specifically the archive labeled "GoogleNews-vectors-negative300.bin.gz". You can read more about them there.

Hope this helps, Anthony

kostasx commented 3 years ago

Hello. It would be nice to provide a rough overview of the process that you followed to convert the original vector dataset to a JSON file.