ericleasemorgan / reader

Distant Reader, a tool for using & understanding a corpus
GNU General Public License v2.0
20 stars 7 forks source link

Create a semantic index #93

Open ericleasemorgan opened 4 years ago

ericleasemorgan commented 4 years ago

Given all the documents in a carrel's ./txt directory, create a semantic index using word2vec or GloVe. It does not matter which one. Save the resulting index file as ./etc/reader.vec. There is an example script in reader:/bin/carrel2vec.sh The stipulation is that the resulting file MUST be readable across operating systems.

archaeocharlie commented 4 years ago

Are you just looking for something like word2vec's keyed word vectors saved in the vec file or the entire model? @ericleasemorgan

ericleasemorgan commented 4 years ago

If I understand your question correctly, I'm looking for a something like word2vec's keyed word vectors saved to a .vec file. I believe, if just a file were distributed as a part of a study carrel, then we can also provide a script to query the file. I don't know what you mean when you say "the entire model".