williamleif / histwords

Collection of tools for building diachronic/historical word vectors
http://nlp.stanford.edu/projects/histwords/
Apache License 2.0
Difficulties using seq_procrustes.py with new embeddings #10

Open dufric opened 5 years ago

dufric commented 5 years ago

Hi,

I am currently experiencing some difficulties generating new embeddings with your code for visualizing words over time.

So far, I have generated separate embeddings for each year using sgns/hyperwords, and that seems to work.

I am now trying to use your script vecanalysis/seq_procrustes.py, but I think I am not using the correct format for the count file it requires. I suppose it is not the same as the one generated by hyperwords? Maybe I missed it, but is there an example of this file somewhere?

I downloaded the example "embeddings/eng-fiction-all_sgns" for visualisation (and it works), but I could not find any count file in it.

Thank you for the answer.

Best regards.