Closed lfmatosm closed 3 years ago
Thank you for your work! :) I just added a few comments to address before merging
Thanks for the feedback, @silviatti! I've added docstrings for the changed methods, at your request.
Hi @silviatti . Updated the PR with a commit adding support for headerless original word2vec embeddings file in textual format. This feature provides support for the Dieng's pretrained embeddings found here, as requested by cayaluke's comment on #32. Previously, my PR supported only regular files including the header.
Resolves #32, #37.
Adds support for original-formatted word2vec and
gensim.models.KeyedVectors
-formatted embeddings files for ETM model training. Updates some details on theCONTRIBUTING
section of the documentation.