jhlau / doc2vec

Python scripts for training/testing paragraph vectors
Apache License 2.0
644 stars 192 forks source link

The repository contains some python scripts for training and inferring test document vectors using paragraph vectors or doc2vec.

Requirements

Pre-Trained Doc2Vec Models

Pre-Trained Word2Vec Models

For reproducibility we also released the pre-trained word2vec skip-gram models on Wikipedia and AP News:

Directory Structure and Files

Model Hyper-Parameter Explanation

Publications