ematvey / hierarchical-attention-networks

Document classification with Hierarchical Attention Networks in TensorFlow. WARNING: project is currently unmaintained, issues will probably not be addressed.
MIT License
463 stars 148 forks source link
deep-learning document-classification hierarchical-attention-networks machine-learning nlp tensorflow

Deep Text Classifier

Implementation of document classification model described in Hierarchical Attention Networks for Document Classification (Yang et al., 2016).

How to run

  1. Create a virtual environment, activate it, and install requirements:

    python3 -m venv env
    source env/bin/activate
    pip install -r requirements.txt
  2. Download the English model for spaCy:

python -m spacy download en
  1. Get Yelp review dataset and extract it in this directory.
    python3 yelp_prepare.py dataset/review.json
    python3 worker.py --mode=train --device=/gpu:0 --batch-size=30

Results

I am getting 65% accuracy on a dev set (16% of data) after 3 epochs. Results reported in the paper are 71% on Yelp'15. No systemic hyperparameter optimization was performed.