lethienhoa / DenseNet-NLP

Tensorflow implementation of Very Deep Convolutional Networks for Natural Language Processing
79 stars 22 forks source link

Very Deep Convolutional Networks for Natural Language Processing in Tensorflow

This is the DenseNet implementation of the paper Do Convolutional Networks need to be Deep for Text Classification ? in Tensorflow. We study in the paper the importance of depth in convolutional models for text classification, either when character or word inputs are considered. We show on 5 standard text classification and sentiment analysis tasks that deep models indeed give better performances than shallow networks when the text input is represented as a sequence of characters. However, a simple shallow-and-wide network outperforms deep models such as DenseNet with word inputs. Our shallow word model further establishes new state-of-the-art performances on two datasets: Yelp Binary (95.9\%) and Yelp Full (64.9\%).

Paper:

Hoa T. Le, Christophe Cerisara, Alexandre Denis. Do Convolutional Networks need to be Deep for Text Classification ?. Association for the Advancement of Artificial Intelligence 2018 (AAAI-18) Workshop on Affective Content Analysis. (https://arxiv.org/abs/1707.04108)

@article{DBLP:journals/corr/LeCD17,
  author    = {Hoa T. Le and
               Christophe Cerisara and
               Alexandre Denis},               
  title     = {Do Convolutional Networks need to be Deep for Text Classification ?},  
  journal   = {CoRR},  
  year      = {2017}  
}

Results:

Reference Source Codes: https://github.com/dennybritz/cnn-text-classification-tf