yoonkim / lstm-char-cnn

LSTM language model with CNN over characters
MIT License
826 stars 221 forks

UTF-8 support #8

Open vseledkin opened 9 years ago

vseledkin commented 9 years ago

This adds UTF-8 support for multibyte-encoded corpora.
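
For context, here is a minimal Python sketch of why a character-level model needs to be UTF-8 aware. It is not the code from this PR (the repository itself is Lua/Torch); the function names `split_bytes` and `split_chars` are hypothetical and only illustrate the difference between iterating over bytes and iterating over code points.

```python
def split_bytes(word: str) -> list:
    """Split a word into single bytes, as a byte-oriented tokenizer would."""
    return [bytes([b]) for b in word.encode("utf-8")]


def split_chars(word: str) -> list:
    """Split a word into Unicode code points, keeping multibyte characters whole."""
    return list(word)


# "na\u00efve" contains U+00EF, which UTF-8 encodes as two bytes (0xC3 0xAF).
word = "na\u00efve"
byte_units = split_bytes(word)  # 6 units; the accented letter is cut in two
char_units = split_chars(word)  # 5 units; one per character

print(len(byte_units), len(char_units))
```

With byte-wise splitting, the character vocabulary ends up holding byte fragments rather than real characters, which matters most for corpora written in non-Latin scripts.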