yohasebe / engtagger

English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger
GNU General Public License v2.0
263 stars, 48 forks

Switch toutf8 to encode to work correctly with already-UTF-8 strings #2

Closed. djudd closed this 2 years ago

djudd commented 11 years ago

Try EngTagger.new.add_tags("González-Bailón"). For me, in Ruby 1.9.3p385, it produced a tag with the contents "Gonz叩lez-Bail坦n".
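The garbled output is consistent with the UTF-8 bytes being misread as a Japanese encoding before conversion. The sketch below is a minimal illustration, not engtagger's actual code path: it assumes the wrong guess was EUC-JP (an assumption that happens to reproduce the characters above) and simulates it with `force_encoding` instead of calling `toutf8`. It then shows that `String#encode` leaves a string that is already UTF-8 unchanged.

```ruby
# encoding: utf-8

name = "González-Bailón" # already valid UTF-8

# Assumption: simulate a wrong encoding guess by relabelling the same bytes
# as EUC-JP and then transcoding to UTF-8. The two-byte sequences for "á"
# and "ó" are read as single kanji. dup keeps the original string untouched.
misread = name.dup.force_encoding("EUC-JP").encode("UTF-8")

# encode on a string that is already tagged and valid as UTF-8 is a no-op.
kept = name.encode("UTF-8")

puts misread
puts kept
```

Because `encode` relies on the string's declared encoding rather than guessing from its bytes, accented Latin names pass through intact.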