asu-ke-web-services / text-geolocator

Generating heatmaps from and searching for geolocation data from text documents (Fall 2014/Spring 2015 CS Capstone)
MIT License

Handle Unicode or UTF-8 Character encoding #34

Closed: rraub closed this issue 9 years ago

rraub commented 9 years ago

Example: https://github.com/gios-asu/text-geolocator/blob/ivan-add-documentation/test-data/2-extracted-text/Vicca_etal_2014_Biogeosciences_0.txt

jcworkma commented 9 years ago

Unicode and UTF-8 encodings are now supported. The init method of the Stanford NER Tagger takes a flag that specifies which encoding to use, and I set it to 'utf-8'.
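
For anyone who hits the same problem, here is a minimal sketch of the kind of change described above. It assumes the tagger is NLTK's `StanfordNERTagger` wrapper, which this thread does not actually state. The helper names `decode_extracted_text` and `build_ner_tagger` and the model and jar paths are hypothetical, not taken from the project's code.

```python
def decode_extracted_text(raw, encoding="utf-8"):
    """Decode raw bytes from an extracted-text file into a str.

    Hypothetical helper: decoding with an explicit encoding avoids
    failures on non-ASCII characters such as accents or degree signs.
    """
    return raw.decode(encoding)


def build_ner_tagger(model_path, jar_path, encoding="utf-8"):
    """Build a Stanford NER tagger with an explicit encoding.

    Assumption: the project uses NLTK's StanfordNERTagger wrapper.
    The import is done lazily so the rest of this sketch runs
    without NLTK, Java, or the Stanford NER jar installed.
    """
    from nltk.tag import StanfordNERTagger

    return StanfordNERTagger(model_path, path_to_jar=jar_path, encoding=encoding)


# Non-ASCII text of the sort that appears in extracted papers.
sample = "Jülich, 50°55′N".encode("utf-8")
text = decode_extracted_text(sample)
```

With real files, `build_ner_tagger("path/to/model.ser.gz", "path/to/stanford-ner.jar")` would return a tagger whose `tag()` method accepts a list of tokens. Both paths here are placeholders.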