senderle / topic-modeling-tool

A point-and-click tool for creating and analyzing topic models produced by MALLET.
https://senderle.github.io/topic-modeling-tool/documentation/2017/01/06/quickstart.html
Apache License 2.0
106 stars 22 forks source link

Document UTF-8 corner cases #53

Open senderle opened 7 years ago

senderle commented 7 years ago

If you want to have UTF-8 support on Windows machines (and maybe others), you'll need to use the native app, or you'll have to add -Dfile.encoding=UTF-8 to your JAVA_TOOL_OPTIONS environment variable. This should be documented clearly, probably in the quick start guide, and certainly in the longer documentation... once it exists.