stuckyb / ontopilot

15 stars 2 forks source link

unicode support #71

Closed stuckyb closed 7 years ago

stuckyb commented 7 years ago

It appears that there might be problems with unicode characters in excel documents. Unicode testing needs to be incorporated into the unit tests for all of the table reader classes, and extended character set support needs to be added, if necessary.

stuckyb commented 7 years ago

Fixed. OntoPilot now exclusively uses unicode strings internally, and all data reader classes return unicode strings and handle input unicode correctly (in the case of CSV files, it must be encoded as UTF-8).