Closed aosingh closed 6 years ago
I updated the code to force TransormerCLI output to the Unicode character set.
Also, you can force java to default to Unicode output by setting file.encoding, for example:
java -Dfile.encoding=UTF-8
Let me know if this fixes the problem.
Sorry for the late feedback, but adding
java -Dfile.encoding=UTF-8
to the parser script solved the issue.
For example consider the following inventor names. The inventor last-name has an HTML hex for the entity ø
They get serialized to