mwilliamson / python-mammoth

Convert Word documents (.docx files) to HTML
BSD 2-Clause "Simplified" License
811 stars 121 forks source link

character conversion issue? #60

Closed paklein closed 6 years ago

paklein commented 6 years ago

docx contains the Euro symbol: UnicodeEncodeError: 'ascii' codec can't encode character '\u2013' in position 39572: ordinal not in range(128)

mwilliamson commented 6 years ago

Could you post a full stack trace and an example document?

Could you also post how you're invoking Mammoth, either on the command line (if using the CLI) or in code (if using it as a library)?

GitBruno commented 6 years ago

I don't think this is an issue, but rather an issue of implementation. See similar issues on Stackoverflow. Said that, happy to write a unicode test to close this issue.

mwilliamson commented 6 years ago

Closing since there's not much I can do without further information.

paklein commented 6 years ago

Sorry for the long delay. Example attached. test.docx