Open leondz opened 12 years ago
I think this may be down to the terminal's encoding being ASCII-only? Not entirely sure... A fix might be to (re)open stdout in UTF-8 mode where possible. I'll investigate a bit more when I have time.
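For reference, a rough sketch of what "reopening stdout in UTF-8 mode" could look like. This is only an illustration, not what annotate_timex does: the helper name `utf8_writer` is made up here, and an in-memory buffer stands in for the real stdout so the effect can be checked.

```python
import codecs
import io


def utf8_writer(byte_stream):
    """Wrap a byte stream so text written to it is encoded as UTF-8.

    Hypothetical helper for illustration; not part of annotate_timex.
    """
    return codecs.getwriter("utf-8")(byte_stream)


# An in-memory byte buffer stands in for the terminal's stdout.
buf = io.BytesIO()
out = utf8_writer(buf)

# U+201C is the character from the traceback; it is written without error
# because the wrapper encodes to UTF-8 instead of ASCII.
out.write(u"\u201cquoted\u201d\n")
```

On Python 2 the same wrapper would presumably be applied as `sys.stdout = utf8_writer(sys.stdout)`, and on Python 3 to `sys.stdout.buffer`. Note that if the error is actually raised inside `str(doc)` rather than by the write to the terminal, wrapping stdout alone would not help, and the text would need to stay unicode until it reaches the writer.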
Well, if you feel like it! I'm putting these up partly as a note to myself to fix them - it just seems like the best place to keep bug reports
largely just throwing my own thoughts out there too tbh :)
From TAC_2010_KBP_Source_Data/data/2010/wb/eng-WL-11-174596-12957493.sgm (http://pastebin.com/Wz2QKEAZ):
Traceback (most recent call last):
  File "/usr/local/bin/annotate_timex", line 154, in
    print str(doc)
UnicodeEncodeError: 'ascii' codec can't encode character u'\u201c' in position 662: ordinal not in range(128)
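A minimal way to reproduce the same class of error outside the tool, assuming nothing about the document beyond it containing a curly quote (U+201C). The sample string is invented for illustration.

```python
# Sample text containing the left double quotation mark from the traceback.
text = u"\u201cquoted\u201d"

message = None
try:
    # Encoding to ASCII fails because U+201C is outside the 0-127 range.
    text.encode("ascii")
except UnicodeEncodeError as exc:
    message = str(exc)

# Encoding the same text as UTF-8 succeeds.
encoded = text.encode("utf-8")
```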