nschneid / pysupersensetagger

AMALGrAM, an English supersense tagger written in Python
GNU General Public License v3.0

tags2sst breaking #16

Closed: muranava closed this issue 9 years ago

muranava commented 9 years ago

Hi,

I get this message when running tags2sst:


Traceback (most recent call last):
  File "src/tags2sst.py", line 203, in <module>
    convert(fileinput.input(args), labelsInRenderedAnno=labelsInRenderedAnno)
  File "src/tags2sst.py", line 191, in convert
    readsent()
  File "src/tags2sst.py", line 186, in readsent
    ({int(k): v[1] for k,v in data["labels"].items()} if labelsInRenderedAnno else {})),
  File "src/tags2sst.py", line 103, in render
    return u''.join(sum(zip(before,ww,labelafter,after), ())).strip()
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 0: ordinal not in range(128)

The .tags file is generated correctly, and tags2sst works on the example file.
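
For context: this message is what Python 2 typically reports when a unicode string is combined with a byte string containing non-ASCII data, because the bytes are implicitly decoded as ASCII. Byte 0xe2 is the first byte of many UTF-8 punctuation characters, such as curly quotes. The sketch below reproduces the same message by doing that ASCII decoding explicitly; the sample token and the helper name are assumptions for illustration and are not taken from tags2sst.py.

```python
# Hypothetical input token: a left curly quote encoded as UTF-8
# (bytes e2 80 9c) followed by plain ASCII text.
raw_token = b'\xe2\x80\x9chello'


def ascii_decode_error(data):
    """Return the error text raised when decoding data as ASCII, or None."""
    try:
        data.decode('ascii')
    except UnicodeDecodeError as err:
        return str(err)
    return None


message = ascii_decode_error(raw_token)
print(message)
```

If the input file contains only ASCII characters, no such error occurs, which would be consistent with the example file working.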

muranava commented 9 years ago

Hi, removing the u prefix (u'' becomes '') in line 103 seems to have solved this issue.
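
Dropping the u prefix presumably works because, in Python 2, joining byte strings with a byte-string separator never triggers an implicit ASCII decode. That only holds as long as every joined piece is a byte string. A more general approach, sketched here with hypothetical helper names and sample data (not the fix actually committed to the repository), is to decode each byte string as UTF-8 explicitly before joining:

```python
def to_text(piece, encoding='utf-8'):
    """Return piece as text, decoding byte strings explicitly."""
    if isinstance(piece, bytes):
        return piece.decode(encoding)
    return piece


def join_pieces(pieces):
    """Join a mix of byte and text pieces into one stripped text string."""
    return u''.join(to_text(p) for p in pieces).strip()


# Hypothetical pieces: UTF-8 encoded curly quotes around a text token,
# plus trailing whitespace that strip() removes.
pieces = [b'\xe2\x80\x9c', u'hello', b'\xe2\x80\x9d', u' ']
rendered = join_pieces(pieces)
print(rendered.encode('utf-8'))
```

The assumption that the input is UTF-8 encoded is mine; a file in another encoding would need a different encoding argument.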

nschneid commented 9 years ago

Thanks. This is a reminder to myself to fix it in the repository.

nschneid commented 9 years ago

@muranava, does the version at https://github.com/nschneid/pysupersensetagger/blob/master/src/tags2sst.py fix the problem?

muranava commented 9 years ago

Hi, great, thanks. It seems to work, and the output is cleaner to work with as well for novices like me :)

In reply to nschneid's question above (Tuesday, 14 April 2015, 11:17).