Open antitoxic opened 8 years ago
Having the same issue. CSV file is in UTF-8, so it shouldn't be an issue. I suspect the issue is that headers are partially ascii, partially utf-8. Need to check
For Python 2, I guess you have to fix str(header)
to header.encode('utf-8')
.
- headers = [str(header) for header in headers]
+ headers = [header.encode('utf-8') for header in headers]
@amercader we (the civic hackers pushing the Bulgarian opendata portal) are closely following your work. And thank you. For the datapusher and the other tools you're working on.
We have a show-stopper problem for using the datapusher and it's quite common. It could be the data. It might be strangely formatted or simply because it's cyrillic. Please give us a hit.
This is the dataset the error occurs: http://opendata.obshtestvo.bg/dataset/spisak-na-razprostranitelite-ne-vinetni-stikeri
This is our staging server. You can play about and not worry about data or crushing.
This is what we get in the DataStore tab:
with:
Btw, what is "Determined headers and types". It's giving also giving us this:
The escaped version of the "Determined headers and types" is:
Seems like the CSV is encoded in Windwows-1251. Is this the problem? Interesting thing is that that data in "Determined headers and types" is actually line number 693. Why is this line determined as headers?
This is the stack: