jakopako / goskyr

A configurable command-line web scraper written in go with auto configuration capability
GNU General Public License v3.0
33 stars 5 forks source link

Encoding bug? #211

Closed jakopako closed 3 months ago

jakopako commented 1 year ago

2023/04/04 15:30:31 https://www.elbphilharmonie.de/de/programm/TICKETS/ ERROR: error while parsing field date: parsing time "19:30\xc2\xa0Uhr Wed, 5.4.2023 " as "15:04 Uhr Mon, 2.1.2006 ": cannot parse "\xc2\xa0Uhr Wed, 5.4.2023 " as " Uhr ". Skipping item map[title:franz ensemble].

jakopako commented 3 months ago

from https://stackoverflow.com/questions/32419541/python-html-encoding-xc2-xa0

\xc2\xa0 means 0xC2 0xA0 is so-called

Non-breaking space

jakopako commented 3 months ago

working around this issue for now by just extracting the time, without Uhr.