jakopako / goskyr

A configurable command-line web scraper written in go with auto configuration capability
GNU General Public License v3.0

field supposedly empty?! #280

Closed jakopako closed 4 months ago

jakopako commented 4 months ago

When scraping HuxleysNeueWelt (see https://github.com/jakopako/croncert-config/blob/main/config/berlin.yml#L48), we get many errors like this:

2024/02/24 16:48:08 HuxleysNeueWelt ERROR: error while parsing field title: field title cannot be empty. Skipping item map[city:Berlin country:Germany location:Huxleys Neue Welt sourceUrl:https://huxleysneuewelt.com/shows type:concert].

Investigate why. At first sight I couldn't see anything wrong with the config or the website that would lead to those error messages.

jakopako commented 4 months ago

Not a bug. The value of item in the HuxleysNeueWelt scraper config was wrong. It matched items whose HTML subtrees contained nothing apart from an a node, so there was no title to extract from them.
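To illustrate the failure mode, here is a minimal sketch in Go. It is not goskyr's code and does not use the real markup of the site: the HTML fragment, the class names (event, teaser, title) and the helper scrapeTitles are all hypothetical, and the fragment is parsed with encoding/xml only because it is well-formed. It shows how an item selector that matches a wrapper around a bare a node yields items with an empty title, while a selector one level up finds the titles.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"strings"
)

// page is a hypothetical, well-formed fragment (NOT the real site markup).
const page = `<ul>
  <li class="event"><div class="teaser"><a href="/shows/1"></a></div><span class="title">Band A</span></li>
  <li class="event"><div class="teaser"><a href="/shows/2"></a></div><span class="title">Band B</span></li>
</ul>`

// node is a generic element tree, just enough for this illustration.
type node struct {
	XMLName  xml.Name
	Attrs    []xml.Attr `xml:",any,attr"`
	Text     string     `xml:",chardata"`
	Children []node     `xml:",any"`
}

// hasClass reports whether the class attribute of n equals class exactly.
func (n node) hasClass(class string) bool {
	for _, a := range n.Attrs {
		if a.Name.Local == "class" && a.Value == class {
			return true
		}
	}
	return false
}

// find collects n and all of its descendants that carry the given class.
func find(n node, class string, out []node) []node {
	if n.hasClass(class) {
		out = append(out, n)
	}
	for _, c := range n.Children {
		out = find(c, class, out)
	}
	return out
}

// text returns the trimmed text of n and all of its descendants.
func text(n node) string {
	var b strings.Builder
	var walk func(node)
	walk = func(m node) {
		b.WriteString(m.Text)
		for _, c := range m.Children {
			walk(c)
		}
	}
	walk(n)
	return strings.TrimSpace(b.String())
}

// scrapeTitles treats every element with class itemClass as one item and
// looks for a "title" element inside it. Items without a title are skipped,
// mirroring the "field title cannot be empty" behaviour from the log.
func scrapeTitles(doc, itemClass string) (titles []string, skipped int, err error) {
	var root node
	if err = xml.Unmarshal([]byte(doc), &root); err != nil {
		return nil, 0, err
	}
	for _, item := range find(root, itemClass, nil) {
		title := ""
		if t := find(item, "title", nil); len(t) > 0 {
			title = text(t[0])
		}
		if title == "" {
			skipped++
			continue
		}
		titles = append(titles, title)
	}
	return titles, skipped, nil
}

func main() {
	// Wrong item selector: each matched subtree holds only an a node.
	fmt.Println(scrapeTitles(page, "teaser"))
	// Corrected item selector: each matched subtree also holds the title.
	fmt.Println(scrapeTitles(page, "event"))
}
```

With the hypothetical teaser selector every item is skipped. With event both titles are found, which matches the observation that the config's item value, not the scraper itself, was at fault.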