gleanerio / gleaner

Gleaner: JSON-LD and structured data on the web harvesting
https://gleaner.io
Apache License 2.0
17 stars 10 forks source link

Headless bad context's cause documents to not be stored #220

Open valentinedwv opened 1 year ago

valentinedwv commented 1 year ago

Headless might fix contexts, or do we need to do more than just test if it is valid JSON?

Headless is valid

json method

causing zero data loading

{"contentType":"script[type='application/ld+json']","file":"/github/workspace/internal/summoner/acquire/acquire.go:245","func":"github.com/gleanerio/gleaner/internal/summoner/acquire.FindJSONInResponse.func1","level":"error","msg":"Error processing script tag in https://data.ucar.edu/dataset/flight-tracks-google-earth-kml-files26error checking for valid json: Error in JSON-LD to RDF call: keyword redefinition: @type","time":"2023-06-27T12:15:54-05:00","url":"https://data.ucar.edu/dataset/flight-tracks-google-earth-kml-files26"}