gleanerio / gleaner

Gleaner: JSON-LD and structured data on the web harvesting
https://gleaner.io
Apache License 2.0
17 stars 10 forks source link

Log bad or duplicate identifiers to a file #252

Open valentinedwv opened 6 months ago

valentinedwv commented 6 months ago

example from r2r: @id: doi:null

This is intentional on their part. These are not 'published' need to confirm.

Occurs on a large number of files.

sitemap_count": 53386,
  "summoned_count": 44401,
  "missing_sitemap_summon_count": 9009,

so need a :