ukwa / w3act

w3act is an annotation and curation tool for building web archive collections
Apache License 2.0
19 stars 6 forks source link

Crawl log link from ACT is not completely functional #670

Closed emacglone closed 1 year ago

emacglone commented 2 years ago

Links to crawl logs open, but do not resolve to anything for a while, before returning a 504 Gateway Time-out.

anjackson commented 2 years ago

That older systems is being replaced with something that will hopefully be more reliable. Linking from W3ACT will be implemented in #667

It's not super easy to use directly, but for example, can be filtered by web host:

https://www.webarchive.org.uk/act/grafana/d/67xk-317z/recent-crawler-activity?orgId=1&refresh=1m&var-Filters=crawled.host.keyword%7C%3D%7Cwww.bl.uk&from=now-2d&to=now

anjackson commented 1 year ago

Closing this as the #667 ticket covers the solution.