Description
After a merlin run, several report files are generated, but they have to be pieced together and analysed manually before they give any useful "at a glance" information.
Proposed solution
Generate a consolidated report (potentially HTML/JSON) that contains the following (Crawler):
Total time taken
Total number of valid URLs detected
Number of URLs found per group
Duplicates found (with listing)
Server response errors (with associated URLs), i.e. any response other than 200/301/302
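
To make the crawler fields above concrete, here is a minimal sketch of how such a summary could be assembled. It is illustrative only: the record shape (`url`, `group`, `status`), the function name `summarise_crawl`, and the output key names are assumptions, not merlin's actual data format or API. It also assumes a URL counts as "valid" when its response is 200, 301 or 302, and that a duplicate is a valid URL seen more than once.

```python
import json
from collections import Counter

# Assumption: these are the only statuses treated as non-errors.
ACCEPTED_STATUSES = {200, 301, 302}


def summarise_crawl(records, elapsed_seconds):
    """Build an at-a-glance crawl summary.

    `records` is a hypothetical list of dicts with keys
    'url', 'group' and 'status' (one dict per crawled URL).
    """
    first_seen_group = {}  # unique valid URL -> group it was first found in
    duplicates = set()     # valid URLs encountered more than once
    errors = []            # responses outside ACCEPTED_STATUSES

    for record in records:
        url, status = record["url"], record["status"]
        if status not in ACCEPTED_STATUSES:
            errors.append({"url": url, "status": status})
        elif url in first_seen_group:
            duplicates.add(url)
        else:
            first_seen_group[url] = record["group"]

    return {
        "total_time_seconds": elapsed_seconds,
        "total_valid_urls": len(first_seen_group),
        "urls_per_group": dict(Counter(first_seen_group.values())),
        "duplicates": sorted(duplicates),
        "response_errors": errors,
    }


# Made-up sample data, purely for illustration.
sample_records = [
    {"url": "/news/a", "group": "news", "status": 200},
    {"url": "/news/b", "group": "news", "status": 200},
    {"url": "/news/a", "group": "news", "status": 200},
    {"url": "/docs/c", "group": "docs", "status": 404},
    {"url": "/docs/d", "group": "docs", "status": 301},
]
crawl_report = summarise_crawl(sample_records, elapsed_seconds=12.5)
crawl_report_json = json.dumps(crawl_report, indent=2)
```

Returning a plain dictionary keeps the JSON output trivial (`json.dumps`) and leaves an HTML view as a separate rendering step over the same data.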
Generate a consolidated report (potentially HTML/JSON) that contains the following (Scraper):
Total time taken
Total number of pages scraped
Pages that failed mandatory checks (if any)
Server response errors (with associated URLs), i.e. any response other than 200/301/302
Total number of associated assets (documents, images)
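
The scraper fields above could be gathered in the same way. Again this is a sketch under stated assumptions, not merlin's real output: the page record shape (`url`, `status`, `failed_checks`, `assets`), the function name `summarise_scrape`, and the output keys are all hypothetical. A page is counted as scraped only if its response was 200, 301 or 302, and assets are counted as listed, without de-duplication across pages.

```python
import json
from collections import Counter

# Assumption: these are the only statuses treated as non-errors.
ACCEPTED_STATUSES = {200, 301, 302}


def summarise_scrape(pages, elapsed_seconds):
    """Build an at-a-glance scrape summary.

    `pages` is a hypothetical list of dicts with keys:
      'url'           - page URL
      'status'        - HTTP status code of the response
      'failed_checks' - names of mandatory checks that failed (may be empty)
      'assets'        - mapping of asset type -> list of asset paths
    """
    scraped = 0
    failed = []                # pages with at least one failed mandatory check
    errors = []                # responses outside ACCEPTED_STATUSES
    assets_by_type = Counter() # asset type -> number of assets found

    for page in pages:
        if page["status"] not in ACCEPTED_STATUSES:
            errors.append({"url": page["url"], "status": page["status"]})
            continue
        scraped += 1
        if page.get("failed_checks"):
            failed.append({"url": page["url"], "checks": list(page["failed_checks"])})
        for asset_type, items in page.get("assets", {}).items():
            assets_by_type[asset_type] += len(items)

    return {
        "total_time_seconds": elapsed_seconds,
        "total_pages_scraped": scraped,
        "failed_mandatory_checks": failed,
        "response_errors": errors,
        "assets_by_type": dict(assets_by_type),
        "total_assets": sum(assets_by_type.values()),
    }


# Made-up sample data, purely for illustration.
sample_pages = [
    {"url": "/a", "status": 200, "failed_checks": [],
     "assets": {"documents": ["x.pdf"], "images": ["1.png", "2.png"]}},
    {"url": "/b", "status": 200, "failed_checks": ["title"],
     "assets": {"images": ["3.png"]}},
    {"url": "/c", "status": 500, "failed_checks": [], "assets": {}},
]
scrape_report = summarise_scrape(sample_pages, elapsed_seconds=40.0)
scrape_report_json = json.dumps(scrape_report, indent=2)
```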
Additional context
This would be useful as part of the bigger-picture goal of giving confidence that a final migration contains everything it should.