salsadigitalauorg / merlin-framework

Merlin - migration framework
GNU General Public License v3.0

When there are redirects to an external site, the effective URLs file still saves the external URL as relative #138

Closed: gargsuchi closed this issue 3 years ago

gargsuchi commented 3 years ago

Describe the bug: When I crawl a site and a page redirects to an external site, I get a result like this:

url_original: https://4503abd5-06ad-470e-b766-9e7f35f6551c.sites.quantcdn.io/about/key-staff/chief-paramedic-officer
url_effective: https://www.bettersafercare.vic.gov.au/about-us/about-scv/our-leadership-team/adj-assoc-prof-alan-eade-asm

The effective URLs file lists the URL as /about-us/about-scv/our-leadership-team/adj-assoc-prof-alan-eade-asm, with the external host stripped, even though this path does not exist on the original site.
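To illustrate the expected behaviour, here is a minimal sketch in Python (not Merlin's actual code, which is PHP). The function name `effective_url_for_output` is hypothetical. It assumes the effective URL should be written as a relative path only when its host matches the original URL's host, and kept absolute otherwise.

```python
from urllib.parse import urlsplit


def effective_url_for_output(url_original: str, url_effective: str) -> str:
    """Hypothetical helper: decide how an effective URL should be saved.

    Assumption: a redirect target on a different host is "external" and
    must stay absolute; a same-host target can be stored as a relative path.
    """
    orig = urlsplit(url_original)
    eff = urlsplit(url_effective)

    # Different host: keep the full URL so the external target is not lost.
    if eff.netloc and eff.netloc.lower() != orig.netloc.lower():
        return url_effective

    # Same host: store the path (plus query string, if any) as relative.
    relative = eff.path or "/"
    if eff.query:
        relative += "?" + eff.query
    return relative
```

With the URLs from this report, the helper would return the full https://www.bettersafercare.vic.gov.au/... address rather than the bare path, because the two hosts differ.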

derklempner commented 3 years ago

This one has been fixed in the feature/group-type branch.