RangerMauve / distributed-press-clone-site

Clone an existing website to Distributed Press using wget2
MIT License

External links get routed to root #3

Open benhylau opened 1 week ago

benhylau commented 1 week ago

The Newsletter link in the menu points to the home page, while the other menu links work fine.

[Screenshot, 2024-09-10 at 4:43:53 PM]

The same thing happens in the footnote.

[Image]

This is likely because the link points to a subdomain: https://touchgrass.fightforthefuture.org/

Similarly, the "Recent Projects" links don't work; they all link back to home. On the official site they all point to external domains (not the Fight domain or its subdomains), so the script somehow replaced those external domains with the IPNS URL.

[Screenshot, 2024-09-10 at 4:46:56 PM]

I think there needs to be a list of domains that get crawled and replaced. If a link's domain is not on that list, the script should leave it untouched.
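To make the allowlist idea concrete, here is a rough sketch of what I mean. It is not the script's actual code: the function name `rewrite_link`, the host `www.fightforthefuture.org`, and the base `ipns://example-site` are placeholders I made up for illustration.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist: only links to these hosts were crawled,
# so only these get rewritten. Everything else is left alone.
CRAWLED_DOMAINS = {"www.fightforthefuture.org"}


def rewrite_link(url: str, crawled_domains: set, new_base: str) -> str:
    """Rewrite url onto new_base only if its host is on the allowlist."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    if host not in crawled_domains:
        # Subdomains, external sites, and relative links pass through unchanged.
        return url
    rewritten = new_base.rstrip("/") + (parts.path or "/")
    if parts.query:
        rewritten += "?" + parts.query
    if parts.fragment:
        rewritten += "#" + parts.fragment
    return rewritten
```

With this, the touchgrass subdomain link would stay as it is unless that subdomain is explicitly added to the list.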

RangerMauve commented 1 week ago

We may have success by specifying the --domains flag in wget. Will test tomorrow based on this SO answer: https://unix.stackexchange.com/a/110404
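For reference, a rough sketch of the invocation I have in mind, written as a Python helper that only builds the argument list. The helper name `build_wget_args` is made up; the flags are standard wget options that I am assuming wget2 accepts the same way, and none of this has been tested against this issue yet.

```python
def build_wget_args(start_url: str, domains: list) -> list:
    """Build a wget2 argument list restricted to the given domains (untested sketch)."""
    return [
        "wget2",
        "--mirror",            # recursive download with timestamping
        "--page-requisites",   # also fetch images, CSS, and scripts
        "--convert-links",     # rewrite links to downloaded files for local viewing
        "--span-hosts",        # --domains does not enable host spanning by itself
        "--domains=" + ",".join(domains),  # only follow links to these domains
        start_url,
    ]
```

The resulting list could be passed to `subprocess.run`. Whether this actually keeps external links intact is what needs testing.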