Closed by ivan 6 years ago
Tumblr images too.
This seems similar to https://github.com/ludios/grab-site/issues/88 to me...
Similarly, pages with query strings could also be queued with the query string removed (e.g. /index.php?foo=bar becoming /index.php).
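A minimal sketch of that idea, using only the Python standard library. The function name `query_stripped_variants` is hypothetical and not part of grab-site or wpull; it only shows how the extra URL could be derived before being handed to whatever queues it.

```python
from urllib.parse import urlsplit


def query_stripped_variants(url):
    """Return a list with the query-less form of `url`, or [] if it has no query.

    Hypothetical helper for illustration; not grab-site's actual code.
    """
    parts = urlsplit(url)
    if not parts.query:
        # Nothing to strip, so there is no additional URL to queue.
        return []
    # SplitResult is a namedtuple, so _replace() returns a modified copy.
    return [parts._replace(query="").geturl()]
```

For example, `query_stripped_variants("/index.php?foo=bar")` yields a single entry, `/index.php`, while a URL without a query string yields nothing extra.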
I tried implementing this in branch get-urls-hook, but when wpull gets to the :orig URL in the queue, it just puts it into the skipped state, for reasons unknown to me.
URLs were getting skipped because of grab-site's --no-parent combined with the lack of inline=True in the hook.
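Roughly what the fix looks like, as a sketch rather than the code from the commit. The URL pattern for Twitter media, the function name `extra_urls_for`, and the exact dict shape (`url` plus `inline`) are assumptions for illustration; the point is only that the additionally queued :orig URL is marked inline so that --no-parent does not cause it to be skipped.

```python
import re

# Assumed shape of a Twitter media URL, optionally ending in a size
# suffix such as ":large". This pattern is an illustration only.
TWIMG_RE = re.compile(r"^(https?://pbs\.twimg\.com/media/[^:?#/]+)(?::[a-z]+)?$")


def extra_urls_for(url):
    """Return extra URL entries to queue for `url` (hypothetical helper)."""
    match = TWIMG_RE.match(url)
    if not match or url.endswith(":orig"):
        # Not a recognized media URL, or already the :orig variant.
        return []
    return [{
        "url": match.group(1) + ":orig",
        # Marked inline so the URL is treated like a page requisite
        # instead of being rejected by the --no-parent restriction.
        "inline": True,
    }]
```

A real hook would return these entries from wpull's get_urls callback; the callback signature is deliberately left out here because it differs between wpull versions.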
Implemented for Twitter and Quora in 0ea3d4093860ac526ea5e2d8c591ea31df3ccd44.
I didn't want to deal with Flickr (it looks like it uses a different secret for the original image anyway), but I would take a PR for it (to get the largest non-original image?).
Thank you @ivan! 👏👏👏
Are there any other obvious URLs to additionally queue when we see certain URLs on various websites? Suggestions welcome.