sissbruecker / linkding

Self-hosted bookmark manager that is designed be to be minimal, fast, and easy to set up using Docker.
https://linkding.link/
MIT License
6.78k stars 322 forks source link

Allow saving HTML snapshots of Cloudflare-Protected Websites #712

Open tastycroissant opened 7 months ago

tastycroissant commented 7 months ago

Feature Request: Integration with Flaresolverr for Archiving Cloudflare-Protected Websites

Description: Currently, Linkding offers a convenient way to archive links in HTML format for revisiting later. However, a significant number of websites are protected by Cloudflare, preventing Linkding from capturing snapshots of these pages. This limitation hampers the completeness of the archiving process, especially considering the widespread use of Cloudflare protection.

Proposal: I suggest integrating Flaresolverr into Linkding to overcome this issue. Flaresolverr is a service designed specifically to bypass Cloudflare protection, allowing access to websites that would otherwise be inaccessible due to Cloudflare's security measures. By incorporating Flaresolverr into Linkding's archiving process, users would gain the ability to capture snapshots of Cloudflare-protected websites seamlessly.

Benefits:

Implementation: The integration with Flaresolverr could be implemented as an optional feature within Linkding's settings. Users could enable or disable the Flaresolverr integration based on their preferences. Additionally, clear documentation and user guidance should be provided to ensure ease of use and understanding.

Thank you for considering this feature request.

sissbruecker commented 7 months ago

This is definitely an issue, and integrating with a solution like this could be an option. Then again it looks like it would need to be a more involved setup, as flaresolverr would ideally run in its own Docker container. At least from a quick look they rely on a specific Chrome / Chromium version that might not be compatible with what linkding installs, and then they don't support ARM architectures with standalone installs / builds, which linkding needs.

Another option could be to make this work through the browser extension somehow. When the browser extension runs you already have passed all Cloudflare checks, so there might be an option to just get the HTML though the extension and then make single-file work with that instead of trying to load the HTML through the URL on its own. Maybe it would also be possible to integrate the single-file browser extension with the linkding extension so that the full snapshot can already be captured in the browser.

Edit: Modified the title to reflect the problem that needs solving, rather than one of the possible options to solve it.

timthinks commented 2 weeks ago

This would be really great! Is there any update on this? Thank you for all the work!!!!