hoarder-app / hoarder

A self-hostable bookmark-everything app (links, notes and images) with AI-based automatic tagging and full text search
https://hoarder.app
GNU Affero General Public License v3.0
6.48k stars 235 forks source link

Crawl / store pages that are behind a login #607

Closed Ronaldvr closed 3 weeks ago

Ronaldvr commented 3 weeks ago

Describe the feature you'd like

It seems possible if I search the web: I have several pages that are behind a login, and these cannot be crawled without providing the necessary credentials.

Describe the benefits this would bring to existing Hoarder users

You can store and keep the content even when you login expires, lapses, or the site itself stops

Can the goal of this request already be achieved via other means?

Perhaps: The previously mentioned SingleFile can be used, but then hoarder has to support those html files as input.

Have you searched for an existing open/closed issue?

Additional context

No response

MohamedBassem commented 3 weeks ago

Duplicate of #172