hoarder-app / hoarder

A self-hostable bookmark-everything app (links, notes and images) with AI-based automatic tagging and full text search
https://hoarder.app
GNU Affero General Public License v3.0
2.25k stars 74 forks source link

Function suggestions, webpage snapshot #229

Open hongruilin opened 1 week ago

hongruilin commented 1 week ago

Sometimes we encounter such a situation: we suddenly see an interesting article and want to read it after work, but when we have time to read it after work, the article has already been deleted. If there is a web snapshot that can be turned on or off according to user needs, when it is turned on, it can cache a bookmarked page, so that even if the original article is deleted, it can still be viewed. This seems very useful

kamtschatka commented 1 week ago

There are multiple options already:

The configuration can be seen here: https://docs.hoarder.app/configuration

Is your suggestion to any of the available suggestions?

hongruilin commented 1 week ago

There are multiple options already:

  • Extracting the content. Does not work perfectly all the time
  • Taking a screenshot. Some articles need scrolling, so it is possible to enable full page screenshot (as in: while taking the screenshot it will scroll all the way down). See CRAWLER_STORE_SCREENSHOT and CRAWLER_FULL_PAGE_SCREENSHOT
  • Saving an archive: This stores the full page as an archive for you to load later. See CRAWLER_FULL_PAGE_ARCHIVE

The configuration can be seen here: https://docs.hoarder.app/configuration

Is your suggestion to any of the available suggestions?

I hope to cache only some label copies

kamtschatka commented 1 week ago

I am sorry, I don't understand what that means, can you please go into more details or show an example or something like that?

hongruilin commented 1 week ago

I am sorry, I don't understand what that means, can you please go into more details or show an example or something like that?

For example, there are a total of 4 bookmarks, and we can choose to cache only the web page copy of bookmark a