hoarder-app / hoarder

A self-hostable bookmark-everything app (links, notes and images) with AI-based automatic tagging and full text search
https://hoarder.app
GNU Affero General Public License v3.0
6.36k stars, 227 forks

Bypassing cookie and GDPR banner #414

Open Dwelled2593 opened 1 month ago

Dwelled2593 commented 1 month ago

When I use Hoarder on a YouTube link, the crawler gets stuck on the cookie banner. Any idea how to solve this?

image

CrypticC3s4r commented 1 month ago

@Dwelled2593 can you provide some example links?

pix commented 1 month ago

It's a Cookies / GDPR notice:

image

From: https://www.youtube.com/watch?v=E-5b1iGNraM (probably an EU thing)

vhsdream commented 3 days ago

Can confirm this also happens with other types of sites that have notices a human has to click - for instance this article from the New York Times.

As a slightly humorous (and kind of irritating) aside, when I asked my local LLM (Llama3.2) to summarize the article, this was its response:

NYT

Edit: forgot to show how that news link appears:

image