Transparency-Consent-Framework-Research / consent-crawler

Crawlee/Playwright-based crawler for collecting TCF signals from websites
MIT License

WIP: Feature: CMP Handler V2 #4

Open antoniojtorres opened 1 year ago

antoniojtorres commented 1 year ago

Replaces the old banners array, which held the CMP handling code, with dedicated handlers that allow better specificity when dealing with banner variants for a given CMP. Adds a straightforward hook-based system in which each CMP has its own handler file, so handlers can be added and unit tested more efficiently.
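As a rough illustration of the hook-based layout described above, here is a minimal sketch of what a per-CMP handler and its dispatch loop might look like. Everything in it is hypothetical and not taken from this PR: the names `PageLike`, `BannerVariant`, `CmpHandler`, `createHandler`, `handleConsent`, and `fakePage`, the `ExampleCMP` handler, and the `#example-*` selectors are placeholders. `PageLike` is modeled on Playwright's `page.isVisible(selector)` and `page.click(selector)` so a real `Page` could be passed in, while a stub keeps unit tests browser-free.

```typescript
// Hypothetical sketch: none of these names come from the consent-crawler codebase.

// Minimal subset of a Playwright-style page that a handler needs.
interface PageLike {
  isVisible(selector: string): Promise<boolean>;
  click(selector: string): Promise<void>;
}

// One banner variant of a CMP: how to detect it and how to accept it.
interface BannerVariant {
  name: string;
  bannerSelector: string;
  acceptSelector: string;
}

// The hooks each CMP handler file would export.
interface CmpHandler {
  cmp: string;
  detect(page: PageLike): Promise<BannerVariant | undefined>;
  accept(page: PageLike, variant: BannerVariant): Promise<void>;
}

// Builds a handler from a list of variants; the first visible variant wins.
function createHandler(cmp: string, variants: BannerVariant[]): CmpHandler {
  return {
    cmp,
    async detect(page: PageLike): Promise<BannerVariant | undefined> {
      for (const variant of variants) {
        if (await page.isVisible(variant.bannerSelector)) return variant;
      }
      return undefined;
    },
    async accept(page: PageLike, variant: BannerVariant): Promise<void> {
      await page.click(variant.acceptSelector);
    },
  };
}

// Runs handlers in order and reports which CMP and variant was handled, if any.
async function handleConsent(
  page: PageLike,
  handlers: CmpHandler[],
): Promise<string | undefined> {
  for (const handler of handlers) {
    const variant = await handler.detect(page);
    if (variant !== undefined) {
      await handler.accept(page, variant);
      return `${handler.cmp}:${variant.name}`;
    }
  }
  return undefined;
}

// Placeholder handler with two banner variants (selectors are made up).
const exampleHandler = createHandler("ExampleCMP", [
  { name: "v1", bannerSelector: "#example-banner-v1", acceptSelector: "#example-accept-v1" },
  { name: "v2", bannerSelector: "#example-banner-v2", acceptSelector: "#example-accept-v2" },
]);

// Stub page for unit tests: reports the given selectors as visible and records clicks.
function fakePage(visible: string[]): PageLike & { clicked: string[] } {
  const clicked: string[] = [];
  return {
    clicked,
    async isVisible(selector: string): Promise<boolean> {
      return visible.includes(selector);
    },
    async click(selector: string): Promise<void> {
      clicked.push(selector);
    },
  };
}
```

Keeping variants as data inside each handler is one way to get the per-variant specificity mentioned above: supporting a new banner layout means adding an entry, and each handler can be exercised against `fakePage` without launching a browser.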

There's a lot of work to do to test reliability and ensure consistent handling across hundreds of thousands of publishers.

| CMP | Link | Status |
| --- | --- | --- |
| Quantcast | Website | 🔴 |
| Civic | Website | 🟡 |
| Cookiebot | Website | 🟡 |
| Cookie Information | Website | 🟡 |
| Didomi | Website | 🟡 |
| MIS GmbH | Website | 🟡 |
| Ogury | Website | 🟡 |
| OneTrust | Website | 🟡 |
| ShareThis | Website | 🟡 |
| ShinyStat | Website | 🟡 |
| Sibbo | Website | 🟡 |
| Transfon | Website | 🟡 |
| TrustArc | Website | 🟡 |

🟢 = Tested and Ready, 🟡 = Awaiting Testing, 🔴 = Working on known issue

Looking for at least 10 passing examples per CMP