Open philipp-classen opened 4 months ago
Example request that gets misclassified (should be Google reCAPTCHA):
https://www.google.com/recaptcha/enterprise/anchor?ar=1&k=6LeBpaMpAAAAAAfN9Uz3t0FX0Joj88i7kXwYoHFc&co=aHR0cHM6Ly90cmFuc2Zvcm13by5jb206NDQz&hl=en&v=vjbW55W42X033PfTdVf6Ft4q&size=invisible&cb=12i59iyptwhn
But it is currently matched as:
{
  "url": "https://www.google.com/recaptcha/enterprise/anchor?ar=1&k=6LeBpaMpAAAAAAfN9Uz3t0FX0Joj88i7kXwYoHFc&co=aHR0cHM6Ly90cmFuc2Zvcm13by5jb206NDQz&hl=en&v=vjbW55W42X033PfTdVf6Ft4q&size=invisible&cb=12i59iyptwhn",
  "matches": [
    {
      "pattern": {
        "key": "google",
        "name": "Google",
        "category": "advertising",
        "organization": "google",
        "alias": null,
        "website_url": "https://www.google.com/",
        "ghostery_id": "3579",
        "domains": [
          "google.at", "google.be", "google.ca", "google.ch", "google.co.id",
          "google.co.in", "google.co.jp", "google.co.ma", "google.co.th",
          "google.co.uk", "google.com", "google.com.ar", "google.com.au",
          "google.com.br", "google.com.mx", "google.com.tr", "google.com.tw",
          "google.com.ua", "google.cz", "google.de", "google.dk", "google.dz",
          "google.es", "google.fi", "google.fr", "google.gr", "google.hu",
          "google.ie", "google.it", "google.nl", "google.no", "google.pl",
          "google.pt", "google.ro", "google.rs", "google.ru", "google.se",
          "google.tn"
        ],
        "filters": []
      },
      "category": {
        "key": "advertising",
        "name": "Advertising",
        "color": "#cb55cd",
        "description": "Advertising services that utilize data collection, behavioral analysis, and user retargeting."
      },
      "organization": {
        "key": "google",
        "name": "Google",
        "description": "Google LLC is an American multinational technology company that specializes in Internet-related services and products, which include online advertising technologies, search engine, cloud computing, software, and hardware.",
        "website_url": "https://www.google.com",
        "country": "US",
        "privacy_policy_url": "https://www.google.com/intl/en/policies/privacy/",
        "privacy_contact": "https://www.google.com/contact/",
        "ghostery_id": "82"
      }
    }
  ]
}
The following request, which is triggered by the reCAPTCHA script, is also classified as Google:
https://www.google.com/js/bg/R158mP-HER8cF-2W1d4Zs3A-8309t2iBf9rXxsmuGOY.js
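To illustrate the expected behavior, here is a minimal sketch of a matcher in which a path-specific rule wins over a domain-only rule. It is not the project's actual matching code: the rule table, the `google_recaptcha` key, and the `classify` function are hypothetical names made up for this example.

```python
from urllib.parse import urlsplit

# Hypothetical rule table: (pattern key, registrable domain, path prefix).
# An empty prefix means "any path on this domain".
RULES = [
    ("google_recaptcha", "google.com", "/recaptcha/"),
    ("google", "google.com", ""),
]


def classify(url):
    """Return the key of the most specific matching rule, or None."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    best = None  # (key, prefix) of the best match seen so far
    for key, domain, prefix in RULES:
        host_ok = host == domain or host.endswith("." + domain)
        if host_ok and parts.path.startswith(prefix):
            # Prefer the rule with the longest (most specific) path prefix.
            if best is None or len(prefix) > len(best[1]):
                best = (key, prefix)
    return best[0] if best else None


anchor_url = (
    "https://www.google.com/recaptcha/enterprise/anchor"
    "?ar=1&k=6LeBpaMpAAAAAAfN9Uz3t0FX0Joj88i7kXwYoHFc&size=invisible"
)
print(classify(anchor_url))
print(classify("https://www.google.com/search?q=test"))
```

In this sketch, the `/js/bg/` script from the issue would still fall through to the generic `google` rule. Whether that path is specific enough to reCAPTCHA to get its own rule is left open here.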