ghostery / trackerdb

Ghostery Tracker Database

[wallee]: #266

Closed 1204Yoshi closed 5 months ago

1204Yoshi commented 6 months ago

Request

https://www.google.com/recaptcha/api.js

Location

https://www.redgirasol.com/

Tracker Company

Google

Company Website

https://www.google.com/

Privacy Policy

No response

Describe the company

No response

Category

Site Analytics

Request Type

None

Additional Information

No response

Filters

No response

philipp-classen commented 6 months ago

With Ghostery 8, I see Google Analytics and Recaptcha requests (on https://www.redgirasol.com/rendimiento) listed under "wallee":

wallee

My expectation would have been that they are listed under Google Analytics and Recaptcha. How they match "wallee" is not clear to me. The entry is here: https://github.com/ghostery/trackerdb/blob/3ca21ccc9947bc1a7b6756ba912d7a8a2a4f8956/db/patterns/wallee.eno#L7

It only contains the domain "app-wallee.com", but none of the requests have that domain.

Looks like a bug in Ghostery 8. (Tested with Ghostery 8 on Chromium)
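
The domain check described above can be reproduced with a minimal sketch. It assumes that a pattern domain matches a request when the request hostname equals the domain or is a subdomain of it; this is a simplification for illustration, not the extension's actual matching logic, and the helper name `hostname_matches_domain` is hypothetical. The shortened analytics URL stands in for the full request.

```python
from urllib.parse import urlsplit


def hostname_matches_domain(url: str, domain: str) -> bool:
    """Return True if the URL's hostname is `domain` or one of its subdomains.

    Hypothetical helper: a simplified stand-in for domain-based matching.
    """
    host = (urlsplit(url).hostname or "").lower()
    domain = domain.lower()
    return host == domain or host.endswith("." + domain)


# Requests mentioned in this thread (query string shortened for readability).
requests_seen = [
    "https://www.google.com/recaptcha/api.js",
    "https://region1.analytics.google.com/g/collect?v=2",
    "https://www.gstatic.com/recaptcha/releases/rz4DvU-cY2JYCwHSTck0_qm-/recaptcha__en.js",
]

# The wallee entry only lists "app-wallee.com", so no request should match it.
wallee_hits = [u for u in requests_seen if hostname_matches_domain(u, "app-wallee.com")]
print("Requests matching app-wallee.com:", wallee_hits)
```

Under that assumption, none of the requests can be attributed to "wallee" by domain, which is consistent with the suspicion that the attribution comes from a bug rather than from the database entry.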

1204Yoshi commented 6 months ago

image

philipp-classen commented 6 months ago

Thanks for finding that! It looks like there is a bug in our matching logic. We need to investigate.

philipp-classen commented 6 months ago

The trackerdb API (@ghostery/trackerdb@1.0.121) attributes it correctly:

npx @ghostery/trackerdb "https://region1.analytics.google.com/g/collect?v=2&tid=G-QLYB1PK7RN&_ng=1&gtm=45je44f0v885777672z8894232443za200&_p=1713378081823&gcd=13l3l3l2l1&npa=1&dma_cps=sypham&dma=1&cid=1609738265.1713374177&ul=en-us&sr=1920x1200&uaa=x86&uab=64&uafvl=Not-A.Brand%3B99.0.0.0%7CChromium%3B124.0.6367.49&uamb=0&uam=&uap=Linux&uapv=6.8.4&uaw=0&pscdl=noapi&_s=1&sid=1713377776&sct=2&seg=1&dl=https%3A%2F%2Fwww.redgirasol.com%2Frendimiento&dt=RedGirasol&en=page_view&tfd=679"

->

{
  "url": "https://region1.analytics.google.com/g/collect?v=2&tid=G-QLYB1PK7RN&_ng=1&gtm=45je44f0v885777672z8894232443za200&_p=1713378081823&gcd=13l3l3l2l1&npa=1&dma_cps=sypham&dma=1&cid=1609738265.1713374177&ul=en-us&sr=1920x1200&uaa=x86&uab=64&uafvl=Not-A.Brand%3B99.0.0.0%7CChromium%3B124.0.6367.49&uamb=0&uam=&uap=Linux&uapv=6.8.4&uaw=0&pscdl=noapi&_s=1&sid=1713377776&sct=2&seg=1&dl=https%3A%2F%2Fwww.redgirasol.com%2Frendimiento&dt=RedGirasol&en=page_view&tfd=679",
  "matches": [
    {
      "pattern": {
        "key": "google_signals",
        "name": "Google Analytics with signals",
        "category": "advertising",
        "organization": "google",
        "alias": null,
        "website_url": "https://support.google.com/analytics/answer/9445345?hl=en#zippy=%2Cin-this-article",
        "ghostery_id": null,
        "domains": [
          "analytics.google.com"
        ],
        "filters": [
          "||analytics.google.com^$third-party",
          "||region1.analytics.google.com^$third-party"
        ]
      },
      "category": {
        "key": "advertising",
        "name": "Advertising",
        "color": "#cb55cd",
        "description": "Advertising services that utilize data collection, behavioral analysis, and user retargeting."
      },
      "organization": {
        "key": "google",
        "name": "Google",
        "description": "Google LLC is an American multinational technology company that specializes in Internet-related services and products, which include online advertising technologies, search engine, cloud computing, software, and hardware.",
        "website_url": "https://www.google.com",
        "country": "US",
        "privacy_policy_url": "https://www.google.com/intl/en/policies/privacy/",
        "privacy_contact": "https://www.google.com/contact/",
        "ghostery_id": "82"
      }
    }
  ]
}

And also

npx @ghostery/trackerdb "https://www.gstatic.com/recaptcha/releases/rz4DvU-cY2JYCwHSTck0_qm-/recaptcha__en.js"

->

{
  "url": "https://www.gstatic.com/recaptcha/releases/rz4DvU-cY2JYCwHSTck0_qm-/recaptcha__en.js",
  "matches": [
    {
      "pattern": {
        "key": "google_recaptcha",
        "name": "Google Recaptcha",
        "category": "essential",
        "organization": "google",
        "alias": null,
        "website_url": "https://www.google.com/recaptcha/about/",
        "ghostery_id": null,
        "domains": [
          "recaptcha.net"
        ],
        "filters": [
          "||recaptcha.net/recaptcha/",
          "||google.com/recaptcha/$frame,script",
          "||gstatic.*/recaptcha/"
        ]
      },
      "category": {
        "key": "essential",
        "name": "Essential",
        "color": "#fc9734",
        "description": "Site requests that may be critical to website functionality, such as tag managers and privacy notices."
      },
      "organization": {
        "key": "google",
        "name": "Google",
        "description": "Google LLC is an American multinational technology company that specializes in Internet-related services and products, which include online advertising technologies, search engine, cloud computing, software, and hardware.",
        "website_url": "https://www.google.com",
        "country": "US",
        "privacy_policy_url": "https://www.google.com/intl/en/policies/privacy/",
        "privacy_contact": "https://www.google.com/contact/",
        "ghostery_id": "82"
      }
    },
    {
      "pattern": {
        "key": "gstatic",
        "name": "Google Static",
        "category": "cdn",
        "organization": "google",
        "alias": null,
        "website_url": null,
        "ghostery_id": "3588",
        "domains": [
          "gstatic.com"
        ],
        "filters": []
      },
      "category": {
        "key": "cdn",
        "name": "CDN",
        "color": "#4ecba1",
        "description": "Content Delivery Networks (CDNs) distribute resources for different site utilities, and collect user analytics and tracking data."
      },
      "organization": {
        "key": "google",
        "name": "Google",
        "description": "Google LLC is an American multinational technology company that specializes in Internet-related services and products, which include online advertising technologies, search engine, cloud computing, software, and hardware.",
        "website_url": "https://www.google.com",
        "country": "US",
        "privacy_policy_url": "https://www.google.com/intl/en/policies/privacy/",
        "privacy_contact": "https://www.google.com/contact/",
        "ghostery_id": "82"
      }
    }
  ]
}

Yet in the Ghostery extension, the requests ended up being listed as wallee.
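
To compare the CLI attribution with what the extension displays, the JSON output can be reduced to the matched pattern keys. The following is a rough sketch that assumes the output shape shown above (`matches[].pattern.key`, `.category`, `.organization`); the input is a trimmed copy of the second result, and `summarize_matches` is a hypothetical helper, not part of `@ghostery/trackerdb`.

```python
import json

# Trimmed copy of the CLI output shown above (only the fields used here).
cli_output = """
{
  "url": "https://www.gstatic.com/recaptcha/releases/rz4DvU-cY2JYCwHSTck0_qm-/recaptcha__en.js",
  "matches": [
    {"pattern": {"key": "google_recaptcha", "category": "essential", "organization": "google"}},
    {"pattern": {"key": "gstatic", "category": "cdn", "organization": "google"}}
  ]
}
"""


def summarize_matches(raw: str) -> list[tuple[str, str, str]]:
    """Extract (pattern key, category, organization) for every match."""
    result = json.loads(raw)
    return [
        (m["pattern"]["key"], m["pattern"]["category"], m["pattern"]["organization"])
        for m in result["matches"]
    ]


summary = summarize_matches(cli_output)
for key, category, organization in summary:
    print(f"{key}: category={category}, organization={organization}")

# None of the matched patterns is "wallee", unlike what the extension displayed.
matched_keys = [key for key, _, _ in summary]
```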

philipp-classen commented 6 months ago

A fix should come with the next release of Ghostery (https://github.com/ghostery/ghostery-extension/pull/1544). I can confirm locally that it fixes the problem for me.

cookiefirst commented 6 months ago

Our consent management software, CookieFirst, has also been classified as wallee and not as the tracker that we submitted before.

CleanShot 2024-05-01 at 13 17 04@2x

philipp-classen commented 6 months ago

@cookiefirst Indeed, we need to look into it again. On a production profile, I also still see "wallee".

Either the fix is not released yet, or it is not working for a different reason.

philipp-classen commented 6 months ago

This has now hit us in another way as well: recaptcha was misclassified as "wallee" and thus started to get blocked, causing more sites to break.

The concrete case of recaptcha being blocked should be solved now by https://github.com/ghostery/broken-page-reports/pull/624 and https://github.com/ghostery/trackerdb/pull/283. But there may be other unintended side effects of the misclassification.

philipp-classen commented 6 months ago

The fix is included in the upcoming Ghostery release 8.12.10 (https://github.com/ghostery/ghostery-extension/pull/1576).

1204Yoshi commented 6 months ago

image

Maybe it depends on the kind of site.

chrmod commented 6 months ago

Thanks to your report, we have successfully identified and fixed the problem with incorrect attribution of requests to the wrong trackers. The fix is on its way as part of Ghostery 8.12.10.

Thank you for all the effort you've put into helping improve Ghostery; it is much appreciated!

philipp-classen commented 5 months ago

The Firefox release is still pending (it is in review), but on the other platforms (e.g. Chrome, Edge, Opera), I can confirm that the release fixed the problem in my local profiles.

philipp-classen commented 5 months ago

Firefox has also been updated now.