EFForg / badger-sett

Automated training for Privacy Badger. Badger Sett automates browsers to visit websites to produce fresh Privacy Badger tracker data.
https://www.eff.org/badger-pretraining
MIT License
119 stars 13 forks source link

Consider creating a temporary skip list of sites #83

Closed ghostwords closed 7 months ago

ghostwords commented 7 months ago

To further speed up and optimize tracker discovery.

We could one-time or time-limited skip DNS or other errors and sites with no trackers.

From https://github.com/EFForg/badger-sett/issues/79#issuecomment-1932265307.

ghostwords commented 7 months ago

We'll now try skipping the last week's worth of errored out domains (this includes security pages that we detect) when building the site list. If this helps, will follow up with fixes and/or improvements as necessary.

ghostwords commented 7 months ago

Combined changeset: https://github.com/EFForg/badger-sett/compare/0f6e452c2e9d1dde144244f22d16e9b9ec80d713%5E...8ec19055f2bfc3071015825d1aae3e5500d7988c

ghostwords commented 7 months ago

Also, f2450c0e66b3d0cc79c06f185ae477f5e99c5162 and ceab09f78b75d81ae5a113eeb03301ac8cc14278.