nextdns / blocklists

77 stars 28 forks source link

The Quantum Ad-List #44

Open M86xKC opened 3 years ago

M86xKC commented 3 years ago

Add support for The Quantum Ad-List in NextDNS blocklists.

About The Quantum Ad-List

Made an AI to track and analyse every websites, a bit like a web crawler, to find and identify ads. It is a list containing over 1300000 domains used by ads, trackers, miners, malwares, and much more! It is specifically designed for hosts file, but can also be used with ad-blockers with the ad-blocker "optimized" variant.


All in one list: https://gitlab.com/The_Quantum_Alpha/the-quantum-ad-list/-/raw/master/For%20hosts%20file/The_Quantum_Ad-List.txt


Individual lists:

Abuse list: https://gitlab.com/The_Quantum_Alpha/the-quantum-ad-list/-/raw/master/Individual%20lists/The_Quantum_Abuse-List.txt

Privacy list: https://gitlab.com/The_Quantum_Alpha/the-quantum-ad-list/-/raw/master/Individual%20lists/The_Quantum_Privacy-list.txt

Generic ad list: https://gitlab.com/The_Quantum_Alpha/the-quantum-ad-list/-/raw/master/Individual%20lists/The_Quantum_Simply-ads-list.txt

YouTube ad list: https://gitlab.com/The_Quantum_Alpha/the-quantum-ad-list/-/raw/master/Individual%20lists/The_Quantum_Youtube-Ads-List.txt

ghost commented 3 years ago

Actually i'm for but i will Add in description a very big Warning, i have tested these list in the past and they have many false positive for now, i have removed it because too many good website was blocked

crssi commented 3 years ago

@michaelb-ae What are you talking about? If I see it correctly those lists started 2 week ago and I do not see any issue you would post there about any breakages.

ghost commented 3 years ago

i have tested it and i don't know why but immediately login.live.* and other major domain was blocked (and on AdGuard Home) he show the list who is the one who allow or block the domain, and all breakage comes from the quantum adlist, maybe a bad version or a mistake during the generation of the list ? i don't know but before really ad any list, i think more than 2 weeks of monitoring must be done.

edit : after checking the gitlab i also see these list don't have been updated since two weeks ago, so we don't even know if it's a dynamic list (frequently updated) or a static list. image

crssi commented 3 years ago

Well, login.live.com opens just fine here. Stop telling us and open an issue there where it should be, https://gitlab.com/The_Quantum_Alpha/the-quantum-ad-list/-/issues

The-Quantum-Alpha commented 3 years ago

I have indeed fixed several things with the AI, give it a shot once again!!

(The main aggregated list, not the individuals that aren't unpdated yet)

The-Quantum-Alpha commented 3 years ago

i have tested it and i don't know why but immediately login.live.* and other major domain was blocked (and on AdGuard Home) he show the list who is the one who allow or block the domain, and all breakage comes from the quantum adlist, maybe a bad version or a mistake during the generation of the list ? i don't know but before really ad any list, i think more than 2 weeks of monitoring must be done.

edit : after checking the gitlab i also see these list don't have been updated since two weeks ago, so we don't even know if it's a dynamic list (frequently updated) or a static list. image

image

crssi commented 3 years ago

@romaincointepas is there any ETA for addition of this list?

Thank you and cheers ❤️

The-Quantum-Alpha commented 3 years ago

The individual lists have been updated by her!

Should now be better in every shape and forms!

Can also tip us if you want! :upside_down_face:

The-Quantum-Alpha commented 3 years ago

tqal epic

We are hardcore...

crssi commented 3 years ago

humble bump 😄

The-Quantum-Alpha commented 3 years ago

For updates and other spam info , https://fosstodon.org/@The_Quantum_AdList

ghost commented 3 years ago

@The-Quantum-Alpha how does she manage (i don't ask code but to know if these is many false positive or not) the false positive (does she scan manual white list from other public project to distinguish false positive), i don't really want to maintain a veeeery long manual whitelist ?

The-Quantum-Alpha commented 3 years ago

@Macqael indeed we are a team! She helps me, I help her. Please do not target a specific individual as the responsible.

The way whitelist are managed is all artificial, and we only give 2% of maximal error margin. The AI must be 98% certain of it actions, thus we allow down to 95%, but everything that fall in that range are manually verified.

The AI is part of the most advanced AI in the world yet, it is no small thing.

I would say that if a domain is blocked, there is a strong reason behind it.

Thank you for your questioning!