hl2guide / Filterlist-for-AdGuard-or-PiHole

A very aggressive filter-list that consolidates over 370 lists for use in AdGuard Home, Pi-Hole or similar.
MIT License
371 stars 52 forks source link

Invalid domains (illegal characters) #4

Closed p1r473 closed 4 years ago

p1r473 commented 4 years ago

custom_blocklist.txt:

*.local
*.sublimerevenue.com
*.workgroup
adserver-*.adtech.advertising.com
brianloancapital@gmail.com
sales01@pacpackagings.com
track*.marinsm.com

typosquatting_blocklist.txt:

0ffcloud.com

filter_blocklist.txt:

0.r.msn.com
(()|(.))ads.roku.com
(()|(.))adsdk.
(()|(.))adserv.
(()|(.))analytic.
(()|(.))logs.roku.com
(()|(.))metric.
(()|(.))telemetry.
(()|(.))tracking.
(.*)\.g0[0-9]\.(.*)
(.*)\.g00\.(.*)
(|\.)br.com$
(|\.)cn.com$
(|\.)co.com$
(|\.)co.in$
(|\.)com.cn$
(|\.)com.co$
(|\.)com.mx$
(|\.)de.com$
(|\.)eu.com$
(|\.)firm.in$
(|\.)gen.in$
(|\.)gr.com$
(|\.)ind.in$
(|\.)jpn.com$
(|\.)me.uk$
(|\.)net.cn$
(|\.)net.in$
(|\.)net.nz$
(|\.)org.in$
(|\.)org.nz$
(|\.)radio.am$
(|\.)radio.fm$
(|\.)ru.com$
(|\.)sa.com$
(|\.)se.net$
(|\.)uk.com$
(|\.)uk.net$
(|\.)xn--.*$
(|\.)za.com$
(ads|captive|cloudservices|logs).roku.com$
*.2mdn.net
*.adinfuse.com
*.admob.com
*.admob.xiaomi.com
*.adnxs.com
*.ads.linkedin.com
*.ads.mojiva.com
*.ads.mp.mydas.mobi
*.adswizz.com
*.advertising.com
*.amazon-adsystem.com
*.appads.com
*.applift.com
*.applovin.com
*.applvn.com
*.atdmt.com
*.audio2.spotify.com
*.batmobi.net
*.batmobil.net
*.bpmonline.com
*.comscore.com
*.doublecklick.net
*.doubleclick.net
*.er.spo.spotify.com
*.fastclick.net
*.googleadservices.com
*.googletagservices.com
*.inmobi.com
*.jumptap.com
*.lp.mydas.mobi
*.media-match.com
*.moatads.com
*.mocean.mobi
*.mojiva.com
*.msecn.net
*.mycomscore.com
*.mycomscore.net
*.omaze.com
*.pubmatic.com
*.smaato.net
*.smartadserver.com
*.spotx.tv
*.sublimerevenue.com
*.thewhizmarketing.com
*.tune.com
*.video-ak.cdn.spotify.com
*.voicefive.com
*.w.inmobi.com
(.+[-_.])??adse?rv(er?|ice)?s?[0-9]*[-.]
(.+[-_.])??m?ad[sxv]?[0-9]*[-_.]
(.+[-_.])??telemetry[-.]
(.+[-_.])??xn--
(.+[_.-])?ad[sxv]?[0-9]*[_.-]
(.+[_.-])?adse?rv(er?|ice)?s?[0-9]*[_.-]
(.+[_.-])?telemetry[_.-]
ad[0-9.-]*\..*\.(com|net|org)$
adim(age|g)s?[0-9]*[_.-]
ads[0-9.-]*\..*\.(com|net|org)$
adserv.*[-.]
adsrv.*[-.]
adtrack(er|ing)?[0-9]*[_.-]
advert(s|is(ing|ements?))?[0-9]*[_.-]
advert.*[-.]
aff(iliat(es?|ion))?[_.-]
analytic.*[-.]
analytics?[_.-]
banners?[_.-]
beacon.*[-.]
beacons?[0-9]*[_.-]
count(ers?)?[0-9]*[_.-]
partner.*[-.]
promo.*[-.]
stat(s|istics)?[0-9]*[_.-]
track(ing)?[0-9]*[_.-]
track.*[-.]
027cgb.com*.gif
111.175.219.*.js
113.17.188.*.js
120.27.*.html
121.40.136.114*.htm
122.225.103.*.htm
173.208.177.227*.gif
180.96.27.85*.htm
182.92.234.239*.html
2008mm.com*.js
201*.myhard.com
218.26.217.*.html
219.238.159.181*.html
219.238.159.182*.html
221.5.69.52*.js
222.45.224.77*.js
24h*-ad.24hstatic.com
31vcd.com*.js
597txt.com*.php
6080yy.net*0x
61.164.108.184*.gif
61.164.108.184*.swf
63ef.com*.js
6665432.com*.gif
8*.tianya.cn
883ads.com*.js
98441.com*gg
a*.chajiaotong.com
abbao.cn*adblock
abminbuy.com*.gif
ac*.pingguolv.com
activity.*.miui.com
ad*.24hstatic.com
ad*.nexage.com
ad*.tmgrup.com.tr
ad.mail.ru|
adcounter*.uptolike.ru
adi*.cnool.net
admicro*.vcmedia.com
admicro*.vcmedia.vn
ads*.autodaily.vn
ads*.careerbuilder.vn
adserver.*.yahoodns.net
adsrvmedia.adk2.co$important
adtima*.zadn.vn
adv0*.msa.cdn.mediaset.net
aff*.kolektiva.net
alishop*.ru
analytics*.carambo.la
analytics*.clickdimensions.com
analytics-beacon-*.amazonaws.com
analytics-rollout-*.amazonaws.com
analyzer*.fc2.com
anet*.tradedoubler.com
api*.batmobi.net
api*.batmobil.net
api-*.bidmachine.io
at*.doubanio.com
ax.*.ifeng.com
baidu-taobao-av.com*.gif
baiyug.cn*ad.js
banner*.kinogo.by
banner.*.tccapis.com
bar*.shinobi.jp
bbs.gmbbk.com*.js
bdcpro*.techweb.com.cn
bdlm*.hc360.com
bet.championat.com$important
bi-eventtracker-*.amazonaws.com
block.s*block.com
bzfl-1.cc*.gif
cachead.com*.js
cdn*.swaxis.com
cdn-adn-*.rayjump.com
cdnjjvcd.com*.php
ce-global-track-*.amazonaws.com
citysbs.com*swf
clicker.com*pageurl
collect.*.miui.com
collector-*.perimeterx.net
collector-*.tvsquared.com
counter*.freecounter.ovh
counter*-yadro*-ru.unblocked.lol
cs*.mp3bars.com
d*.ruiwen.com
d*.xinshipu.com
d71e6dd31a026d45.com*
data.mistat.*.xiaomi.com
datacollect*.abtasty.com
deloton.com$important
device-metrics-*.amazon.com
dingniugu.com*public.
dm*.ppzuowen.com
dm*.yxlady.com
dn*.ixinwei.com
dualstack.adbert-web-lbs-*.elb.amazonaws.com
dw-informer-*.newsru.com
dybee.tv*.php
dysitecdn.com*.php
eva*-ad.24hstatic.com
fanpingbi*.gaokao.com
fdc.com.cn*adv
fenglan.oss-cn-shenzhen.aliyuncs.com*.gif
firefoxchina.cn*49560.
flurry.agentportal-*.yahoodns.net
flurry.agentportal.*.yahoodns.net
fpb*.51edu.com
fulisuo1.com*.gif
fulisuo7.top*.gif
gcw*.2liang.cn
geoloc*.9cd47096ab1495d8d3b18667f6a52b9c.com
geoloc*.geo20120530.com
geoloc*.geostats.ovh
geoloc*.geovisite.ovh
gonews*.net
gscounters.*.gigya.com
haituie.com*.gif
hao6666.info*.js
hits-*.iubenda.com
hkitblog.com*-banner
hostingcloud.*.wasm
houyi.baofeng.net*.html
iask.cn*.data.
iask.cn*.param.
iask.cn*commercial.
iask.cn*model.
ifengimg.com*300x600
ifengimg.com*600.swf
ifengimg.com*600-2.swf
imedown.info*.gif
img*.hc360.com*.swf
img.90bfw.com*.gif
img.sex169.info*.gif
img2.plures.net*.gif
imp*.tradedoubler.com
impservice*.yodao.com
impservice*.youdao.com
iphone-caviar.ru*entranceid
iptracker-lb-*.amazonaws.com
itmsc.cn*ad0
ja*.gamersky.com
js*.abolezi.com
jupi.cc*.gif
kwflvcdn.000dn.com*.flv
leagueofmovie.com*ads
lively-collect-elb-*.amazonaws.com
log-*.previewnetworks.com
log*.zing.vn
log.zing*.vn
logger-*.dailymotion.com
lt*.tritondigital.com
luolidashu.xyz*.gif
lyd.com.cn*950-90.
marketplace-ios-*.hyprmx.com
mat.chasedream.com*.gif
mcdp-*.outbrain.com
mediate-ios-*.hyprmx.com
meitui.org*.js
metric*.rediff.com
metro-trending-*.amazonaws.com
mf*.advantage.as
minero-proxy-*.sh
minitds-*.info
mobileanalytics.*.amazonaws.com
mobileoffers-*-download.com
nlrsq.com*.js
nv-ad*.24hstatic.com
oclasrv.com$important
orbit*.lun.ua
ow*.biqugego.com
p0y.cn*.swf
pansoso.com*pss.
pianjicdn.com*.php
pic.jd-bbs.com*.swf
pixel.4players.de$important
pixel.wp.pl$important
play*.videos.vidto.me
production-adserver-*.amazonaws.com
putrr*.com
pw321.com*.gif
qizhihaotian.com*.js
qqfby.com*ad
qunlove.com*.gif
r3sub.com*0.gif
rb*.design.ru
rcdn.pro$badfilter
real*traf.ru
report*.appmetrica.webvisor.com
rtbimp-loadbalancer-*.amazonaws.com
s*.adduplex.com
s*.site.flashx.cc
s*.skencituer.com
s*.web.flashx.co
s1-a1.dnvodcdn.me*.svg
sam*.baby-kingdom.com
same*.stockstar.com
sax*.sina.
shaoxing.com.cn*gg.
sock*-goguardian.pusher.com
ssnn.net*-200-250.jpg
ssp*.rtb.beeline.ru
stat*.1internet.tv
static*.365inews.com
stats-*.p2pnow.ru
stats2.*.fdnames.com
subnewss*.net
sudupan.com*.gif
targeting.*.arcpublishing.com
techweb.com.cn*aliyun
teleriumads-*.netdna-ssl.com
tiimg.com*.gif
tracking*.euroads.fi
tracking.*.miui.com
trk*.vidible.tv
tupian55.com*.gif
tw*.netcore.co.in
twaifei.info*.gif
u*.takru.com
uc*.atobo.com
union*.365inews.com
v.newsportal*.ru
vnet.cn*.html
vodtw.la*xhtml
vtnlog-*.elb.amazonaws.com
w*statistics.info
wallstcn.com*-300x250.
webkaka.com*x50.
wenkuxiazai.com*reward
wicp.net*.swf
woolik.com*tracker
wuyulou.com*.php
www.fuze-hill*.xyz
www.fuze-sea*.xyz
www.wifi588.net*.gif
www1.wi.to*.gif
x81zw.com*.gif
xdytt.com*.php
xntk.net*.html
xxx55tp.com*.gif
ya*.dwstatic.com
yktang8.com*.gif
yy18.info*thanks
yyo876.com*.php
yyoyyu.com*.php
zalo-ads*.zadn.vn
zhimg.com*adx
blog.n??tztjanix.net
c7paintedparts.com?5ybuk=ykszqajinq3luw
carlicenseplateframes.com?6vo5=aprqtoksauztgyytprgkycqzcqi
carlicenseplateframes.com?75hlk=foubcujinq3luw
chefbecktruefoodconfessions.com?8fpim=guboirsafwgnlzmpiacvmbyr3luw
dammk??rret.se
estimatorfind.com?8bi=vzqhiafs3iqhzlmpaekdir
etasmarttraining.info?0sy7=lbyumbrp3iqhzlmpaekdir
garywhitakerfamily.net?4p5e3=cjhomqz.3iqhzlmpaekdir
headshopsmell.com?8m11q=faluvzfqbofpuuyybch
iespimeeting.com?732yji=goycpb3iqhzlmpaekdir
ilovepatchouli.com?2zshe=lbikqhbsd0fqbofpuuyybch
patchouliscent.com?48=nqgkcqia3iqhzlmpaekdir
straightshot.us?1z6zj=ucurcfjinq3luw
uberreviewer.com?5euxa=ublsfpjinq3luw
ubertudor.com?55k=ybqimpjinq3luw
virtualpaintexpo.com?67=ypycpb3iqhzlmpaekdir
wp.auto-einstellpl??tze.at
activat.*kas-labs
activat.*kaspersky
ads-*.spotify.com
alibababusiness.net;1
arlmaraceclub.com;1
bigjow.com;1
bireysel-z??raat.com
bireysel-z?raat.com
blancesugrhlthditefrmula.us;1
bloodypresurnewshj.us;1
bogintonline.com;419
booklockdwnforchildnew.us;1
catrinakailalusa.ru;1
darmowa.eu;1
digicool.site;1
dirllquest.net;419
dmxtry.site;1
emailing.top;1
erektionsproblemebeheben.eu;1
eserverk88.com;1
fatcillerprogmnewdjk.us;1
fibooorelisjhdxchsd.us;1
funnyheart.cf;1
g00\.youtube.com$
g02781z1ufn4.biz;1
hedefeposta.com;1
hedefserver.com;1
hlthorblmlught.us;1
hotelsachasmanchester.net;419
icoln.icu;1
imageads*.googleadservices.com
incrivelpravoce.com.br;1
jupiter*.appads.com
ketobdeliciousjhjskjnsghg.us;1
ketofoodhjdietshj.us;1
knisp.icu;1
matingrift.tk;1
mflgruop.com;419
miracleangelinmeardth.us;1
mobilevalut.info;1
neptune*.appads.com
nremik.xyz;1
offremail.eu;1
pagead*.googlesyndication.com
pixel*.spotify.com
portalaican.com;1
pyujsa7o8axa.biz;1
req*.appads.com
rhsde.xyz;1
richwayfinancialservices.co.za;419
s??rahah.eu
s??rahah.pl
saturn*.appads.com
settings-crashlytics-*.*.elb.amazonaws.com
settings-crashlytics-*.us-east-1.elb.amazonaws.com
staysafehomebooklaunch.us;1
tzz72.top;1
unicappi.info;1
visionkooriboostld.us;1
vrisok.xyz;1
www.huala??e.cl
www.huala?e.cl
xn--80aboyon9b3av.xn--p1ai;1
xn--90a5af2aabif.xn--p1ai;1
xn--90akf1acw8d.xn--p1ai;1
xn--d1aqfnc7a6bo.xn--p1ai;1
xn--f1agb5at1b.xn--p1ai;1
hl2guide commented 4 years ago

Thanks for reporting this. Since AdGuard Home supports regular expressions quite a few of the above lines are valid. I'll review this over the next few days.

p1r473 commented 4 years ago

Thanks. Though some domains may look okay, my script is still picking up illegal non-standard characters in them

hl2guide commented 4 years ago

This issue has been added to my next big update scheduled for July 2020.