hagezi / dns-blocklists

DNS-Blocklists: For a better internet - keep the internet clean!
GNU General Public License v3.0
7.19k stars 241 forks source link

Source Suggestions #2537

Closed jarelllama closed 7 months ago

jarelllama commented 7 months ago

https://github.com/infinitytec/blocklists http://phishing.mailscanner.info/phishing.bad.sites.conf

I have yet to check them for dead domains and false positives. I'll do so in a few hours if you've yet to.

hagezi commented 7 months ago

Thanks, I'll have a look at it in a few days.

jarelllama commented 7 months ago

https://raw.githubusercontent.com/infinitytec/blocklists/master/scams-and-phishing.txt (I have yet to look at the other blocklists in the repo):

# Total entries
$ wc -l scams-and-phishing.txt 
22779 scams-and-phishing.txt

# Unique entries
$ comm -23 scams-and-phishing.txt tif.txt | wc -l
22395

# Dead entries:
$ wc -l dead.tmp
13483 dead.tmp

# Domains in toplist:
$ comm -12 scams-and-phishing.txt toplist.tmp
000webhostapp.com
accintrend.com
advexplore.com
amarktflow.com
amazonses.com
americancialisnow.com
americanlisted.com
americanviagranext.com
avivaromm.com
backscatterer.org
banyanhill.com
benefitsdepot.net
bestblackhatforum.com
bestdayeversweeps.com
besttechtrend.com
besttipsdaily.com
blackhatworld.com
boncharge.com
buszcentrum.com
buyercenter.help
buyviagraotcusa.monster
buzzworthyoffers.com
caphemoingay.com
casinomeritroyal.com
cheapgeneric.monster
cheaprxprednisonetablets.monster
cheaptablets20mg.monster
choicegoldcard.com
ciaaliss.com
cialis20mgbuypillsnorx.monster
cialis20mgotcnorx.monster
cialis20prescriptionotconline.monster
cialisblacksnorx.com
cialisgenericbuy.quest
cialisnorx20mgonlineotc.monster
cialispillsbuyonlinegeneric.quest
cialiswithoutprescriptiononline.quest
cleckleyfloors.com
click4riches.com
convertwithwave.com
cosmiccuts.com
countrywideconcealed.com
cpaelites.com
cstpersl.com
customwritings.com
dailytoptips4u.com
deepwebsiteslinks.com
defendershield.com
desertcart.com
diaart.org
dosage20mgcialisa.com
elexusbet147.com
eurocasinogir.com
everydayread.net
foundmoneyguide.com
freebitco.in
freescore360.com
freewebsitetemplates.com
g2afse.com
gamersunite.com
generic.monster
genericcures.com
genericonline.monster
getitfree-samples.com
getpuravive.com
getquickmanuals.com
girbahise.com
gowavebrowser.com
grantsreach.com
gshopper.com
healy.shop
help.law
herpessymptomsinmen.org
hhydroxychloroquine.com
hotel-ds.com
icalserver-multisite.com
infopathy.com
intellihub.com
ipsnews.net
ivermectin12mg.quest
ivermectin3m.quest
ivermectinak.quest
ivermectinan.quest
ivermectinas.quest
ivermectinasale.quest
ivermectinchp.com
ivermectincovid.quest
ivermectindu.monster
ivermectineb.quest
ivermectined.energy
ivermectinee.quest
ivermectinew.monster
ivermectinflcc.monster
ivermecting.quest
ivermectinj.quest
ivermectinl.quest
ivermectinma.quest
ivermectinmy.quest
ivermectinon.quest
ivermectins.quest
ivermectint.quest
jeewangarg.com
joom.com
keshefoundation.org
kqzyfj.com
l5srv.net
lasixgenericname100mg.quest
lasixgenericname100mgbuy.quest
lautenschlager.net
madridbett.com
manualsdirectory.org
menolparkreport.com
meritkingbahis.com
meritroyalbet1.com
meritroyalbetgiris.me
meritroyalbetotel.com
molnupiravir.monster
moneyandmarkets.com
mycapturepage.com
mytacticalpromos.com
news-headlines.co
newsandpromotions.com
newsbtc.com
oivermectina.monster
onelaunch.com
onlinecasinorealmoneyusa.quest
onlinecasinorealmoneyx.com
onlinetop-tips.com
otohits.net
pchelpsoft.com
popplunder.com
prednisoneca.com
prehomemart.com
productreportcard.com
promotionsonlineusa.com
ragingbullslotscampaign.com
reimageplus.com
resourcesify.com
rewards-locker.com
rewardsavenue.net
rewardsgiantusa.com
romedic.ro
safety-search.com
shieldyourbody.com
sibforms.com
signadios-lodsource.icu
spnccrzone.com
sprkcvr.com
stromectolst.com
stylesforless.com
surveys2cash.com
surveysandpromoonline.com
tacticalusa.com
tadalafilmix.quest
theamericansurvey.com
thepersonalfinancialguide.com
therewardboost.com
thesafersearch.com
topexpertinsight.com
toptrustytips.com
trybandoo.com
uceprotect.net
ultci.com
unemploymentbenefitsguide.com
unifiedlayer.com
upmychrome.com
usaviagraprice100mg.com
variantverdict.com
videopal.me
vincemartin.us
wavebrowser.co
wavebrowser.net
webseed.com
whatifoffers.com
windowssearch-exp.com
wowcher.co.uk
ymcart.com

http://phishing.mailscanner.info/phishing.bad.sites.conf:

# Total entries
$ wc -l mailscanner.txt
36632 mailscanner.txt

# Unique entries
$ comm -23 mailscanner.txt tif.txt | wc -l
19339

# Domains in toplist
$ comm -12 mailscanner.txt toplist.tmp 
1drv.ms
2no.co
4everland.io
707.su
8gwifi.org
abrir.link
academy-skrf.ru
acesse.dev
activationpanel.net
adclicker.io
afrinic.net
allmy.bio
alturl.com
apksos.com
apptuts.bio
appurl.io
atbu.edu.ng
beacons.ai
bezahlen.net
bio.link
bio.site
biolinky.co
bitconce.top
bitly.net
bitly.ws
bueno.art
builderallwppro.com
c8ke.com
cachedview.com
cage.report
cash.app
cdntechone.com
cf-ipfs.com
cfi.net.cn
chilipepper.io
clck.ru
cli.re
cloudflare-ipfs.com
coinvid.com
confirmsubscription.com
corrector.co
cs.money
dhl-news.com
disq.us
donweb.com
dy.fi
dyrk.org
emaze.me
etrack01.com
ezstat.ru
flow.page
flowcode.com
formdesigner.pro
forms.gle
formsubmit.co
gg.gg
glitch.com
gobmx.org
goo.by
goo.su
google.com
googleweblight.com
gyazo.com
heyflow.id
heylink.me
hotm.art
howhow.cl
href.li
hyperfollow.com
idm.in
idolink.com
ilang.in
im-creator.com
infogram.com
infura-ipfs.io
inwayhosting.com
inx.lv
ipfs.io
iplogger.com
issuu.com
itvopen.net
ix-event.com.tr
jali.me
jemi.so
jii.li
jivo.chat
jtbtigers.com
keepo.io
l1nq.com
lihi.cc
lihi1.com
lihi3.cc
lin01.bid
linkbio.co
linkby.tw
linkin.bio
linknbio.com
linkpop.com
linkr.bio
linktr.ee
ln.run
lnk.bio
lnkd.in
loyaltygateway.com
magic.ly
mailstat.us
mandrillapp.com
matjarapk.com
maximilianoperet.com.ar
me-qr.com
mez.ink
mobidrive.com
msgsndr.com
myjaxx.pro
neartail.com
nexoinmobiliario.pe
nftstorage.link
njcuh.cn
ojaawtr.cn
omhhrkg.cn
one-digitalservice.ch
onl.la
onne.link
onx.la
openinapp.co
openinapp.link
ortydfo.cn
pixelfy.me
ppt.cc
promoblackdaysvuelapromo.ru
pxlme.me
q-r.to
qr1.be
qrco.de
qrcodes.pro
qrfy.com
questionpro.com
quip.com
rb.gy
rebrand.ly
replit.com
resume.io
reurl.cc
risu.io
rpp.pe
s.id
scnv.io
scsang.cn
serviciodecorreo.es
shoppy.gg
short.gy
shorter.me
shorturl.at
skro.in
slidesgo.com
snapto.link
snip.ly
socprofile.com
sourl.cn
stackoverflow.com
steprimo.com
sterlitamakadm.ru
streamable.com
streamlink.to
surl.li
surveyheart.com
szvmt.cn
szwlu.cn
t.co
t.ly
t.me
tally.so
tap.bio
taplink.cc
tawk.to
tcheturbo.com.br
telegra.ph
threads.com
tiny.one
tllbvpb.cn
toolsxsocial.in
tr.ee
triagroup.ru
tribelio.page
tt.vg
tukofertas.com.br
tundrafile.com
u.to
unbouncepages.com
uqr.to
urlscan.io
urlz.fr
urlzs.com
usnd.to
utua.com.br
vfmucta.cn
via0.com
vk.com
vkontakte.ru
vu.fr
w-mt.co
w3s.link
wa.me
warriorplus.com
web66.com.tw
webogram.org
wepik.com
wlo.link
workflowy.com
x.gd
xeudbnk.cn
xjwyjfs.cn
xkrmugl.cn
xniyrji.cn
xurl.es
yandex.ru
ydgxgg.cn
yip.su
ylslpw.cn
zpr.io

Will run dead check in a bit.

hagezi commented 7 months ago

Thank you, can you make the dead domains available? Then I don't have to check again. Thank you!

jarelllama commented 7 months ago

Will do, just waiting for dead domains linter to finish.

jarelllama commented 7 months ago

I haven't ran the dead check on the mailscanner list yet but look at some of these false positives from both sources (in the edit). I see some false positives in there like yandex.ru. What do you think?

hagezi commented 7 months ago

So if such domains end up on the list, I won't add them.

jarelllama commented 7 months ago

I'll take a deeper look at https://github.com/infinitytec/blocklists when I have the time later.

jarelllama commented 7 months ago

Hi again, here are the various reports generated for the blocklists in question: https://github.com/infinitytec/blocklists/raw/master/ads-and-trackers.txt: compressed from 53043 to 342, 23% invalid, 19% dead https://github.com/infinitytec/blocklists/raw/master/scams-and-phishing.txt: compressed from 22767 to 3893, 52% dead https://github.com/infinitytec/blocklists/raw/master/mlm.txt: compressed from 143351 to 823, 75% invalid (most contain underscores) http://phishing.mailscanner.info/phishing.bad.sites.conf: 219 in Tranco, probably needs a deeper dive

hagezi commented 7 months ago

Thanks for the effort, I think we should leave it at that.

My conclusion, not worth it.

jarelllama commented 7 months ago

Agreed.