hagezi / dns-blocklists

DNS-Blocklists: For a better internet - keep the internet clean!
GNU General Public License v3.0

List of suggestions for sources of hosts data Gambling (Indonesia) #2207

Closed by alsyundawy 8 months ago

alsyundawy commented 8 months ago

Which domain(s) should be blocked?

List of suggestions for sources of hosts data Gambling (Indonesia)

https://raw.githubusercontent.com/alsyundawy/TrustPositif/main/judi-Sep23_20Feb2024.txt

Why should these domain(s) be blocked?

Indonesian gambling sites.

hagezi commented 8 months ago

Thanks @alsyundawy, I'll take a look and add it to the basic sources for the gambling list if necessary.

Is the list updated regularly, or is it static? I ask because the list name contains a time window. If it is not a static list, you should choose a different name for it, e.g. gambling_in.txt

What is the source of the list?

Cheers, Gerd

alsyundawy commented 8 months ago

Thanks for the response.

I will try to update the list at most once every 2-3 days. Currently it only covers September 2023 to 20 February 2024. Meanwhile, I will update the database with entries from before or after that period.

hagezi commented 8 months ago

@alsyundawy What is the source of the list?

alsyundawy commented 8 months ago

I am an engineer at an internet service provider in Indonesia.

Indonesia has rules for blocking porn, scams, copyright infringement, and gambling, issued by the Indonesian Ministry of Communication and Informatics (Kementerian Komunikasi dan Informatika Republik Indonesia), kominfo.go.id. The list is not public; it is for internet service providers only. The ministry sends a daily list to all ISPs via email. I compile these lists and extract only the online gambling sites for you.

https://prnt.sc/1BDFIiDzZKhC

https://prnt.sc/URdIgYVh1Z5x

You can check the list at https://trustpositif.kominfo.go.id/

https://prnt.sc/IOTww_OEit_b

hagezi commented 8 months ago

Thanks @alsyundawy

Result of my quick analysis:

"Collateral damage", false positive domains, no gambling:

*.canada.ca
*.gov
*.hp.com
*.uol.com.br
123recht.de
amazon.com
b.link
bbva.es
buttondown.email
emailmeform.com
eveninsight.com
faculty.um.edu.sa
heylink.me
linkpop.com
livescore.com
page.link
postimg.cc
restaurantguru.com
scorebar.com
shoestringacres.com
taptap.io
terminalserviceplus.com
vercel.app
vikramuniversity.org
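A minimal sketch of how false positives like these could be removed from a source list before it is merged. This is not the tooling used in this repo; the function names are hypothetical, and it assumes that a `*.suffix` entry covers the suffix itself and all of its subdomains, while a plain entry matches exactly.

```python
def is_allowlisted(domain: str, allowlist: list[str]) -> bool:
    """Return True if domain matches an exact entry or a '*.suffix' wildcard."""
    domain = domain.strip().lower().rstrip(".")
    for entry in allowlist:
        entry = entry.strip().lower()
        if entry.startswith("*."):
            suffix = entry[2:]
            # Assumption: '*.gov' covers 'gov' itself and every name below it.
            if domain == suffix or domain.endswith("." + suffix):
                return True
        elif domain == entry:
            # Assumption: plain entries match the exact name only.
            return True
    return False


def remove_false_positives(domains: list[str], allowlist: list[str]) -> list[str]:
    """Keep only the domains that are not covered by the allowlist."""
    return [d for d in domains if not is_allowlisted(d, allowlist)]


# A few entries taken from the list above.
ALLOWLIST = ["*.canada.ca", "*.gov", "amazon.com", "vercel.app"]

if __name__ == "__main__":
    sample = ["usa.gov", "example-casino.test", "amazon.com", "www.canada.ca"]
    print(remove_false_positives(sample, ALLOWLIST))
```

Whether plain entries should also cover subdomains depends on the blocklist format, so that rule would need adjusting for real use.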

Dead domains: 6006 NXDOMAIN | 16808 SERVFAIL

Will the file name on GitHub stay the same? If so, I'll add the following list as a source to the gambling list: https://raw.githubusercontent.com/alsyundawy/TrustPositif/main/judi-Sep23_20Feb2024.txt

Cheers, Gerd

hagezi commented 8 months ago

@alsyundawy I have added your list to the gambling list. If the name changes, just let me know.

The updated gambling list is online.

hagezi commented 8 months ago

@alsyundawy I investigated the list further; it contains a lot of 404 domains. I have sorted out the "rubbish" so far and only include active domains that have been on any of the top 1M lists in the last 12 months. I don't want to inflate the gambling list with domains that are never called anyway or deliver 404. Of the 160000 domains, ~35000 currently remain.
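For illustration only, the top-list part of that filter could look roughly like the sketch below. It is not the actual build script: `keep_ranked` is a hypothetical name, the top-list data is assumed to be already loaded as plain domain strings, and the check for active (non-404) domains is left out.

```python
def keep_ranked(candidates: list[str], top_domains: list[str]) -> list[str]:
    """Keep candidates that appear in the top-list set, deduplicated, in order."""
    top = {d.strip().lower() for d in top_domains if d.strip()}
    seen = set()
    kept = []
    for raw in candidates:
        domain = raw.strip().lower()
        # Skip blanks, unranked domains, and repeats.
        if domain and domain in top and domain not in seen:
            seen.add(domain)
            kept.append(domain)
    return kept


if __name__ == "__main__":
    source = ["Casino-a.test", "casino-b.test", "casino-a.test", ""]
    top_lists = ["casino-a.test", "news.test"]  # stand-in for merged top 1M lists
    print(keep_ranked(source, top_lists))
```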

alsyundawy commented 8 months ago

heylink.me is owned by a gambling site; they use it for short links.

iam-py-test commented 8 months ago

"heylink.me is owned by a gambling site; they use it for short links."

heylink.me seems to be just a general-purpose URL shortener owned by the marketing company Persollo Pty Ltd (it was mistakenly included in my filterlist). Thanks

hagezi commented 8 months ago

Switched the source to https://raw.githubusercontent.com/alsyundawy/TrustPositif/main/gambling_indonesia.txt

alsyundawy commented 8 months ago

Thanks, it is updated now.

alsyundawy commented 8 months ago

Update: 8 March 2024

alsyundawy commented 8 months ago

Update: 11 March 2024

alsyundawy commented 7 months ago

Update: 2 April 2024

alsyundawy commented 7 months ago

Hi @iam-py-test @hagezi, the latest update to my list added 5-15k domains. Why has this repo not been updated?

Also, how can I validate all domains in my list against the TLD list at https://data.iana.org/TLD/tlds-alpha-by-domain.txt?
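One possible way to do that TLD check, as a rough sketch. It assumes the IANA file has one upper-case TLD per line plus a leading `#` comment line, and that internationalized names in the list are already in punycode (`xn--...`) form. The function names and the inline sample are made up for illustration; in practice the text would be downloaded from the URL above.

```python
def parse_tlds(text: str) -> set[str]:
    """Parse the IANA TLD file: skip '#' comment lines, lower-case the rest."""
    return {
        line.strip().lower()
        for line in text.splitlines()
        if line.strip() and not line.startswith("#")
    }


def has_valid_tld(domain: str, tlds: set[str]) -> bool:
    """True if the domain has at least two labels and its last label is a known TLD."""
    labels = domain.strip().lower().rstrip(".").split(".")
    return len(labels) >= 2 and all(labels) and labels[-1] in tlds


def split_by_tld(domains: list[str], tlds: set[str]) -> tuple[list[str], list[str]]:
    """Return (valid, invalid) domain lists, preserving input order."""
    valid = [d for d in domains if has_valid_tld(d, tlds)]
    invalid = [d for d in domains if not has_valid_tld(d, tlds)]
    return valid, invalid


if __name__ == "__main__":
    # Tiny stand-in for the real file content (hypothetical sample).
    sample_file = "# sample header comment\nCOM\nID\nME\n"
    tlds = parse_tlds(sample_file)
    print(split_by_tld(["heylink.me", "example.invalidtld", "localhost"], tlds))
```

This only checks the last label; it says nothing about whether the domain resolves.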

hagezi commented 7 months ago

"Hi @iam-py-test @hagezi, the latest update to my list added 5-15k domains. Why has this repo not been updated?"

"Also, how can I validate all domains in my list against the TLD list at https://data.iana.org/TLD/tlds-alpha-by-domain.txt?"

https://github.com/hagezi/dns-blocklists/issues/2207#issuecomment-1957461777

"I have sorted out the "rubbish" so far and only include active domains that have been on any of the top 1M lists in the last 12 months. I don't want to inflate the gambling list with domains that are never called anyway or deliver 404."

# Download and convert Sourcelists ...

  Nr |   Count | Format  | Source | Status  | File      | URL/File
   1 |  379333 | domains | http   | online  | unchanged | https://raw.githubusercontent.com/alsyundawy/TrustPositif/main/gambling_indonesia.txt

# Build gambling.top Domainlist ...

Stats gambling.top:

** Source (raw):    379333
++ FLD:             379337 (+4)
++ WWW:             673254 (+293917)
-- OnlyAt:          74539 (-598715)

74539 unique Domains - Version 2024.0406.2218.45
MD5 Domains RAW: c186844c33cd2cfe6fe9b28a1359cb83
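The counts in the stats block can be sanity-checked with simple arithmetic: each step's count should equal the previous count plus the delta in parentheses. A small sketch with the numbers copied from the log above; the helper name is hypothetical, and no meaning is assumed for the FLD, WWW and OnlyAt labels.

```python
def check_deltas(raw_count: int, steps: list[tuple[str, int, int]]) -> bool:
    """Verify that every step count equals the previous count plus its delta."""
    previous = raw_count
    for label, count, delta in steps:
        if previous + delta != count:
            return False
        previous = count
    return True


RAW = 379333
STEPS = [
    ("FLD", 379337, 4),
    ("WWW", 673254, 293917),
    ("OnlyAt", 74539, -598715),
]

if __name__ == "__main__":
    print(check_deltas(RAW, STEPS))
```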