columndeeply / hosts

Unified porn blocklist. More than 10 million domains as of 2024. Updated monthly.
88 stars 9 forks source link

Fixing cleanup.sh to remove http(s) from the domains #3

Closed columndeeply closed 1 year ago

columndeeply commented 1 year ago

The cleanup.sh script isn't removing http(s) from the urls (as mentioned in #2 by @joe820912boy).

For example, in hosts01: https://eporner.com.es/ https://poop.vids.rip/ https://wargers.org/ https://www.ebookrenta.com/ https://www.pornbfvideo.com/ https://www.pussyboy.net/ https://www.pornv.xxx/ https://www.sexvid.pro/ https://www.xvideos51.com/

I'm also seeing a few that aren't just domains, for example: iptorrents.com/torrents lasmejoreswebsporno.com/en lusthive.com/tym

These should have the "/tym", "/en", "/torrents", etc. The cleanup script should only keep the (sub)domain.

joe820912boy commented 1 year ago

Yes,

there also exists that some url not domain, mainly in host02 LIKE reddit forum or twitter (sub dir format),

It exists some content about adult in it, but I have no idea it's suitable for DOMAIN

But maybe it works for host file used for blocking adult content

columndeeply commented 1 year ago

It should be fixed now. Let me know if you see anything else that should be changed.

joe820912boy commented 1 year ago

Ok sure, thanks for fix it

joe820912boy commented 1 year ago

@columndeeply

"www.xnxx.com › search › x... " "www.xvideos2.com › tags" "xxxvideo.blog.br › videos › xvideos" "www.pornocarioca.com › Vídeos"

in host03 needs be cleaned

columndeeply commented 1 year ago

Done, thanks!

joe820912boy commented 1 year ago

@columndeeply

"www.newgrounds.com" in host03,

"www.deusx.com" in host02

seems they are game websit, not to be adult

joe820912boy commented 1 year ago

And one more thing I want to confirm that,

Some domains like "git.git.git."

in host01 git.git.git.ladyboypornpics.net git.git.git.larahill.hotsexyandbigtity.chttpth5.opt-vk.ru git.git.git.laughsexround.email git.git.git.leatherbondagestore.gci.jaxdental.com git.git.git.lemmaeof.gay git.git.git.lesleycarter.sexycandidgirls.com git.git.git.lesyaromanovski.hotsexwww.dom.tyt.cash git.git.git.letsexplore.cf git.git.git.license.instaporn.to git.git.git.lime.cryptoxxxxl.com

where are these sources come from which repository?

columndeeply commented 1 year ago

Thanks for letting me know @joe820912boy. I've whitelisted deusx.com. NewGrounds is still banned because according to their rules they allow porn games/pictures. If anybody still wants to access the site they can whitelist it in their PiHole/AdGuard/whatever instance.

The "git.git.git" domains seem to be coming from a few sources, but the main one is: https://github.com/RPiList/specials/tree/master/Blocklisten

As far as I know it's still a valid domain so I would leave them listed.

Btw, if you find anything else could you open a new issue? Not sure why but GitHub isn't notifying me once an issue has been closed so I might miss your messages.

joe820912boy commented 1 year ago

OK, thanks

I will open new issue if find some invalid domain