danny0838 / content-farm-terminator

Content Farm Terminator browser extension/「終結內容農場」瀏覽器套件
https://danny0838.github.io/content-farm-terminator/
GNU General Public License v3.0
1.34k stars, 47 forks

content-farms.txt contains an official government website where you can look up the law #96

Closed: galantra closed this issue 6 months ago

galantra commented 7 months ago

The website I mean is landesrecht.rlp.de. It is a web portal for the law of Rhineland-Palatinate, one of the 16 federal states of Germany. Its inclusion in the default blacklist seems like a clear mistake to me.

danny0838 commented 6 months ago

Removed. Thank you for the feedback.

Retia-Adolf commented 5 months ago

It is still included in https://danny0838.github.io/content-farm-terminator/files/blocklist/extra-content-farms.txt. I'm not sure what the criteria for the extra list are; even nasa.gov is included, along with several others:

rlp.de #!scyrte
nasa.gov #!scyrte
medium.com #!scyrte  # sometimes requires login to read the full article, but not always
ctan.org #!scyrte    # TeX archive; surely not a content farm, unless a package index site counts as one
brown.edu #!scyrte

danny0838 commented 5 months ago

Removed scyrte from the aggregation sources, as its inclusion criteria are not transparent enough and it is too likely to include normal websites.
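
For anyone auditing a copy of the list themselves, entries from a single source can be picked out by their `#!` tag. Below is a minimal Python sketch, assuming each line has the shape `rule #!source # optional note` as in the excerpt quoted earlier in this thread. The extension's actual parser and list format may differ, and `parse_line`, `drop_source`, and the `example.*` entries are hypothetical names used only for illustration.

```python
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    rule: str               # domain or pattern, e.g. "nasa.gov"
    source: Optional[str]   # tag after "#!", e.g. "scyrte"; None if absent
    note: str               # free-text comment after the tag, if any


def parse_line(line: str) -> Optional[Entry]:
    """Parse one line; return None for blank or comment-only lines.

    Assumed format (inferred from the quoted excerpt, not from the
    extension's source code): `rule #!source # optional note`.
    """
    rule, _, rest = line.partition("#")
    rule = rule.strip()
    if not rule:
        return None
    source, note = None, rest.strip()
    if rest.startswith("!"):
        # "!scyrte  # some note" -> tag "scyrte", note "some note"
        tag, _, tail = rest[1:].partition("#")
        source = tag.strip() or None
        note = tail.strip()
    return Entry(rule, source, note)


def drop_source(lines: List[str], source: str) -> List[str]:
    """Return the lines whose entries are not tagged with `source`."""
    kept = []
    for line in lines:
        entry = parse_line(line)
        if entry is None or entry.source != source:
            kept.append(line)
    return kept
```

For example, `drop_source(["rlp.de #!scyrte", "example.org #!other", "example.net"], "scyrte")` keeps only the last two lines, which is roughly the effect of dropping one aggregation source from a merged list.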