first20hours / google-10000-english

This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of the Google's Trillion Word Corpus.
Other
3.88k stars 1.93k forks source link

Some bad words not filtered from clean versions #16

Closed Elizafox closed 6 years ago

Elizafox commented 6 years ago

Here are some of the bad/potentially offensive words I've found that aren't being filtered (click triangle to show):

Bad words sexcam, livesex, jo (slang abbreviation for masturbation), worldsex, vibrators, cumshots, twinks, xnxx (porn site), shemales, upskirts, milfhunter, milfs, bangbus (porn site)


There's probably a few others I'm not noticing or my moral compass doesn't think are a big deal, but those are the big ones.

Elizafox commented 6 years ago

There's also a lot of medications prone to abuse like xanax and vallium, but that's a subjective judgement whether or not those are "offensive".