zeeguu / api

API for tracking a learner's progress when reading materials in a foreign language and recommending further personalized exercises and readings.
https://zeeguu.org
MIT License
7 stars 19 forks source link

Add Profanity Filter to Random Words #197

Open tfnribeiro opened 2 weeks ago

tfnribeiro commented 2 weeks ago
tfnribeiro commented 2 weeks ago

Consider using the following library: https://github.com/LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words?tab=readme-ov-file

tfnribeiro commented 1 week ago

Should we also use a list of Names / Cities to not consider them for exercises?

mircealungu commented 1 week ago

We should!

On Thu, 5 Sep 2024 at 09.42, Tiago Ribeiro @.***> wrote:

Should we also use a list of Names / Cities to not consider them for exercises?

— Reply to this email directly, view it on GitHub https://github.com/zeeguu/api/issues/197#issuecomment-2330829409, or unsubscribe https://github.com/notifications/unsubscribe-auth/AADRNB6CILN4QPT7NTEI2V3ZVADMTAVCNFSM6AAAAABNJ2HDO6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMZQHAZDSNBQHE . You are receiving this because you are subscribed to this thread.Message ID: @.***>

tfnribeiro commented 1 week ago

I found some word lists and plugging it in results in about 2.8k words being removed, some of those could be okay, but I guess in this case we will still have plenty of "valid" words to work as distractors, and we avoid the cases where a "London" or "December" come in.

mircealungu commented 1 week ago

We can afford to remove 2.8k words, for sure :)