alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Apache License 2.0

Implement profanity filter #623

Open nshmyrev opened 3 years ago

dev2devportal commented 3 years ago

Subscribed. Looking forward to figuring this out. We also hope to bring in one or two other people to help work on it and then test it in our R&D environment before sending it to the Development team for integration into the LMS.

nshmyrev commented 3 years ago

https://pypi.org/project/profanity-filter/ is a useful library/data

lmhawke commented 3 years ago

Yes, that is the one we were looking at. We are trying to figure out how to integrate it with Vosk and Jitsi (Jigasi).
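One possible integration point is post-processing the JSON that the recognizer returns. The following is only a minimal sketch under stated assumptions: it relies on the Vosk result carrying a `"text"` field (plus a per-word `"result"` list when word output is enabled), while `BLOCKLIST`, `mask`, and `filter_result` are hypothetical names, and the placeholder words stand in for a real word list such as the one shipped with profanity-filter.

```python
import json

# Hypothetical blocklist; mild placeholder words stand in for a real list.
BLOCKLIST = {"darn", "heck"}


def mask(word: str) -> str:
    """Replace every character of a word with an asterisk."""
    return "*" * len(word)


def filter_result(result_json: str, blocklist=BLOCKLIST) -> str:
    """Mask blocklisted words in a Vosk-style JSON result string."""
    result = json.loads(result_json)

    # Per-word entries exist only when word-level output is enabled.
    for entry in result.get("result", []):
        if entry["word"].lower() in blocklist:
            entry["word"] = mask(entry["word"])

    # The plain transcript is treated as space-separated words.
    words = result.get("text", "").split()
    result["text"] = " ".join(
        mask(w) if w.lower() in blocklist else w for w in words
    )
    return json.dumps(result)


# Sample input shaped like a recognizer result (no audio needed here).
sample = '{"text": "well heck that was fast"}'
filtered = json.loads(filter_result(sample))
print(filtered["text"])
```

Because the filter only touches the result JSON, the same function could in principle sit in a server or client wrapper without changes to the recognizer itself.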

abb128 commented 3 years ago

Steam's chat filtering has a feature where you can switch between three different modes: filtering nothing, filtering only slurs and filtering all profanities.

I think this is an important feature to consider because in many cases people may tolerate casual swearing, but not racism and slur-dropping.
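The three modes described above could be sketched roughly as follows. This is an illustration, not Steam's or Vosk's implementation: `FilterMode`, `SLURS`, `PROFANITY`, and `censor` are hypothetical names, and the word sets contain placeholders only.

```python
from enum import Enum


class FilterMode(Enum):
    NONE = 0        # filter nothing
    SLURS_ONLY = 1  # filter only slurs
    ALL = 2         # filter slurs and all other profanity


# Hypothetical word lists; placeholders stand in for real entries.
SLURS = {"slurword"}
PROFANITY = {"darn", "heck"}


def censor(text: str, mode: FilterMode) -> str:
    """Mask words in a space-separated transcript according to the mode."""
    if mode is FilterMode.NONE:
        return text

    # Slurs are blocked in both filtering modes; casual profanity only in ALL.
    blocked = set(SLURS)
    if mode is FilterMode.ALL:
        blocked |= PROFANITY

    return " ".join(
        "*" * len(w) if w.lower() in blocked else w for w in text.split()
    )


for mode in FilterMode:
    print(mode.name, "->", censor("heck that slurword", mode))
```

Keeping the two word lists separate is what makes the middle mode possible; a single merged list could only support on/off filtering.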

dev2devportal commented 3 years ago

FYI, our intern was able to get profanity filtering working. I'll see about posting the details of how he did it in the relevant tickets (I have to get permission from work to do so).

dev2devportal commented 3 years ago

Here is the implementation information on what we used: https://github.com/alphacep/vosk-server/issues/110

dev2devportal commented 3 years ago

We still have to migrate this into our official DEV and PRD cycle. We will let folks know how that goes, including any bugs or user issues found as it goes through official DEV and QA.