Closed johnbent closed 3 years ago
I've been considering a feature for woke that would allow inclusion of multiple "lists" that would allow you to "extend" your ruleset without including them by default. They could be included in woke, or maintained by the community on GitHub/remotely. Something along the lines of
extends:
- default
- https://github.com/get-woke/rulesets.git/vulgarity.yaml
- file:///home/me/myruleset.yaml
I came across https://github.com/hashicorp/go-getter the other day which would be interesting to use, but I haven't had the chance to dig into it much. This would give the ability for the community to maintain lists without forcing rules on all users. Thanks for bringing this up!
https://github.com/retextjs/retext-equality/tree/main/data/en is by far the largest word lists I've found publicly maintained.
Closing in favor of #104
Is your feature request related to a problem? Please describe. We are looking for a single tool to scan our code for problematic language. There are at least three types of problematic language that I think can be scanned:
Describe the solution you'd like A maintained word list for vulgarity and slurs.
Describe alternatives you've considered We could do it ourselves but we want to share this with the community and have confidence that we are using appropriate word lists.
Additional context It seems like a fair number of folks are building similar word lists. INI, inclusivelint. We should share a word list I think.