Closed zardilior closed 5 years ago
@vzhou842 we have seen there are no good spanish profanity checkers available, we would be interested in generating that labeled spanish dataset to train on. How would this dataset be in order to be useful to you?
You'd just need to have a large size of examples of both profane spanish text and clean (not profane) spanish text. Ideally you'd probably want 100k+ examples.
mmm.... If I make an open source list it might be possible
... That would be great.
You'd need a labeled spanish dataset to train on, which unfortunately I don't have.