Added train_model.py and made the necessary modifications

menkotoglou commented 4 years ago

Reverse engineered training script
Updated versions of scikit-learn and included it in setup.py and requirements.txt
Retrained model with training data and updated version as in (2)
Dropped support for Python3.5 (see Travis configuration), because it was no longer compatible with current scikit-learn version
Fixed test as one test case is now below threshold

We tested these changes on a private dataset with the following results:

Before:

Predicted Actual | Not Profane(0) | Profane(1) Not Profane(0) | 703 | 14 Profane(1) | 93 | 39

Accuracy Score: 87.4%

After:

Predicted Actual | Not Profane(0) | Profane(1) Not Profane(0) | 697 | 20 Profane(1) | 87 | 45

Accuracy Score: 87.4%

ieshaan12 commented 3 years ago

@koti How do I use this build?

dimitrismistriotis commented 3 years ago

@koti How do I use this build?

By referencing the other repository. For pip + "requirements.txt", use the following instead of "profanity-check":

-e git+https://github.com/dimitrismistriotis/profanity-check.git#egg=profanity-check

Also check this issue here if @vzhou842 accepts it, you can bring back profanity-check.

ieshaan12 commented 3 years ago

@dimitrismistriotis Thanks! I was wondering if we could implement a function which censors content like the profanity package?

dimitrismistriotis commented 3 years ago

@dimitrismistriotis Thanks! I was wondering if we could implement a function which censors content like the profanity package?

Censor is a very broad concept, also didn't get the "like the profanity package" part: profanity detects

vzhou842 / profanity-check