NirantK / Hinglish

Hinglish Text Classification
MIT License
30 stars 10 forks source link

Find Top 200 samples to be most likely wrong using Cleanlab #5

Closed NirantK closed 4 years ago

NirantK commented 4 years ago

We will use these results to manually verify how much to trust this dataset labels itself

E.g. if the error is say, more than 5% - we will pass this value during test time prediction, which will have the same label error rate hopefully

NirantK commented 4 years ago

Fixed in #1