udibr / noisy_labels

TRAINING DEEP NEURAL-NETWORKS USING A NOISE ADAPTATION LAYER
What are the simplest methods for the label noise problem? #5

Open guotong1988 opened 4 years ago

guotong1988 commented 4 years ago

Suppose I have enough low-quality data labeled by unsupervised or rule-based methods.

I looked through https://github.com/subeeshvasu/Awesome-Learning-with-Label-Noise, but those methods are a little complex for me.

In detail, I am working on a multi-label classification task. First I crawl web pages such as wiki pages and use regex-based rules to assign the labels. The model input is the wiki title, and the target output is the set of labels that the rules matched in the wiki content. My task is to predict the labels for a wiki title.

Do you think that removing the data a trained model predicts to be wrong is a simple but effective method?

@udibr Thank you very much!

Billy1900 commented 3 years ago

I think it is. But how would you know that a data point is wrong? What is your definition of "wrong"?
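
For illustration only, here is one possible working definition of "wrong" for the multi-label setup described above. It is a sketch under assumptions that nobody in this thread has confirmed: each example is treated as suspect when the model confidently contradicts at least one of its rule-based labels. The function name `flag_suspect_examples`, the thresholds `low` and `high`, and the toy numbers are all hypothetical. The probabilities are assumed to be out-of-fold, i.e. produced by a model that was not trained on the example being scored, since a model scored on its own training data may simply have memorized the noisy labels.

```python
from typing import List, Sequence


def flag_suspect_examples(
    noisy_labels: Sequence[Sequence[int]],
    oof_probs: Sequence[Sequence[float]],
    low: float = 0.1,   # hypothetical threshold: "confidently absent"
    high: float = 0.9,  # hypothetical threshold: "confidently present"
) -> List[bool]:
    """Return True for each example whose rule-based labels are
    confidently contradicted by the model on at least one label."""
    flags = []
    for labels, probs in zip(noisy_labels, oof_probs):
        suspect = False
        for y, p in zip(labels, probs):
            if y == 1 and p < low:
                # The rule assigned the label, but the model is confident it is absent.
                suspect = True
            elif y == 0 and p > high:
                # The rule did not assign the label, but the model is confident it applies.
                suspect = True
        flags.append(suspect)
    return flags


# Toy data (made up): 3 examples, 3 labels each.
noisy_labels = [
    [1, 0, 0],
    [0, 1, 0],
    [1, 1, 0],
]
# Assumed to be out-of-fold predicted probabilities for the same examples.
oof_probs = [
    [0.95, 0.05, 0.10],
    [0.02, 0.03, 0.20],
    [0.80, 0.60, 0.97],
]

flags = flag_suspect_examples(noisy_labels, oof_probs)
# Indices of the examples that would be kept for retraining.
kept = [i for i, is_suspect in enumerate(flags) if not is_suspect]
```

With this definition, only confident disagreements lead to removal, so uncertain predictions (such as 0.60 for an assigned label) leave the example in place. Whether dropping the whole example or only the contradicted label works better likely depends on the data.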