I have written a very basic Data Programming model for labeling the twitter dataset of adrmine. This currently is a "bad model" with an accuracy of 40% and biased to a single class in our binary classification among 0 and 1. This may have arisen due to a lot of reasons including -
Quality of LFs(Labelling Functions).
Coverage of quality LFs.
Choice of ML model.
I am self assigning this issue currently. But it would be great to have you work on this.
I have written a very basic Data Programming model for labeling the twitter dataset of adrmine. This currently is a "bad model" with an accuracy of 40% and biased to a single class in our binary classification among 0 and 1. This may have arisen due to a lot of reasons including -