Add a way to label specific examples or force relabeling on old examples.
The person labeling should have the same context the model will have at inference. We should also filter (remove only mentions, replace mentions with @user, ...) and preprocess the data.
Sort by model uncertainty. We should focus on False Positives and labels without much confidence. Similar to what Prodigy does.
A few ideas we can implement.