Just like in the original PCC paper we'd like to introduce a way to remove labels from a given label vector easily. These are a few methods that come to mind:
[ ] by-threshold: only retain labels that occur in, say 95% of the instances
[ ] most-frequent: take only the top k labels that occur the most frequent (see PCC paper)
Just like in the original PCC paper we'd like to introduce a way to remove labels from a given label vector easily. These are a few methods that come to mind:
by-threshold
: only retain labels that occur in, say95%
of the instancesmost-frequent
: take only the topk
labels that occur the most frequent (see PCC paper)