implement label space down sampling

ChristianSch / skml

scikit-learn compatibel multi-label classification

http://skml.readthedocs.io/en/latest/

MIT License

6 stars 3 forks source link

implement label space down sampling #19

Open ChristianSch opened 6 years ago

ChristianSch commented 6 years ago

Just like in the original PCC paper we'd like to introduce a way to remove labels from a given label vector easily. These are a few methods that come to mind:

[ ] by-threshold: only retain labels that occur in, say 95% of the instances
[ ] most-frequent: take only the top k labels that occur the most frequent (see PCC paper)