Closed raghavrv closed 6 years ago
I would like to solve this issue. Cause I have just achieved a multi-label classification using Matlab, I can try to collect data from the emotions dataset and test it using sklearn.
@wujw13 Please go ahead!!
@rvraghav93 I think the emotions dataset may not be used. In its End User License Agreement, it suggests that "The user may not distribute the dataset or portions thereof in any way". Is there second choice of a dataset that fits requirements?
Wait lets ping @arjoly or @amueller for suggestions :)
An another good choice is the "scene" dataset. I hope it is not "too" big in term of memory footprint.
All right. I will check this dataset.
yeast is smaller, right?
yes it's a bit smaller. yeast can be easily obtain using mldata (through fetch_mldata).
I would like to work on this issue if no one is working on it
Ping @rvraghav93 @arjoly @amueller I just want to make sure whether we are going to add the yeast dataset or something else ?
@kshitij10496 Yes please go ahead and add the yeast dataset. Refer #5325
@rvraghav93 Thanks I am working on it
@amueller as you said in your comment, is that still valid?
Will the first step towards completing PR https://github.com/scikit-learn/scikit-learn/pull/5960 be to address your comment?
Also I don't see no 'yeast' dataset included with scikit-learn.
Don't we have rcv1 as a real multilabel dataset with a fetcher? Yes, I think this can be closed.
Add a dataset with multi-label data.
Refer #5105
Preferably the emotions dataset