Xtra-Computing / thundersvm

ThunderSVM: A Fast SVM Library on GPUs and CPUs
Apache License 2.0

Non-sparse formats #170

Closed: beevabeeva closed this issue 4 years ago

beevabeeva commented 4 years ago

Hi,

I have been planning to use ThunderSVM (and ThunderGBM) in my research. Briefly, this research consists of implementing state-of-the-art GPU algorithms in the ATM framework. The ATM framework uses a non-sparse (dense) matrix as its input format. I have posted an issue regarding this on their repo too. I know that your work is based on LIBSVM and that the input format is somewhat of a contentious issue. But my question is: how difficult would it be to implement support for non-sparse (non-LIBSVM) input formats? In particular, ATM expects input in the form:

feature_01,feature_02,feature_03,feature_04,class
5.1,3.5,1.4,0.2,Iris-setosa
4.9,3.0,1.4,0.2,Iris-setosa
4.7,3.2,1.3,0.2,Iris-setosa
4.6,3.1,1.5,0.2,Iris-setosa

Thanks,

Avi

zeyiwen commented 4 years ago

A parser for the data format you have mentioned will be needed. You are more than welcome to contribute.
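
As a rough illustration of what such a parser might involve, here is a minimal Python sketch that converts the dense CSV layout shown above into LIBSVM-style lines (`label index:value ...`). It is not part of ThunderSVM; the function name `dense_csv_to_libsvm` is hypothetical, and the sketch assumes a header row, numeric features, and the class label in the last column. Class names are mapped to integer ids in order of first appearance, which is also an assumption made for illustration.

```python
import csv
import io


def dense_csv_to_libsvm(csv_text):
    """Convert dense CSV text into LIBSVM-format lines.

    Hypothetical helper, not a ThunderSVM API. Assumes a header row,
    numeric feature columns, and the class label in the last column.
    Returns the converted lines and the label-to-integer mapping used.
    """
    reader = csv.reader(io.StringIO(csv_text))
    next(reader)  # skip the header row
    label_ids = {}
    lines = []
    for row in reader:
        if not row:  # ignore blank lines
            continue
        *features, label = row
        # Assign integer ids to class names in order of first appearance.
        label_id = label_ids.setdefault(label, len(label_ids))
        # LIBSVM feature indices are 1-based; zero-valued features are omitted.
        pairs = [
            f"{index}:{value.strip()}"
            for index, value in enumerate(features, start=1)
            if float(value) != 0.0
        ]
        lines.append(" ".join([str(label_id)] + pairs))
    return lines, label_ids


if __name__ == "__main__":
    sample = (
        "feature_01,feature_02,feature_03,feature_04,class\n"
        "5.1,3.5,1.4,0.2,Iris-setosa\n"
        "4.9,3.0,1.4,0.2,Iris-setosa\n"
    )
    converted, mapping = dense_csv_to_libsvm(sample)
    for line in converted:
        print(line)
    print(mapping)
```

The resulting lines could be written to a text file and used wherever LIBSVM-format input is expected. Keeping the returned mapping makes it possible to translate predicted integer labels back to the original class names.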