aksnzhy / xlearn

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
https://xlearn-doc.readthedocs.io/en/latest/index.html
Apache License 2.0

What is the format of training data? #347

Open CoinCheung opened 4 years ago

CoinCheung commented 4 years ago

Hi,

I see the example dataset at demo/classification/titanic/titanic_train.txt. The first 5 lines are:

0       1       0       1       0       0       0       1       0       1       0       0       1       -0.56136323207  -0.502445171436
1       1       0       0       1       1       0       0       1       0       1       0       0       0.613181832266  0.786845293588
1       0       0       1       0       0       0       1       1       0       0       0       1       -0.267726965986 -0.488854257585
1       1       0       0       1       0       0       1       1       0       1       0       0       0.392954632703  0.420730236069
0       0       0       1       0       0       0       1       0       1       0       0       1       0.392954632703  -0.486337421687

I cannot tell which column in each line is the label and which columns are the training features. Could you please tell me which one is the label and which ones make up the feature vector?
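
To make the question concrete, here is a minimal sketch of how I would parse a line if the first column were the label and the remaining columns were features. That layout is only my guess and is not confirmed anywhere in this thread, and `split_line` is a hypothetical helper of my own, not part of xlearn:

```python
# Sketch only: ASSUMES column 0 is the label and all other columns are features.
# This assumption is unconfirmed; split_line is a hypothetical helper, not an xlearn API.

# First line of demo/classification/titanic/titanic_train.txt, as quoted above.
sample = (
    "0       1       0       1       0       0       0       1       0       "
    "1       0       0       1       -0.56136323207  -0.502445171436"
)


def split_line(line):
    """Split one whitespace-separated line into (label, features)."""
    fields = line.split()  # splits on any run of spaces or tabs
    label = int(fields[0])  # assumed label column
    features = [float(x) for x in fields[1:]]  # assumed feature columns
    return label, features


label, features = split_line(sample)
print("label:", label)
print("number of features:", len(features))
```

If the label is actually in a different column, only the two index expressions in `split_line` would need to change.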