Lambda ranker does not throw error when labels are strings as oppose to ordinal int.

microsoft / NimbusML

Python machine learning package providing simple interoperability between ML.NET and scikit-learn components.

Other

281 stars 62 forks source link

My team is currently using LightGBMrank through nimbus for some ranking problems. However, we are a bit confused about the data type required for the label column – I couldn’t find too much documentation on this.

I tried a few iteration based off of the default example given in the LightGBMrank documentation, which had ordinal labels. Here are the iterations I tried:

The default, with ordinal labels
Changed data input to a data frame to make sure output is the same. It is.
Remapped labels to str format {0: “Bad”, 1: “Fair”, 2: “Good”, 3: “Excellent”}.
Remapped the ordering, and added a random label “Goofy”

The results of these 4 on NDCG are different, and none broke the classifier.

The ipython notebook attached has code to reproduce the issue.

lambdaRankTest.zip

Thanks, Mike

microsoft / NimbusML

Lambda ranker does not throw error when labels are strings as oppose to ordinal int. #94