dmlc / xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
https://xgboost.readthedocs.io/en/stable/
Apache License 2.0
25.78k stars 8.69k forks source link

Missing XGBoostRanker in xgboost4j-spark jvm package #10118

Open Meleeri opened 3 months ago

Meleeri commented 3 months ago

On the latest version (2.03) of xgboost4j-spark XGBoostClassifer and XGBoostRegressor are available for classification and regression trainning tasks, but there seems to be no equivalent pipeline for learning-to-rank tasks. How should I train a ranking model using current pipelines or lower level apis?Has anyone successfully run it on spark before?

wbo4958 commented 3 months ago

No, XGBoost JVM package doesn't support XGBoostRank for now, but I think it's the right time to support it in XGBoost. @trivialfis

trivialfis commented 3 months ago

The JVM package implements LTR as regression. Yeah, we can have some additional features in there, but let me finish the networking first, it's a huge time killer

wbo4958 commented 3 months ago

Yeah, We should move this part from regression to XGBoostRank just like xgboost pyspark