Open xiaozhongtian opened 5 years ago
Does the same issue affect distributed XGBoost without dask (e.g. https://xgboost.readthedocs.io/en/release_0.72/tutorials/aws_yarn.html)?
I haven't tried it, maybe i will try next Monday. but I found that https://xgboost.readthedocs.io/en/release_0.72/tutorials/aws_yarn.html is not existed in the latest version. https://xgboost.readthedocs.io/en/latest/tutorials/aws_yarn.html It's really intresting.
Hello, I find maybe a bug about the XgboostClassifier in dask.xgboost.
with the intial xgboost , we can easily get 100% accuracy.
with the same parameter and the same data, we can only get 66% accuracy and the problem is that the estimator with predict() only returns 1 all the time. The 66% have no sense.
This is a simple example to show the bug. I have tested on my project with titanic dataset and it has the same problem.
est.predict(df).compute()
return 1 for all the df.