ray-project / xgboost_ray

Distributed XGBoost on Ray
Apache License 2.0

rabit timeout #256

Open Biu-G opened 1 year ago

Biu-G commented 1 year ago

xgboost.core.XGBoostError: [23:55:55] /home/conda/feedstock_root/build_artifacts/xgboost-split_1667849645640/work/rabit/include/rabit/internal/socket.h:170: Poll timeout

What params should I pass to avoid this timeout? I'm training on a huge dataset.

Yard1 commented 1 year ago

Could you provide some more details? What is your cluster setup, what are the data size and shape, and what does your training script look like? Does this issue also happen with a smaller dataset?