help needed for xgboost bring-your-own with distributed training

aws / amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Apache License 2.0

10.02k stars 6.75k forks source link

hi guys, we are trying to create our own Docker image for xgboost distributed training. We have managed to successfully create an image for single instance training, but are unable to figure out how to hook to the xgboost distributed backend.

I see there is an equivalent example for other algos - https://github.com/awslabs/amazon-sagemaker-examples/blob/0e17c31a69ad014bee71fa9c4d700d35cd30a421/advanced_functionality/fairseq_translation/fairseq/train#L112-L119

is there any way that you guys could share an example of a BYO distributed training docker image for xgboost ?

aws / amazon-sagemaker-examples

help needed for xgboost bring-your-own with distributed training #767