aws / amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
https://sagemaker-examples.readthedocs.io
Apache License 2.0
10.02k stars 6.75k forks source link

help needed for xgboost bring-your-own with distributed training #767

Open sandys opened 5 years ago

sandys commented 5 years ago

hi guys, we are trying to create our own Docker image for xgboost distributed training. We have managed to successfully create an image for single instance training, but are unable to figure out how to hook to the xgboost distributed backend.

I see there is an equivalent example for other algos - https://github.com/awslabs/amazon-sagemaker-examples/blob/0e17c31a69ad014bee71fa9c4d700d35cd30a421/advanced_functionality/fairseq_translation/fairseq/train#L112-L119

is there any way that you guys could share an example of a BYO distributed training docker image for xgboost ?

aws-patlin commented 4 years ago

Hi @sandys,

We recently released an example notebook for distributed XGBoost in script mode. Please take a look and let us know if this answers your question!