aws / sagemaker-spark

A Spark library for Amazon SageMaker.
https://aws.github.io/sagemaker-spark/
Apache License 2.0
298 stars 126 forks source link

Similar to SageMakerEstimator Is there a way to set hyperParameters map to XGBoostSageMakerEstimator #91

Open sajjap opened 5 years ago

sajjap commented 5 years ago

System Information

Describe the problem

Similar to SageMakerEstimator Is there a way to set hyperParameters map to XGBoostSageMakerEstimator, I do not see that it is an option in API docs. If not will you be able to enable this feature?

knakad commented 5 years ago

Hi @sajjap! Thanks for reaching out! Unfortunately, there are no immediate plans to add this in the near future. That being said, we're always reprioritizing our backlog based on customer feedback and will keep this in mind in the future.

Closing this issue as a duplicate of https://github.com/aws/sagemaker-spark/issues/55 and noting the additional feature request.

sajjap commented 5 years ago

Hi @knakad, Thanks for a quick response. But this isn't related to auto model tuning. I was referring to something similar to the highlighted in below example from sagemaker-spark github page but with the XGBoostSageMakerEstimator

Instead of creating a KMeansSageMakerEstimator, you can create an equivalent SageMakerEstimator:

val estimator = new SageMakerEstimator( trainingImage = "382416733822.dkr.ecr.us-east-1.amazonaws.com/kmeans:1", modelImage = "382416733822.dkr.ecr.us-east-1.amazonaws.com/kmeans:1", requestRowSerializer = new ProtobufRequestRowSerializer(), responseRowDeserializer = new KMeansProtobufResponseRowDeserializer(), hyperParameters = Map("k" -> "10", "feature_dim" -> "784"), sagemakerRole = IAMRole(roleArn), trainingInstanceType = "ml.p2.xlarge", trainingInstanceCount = 1, endpointInstanceType = "ml.c4.xlarge", endpointInitialInstanceCount = 1, trainingSparkDataFormat = "sagemaker")

knakad commented 5 years ago

Oops! Thanks for the callout =)

Oooh, I follow. Thanks for the clarification! We'll keep this open and revisit the priority :)