aws / sagemaker-python-sdk

A library for training and deploying machine learning models on Amazon SageMaker
https://sagemaker.readthedocs.io/
Apache License 2.0
2.09k stars 1.13k forks source link

csv_weights cannot be set #1135

Open tigerhawkvok opened 4 years ago

tigerhawkvok commented 4 years ago

Reference: 0420645671

Please fill out the form below.

System Information

Describe the problem

Can't establish an XGBoost estimator with csv_weights set to 1, as per https://docs.aws.amazon.com/sagemaker/latest/dg/xgboost.html :

To differentiate the importance of labelled data points use Instance Weight Supports

Amazon SageMaker XGBoost allows customers to differentiate the importance of labelled data points by assigning each instance a weight value. For text/libsvm input, customers can assign weight values to data instances by attaching them after the labels. For example, label:weight idx_0:val_0 idx_1:val_1.... For text/csv input, customers need to turn on the csv_weights flag in the parameters and attach weight values in the column after labels. For example: label,weight,val_0,val_1,...).

Now, the docs don't say where the XGBoost class takes the argument, so I tried the obvious locations which all failed.

Minimal repro / logs

As an estimator paramter:

image

As a float in the estimator fitting:

image

As a string in the estimator fitting:

image

As a fit parameter:

image

ChoiByungWook commented 4 years ago

Reference: 0420645671