Open czlaugh opened 2 months ago
Please ensure that your autogluon.cloud
is updated to 0.4.x and sagemaker
to above 2.220. Try running the given code to successfully set up the ray cluster.
import pandas as pd
from autogluon.cloud import TabularCloudPredictor
import boto3
train_data = pd.read_csv("https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv")
test_data.drop(columns=["class"], inplace=True)
predictor_init_args = {
"label": "class"
}
predictor_fit_args = {
"train_data": train_data,
"time_limit": 120
}
cloud_predictor = TabularCloudPredictor(
cloud_output_path="<your_s3_path>",
backend="ray_aws"
).fit(
predictor_init_args=predictor_init_args,
predictor_fit_args=predictor_fit_args,
instance_type="ml.m5.4xlarge",
wait=True,
)
Please give this code a try and let us know if it resolves your issue.
When making this call, the head node EC2 server is established:
After the head node is setup, the API call produces this output ending with a complaint about rsync file missing.