Closed phossen closed 3 years ago
Hi @phossen,
Thank you for reporting this. The toolkit we are using is Ray (i.e., RLToolkit.RAY
).
This error is due to updating the SageMaker SDK.
For a quick fix, please edit the image_name
argument to image_uri
as shown below.
estimator = RLEstimator(entry_point="train-mabs.py",
source_dir='training/training_src',
dependencies=["training/common/sagemaker_rl", "inference/inference_src/", "../BattlesnakeGym/"],
image_uri=image_name,
role=role,
train_instance_type=instance_type,
train_instance_count=1,
output_path=s3_output_path,
base_job_name=job_name_prefix,
metric_definitions=metric_definitions,
hyperparameters={
# See train-mabs.py to add additional hyperparameters
# Also see ray_launcher.py for the rl.training.* hyperparameters
"num_iters": 10,
# number of snakes in the gym
"num_agents": num_agents,
"iterate_map_size": False,
"map_size": map_size,
"algorithm": algorithm,
"additional_configs": additional_config,
"use_heuristics_action_masks": False
}
)
Thank you, fixed it!
Trying to follow the instructions from the markdown files, I struggle with the
RLLibEnv/2_PolicyTraining.ipynb
. In the cell which starts the training, the RLEstimator expects three further argumentstoolkit
,toolki_version
, andframework
. I fixed this with the following lines:After fixing that, the next problem occurred. When the RLEstimator is calling the
train-mabs.py
with the parameters. It seems to lack an installation of the requirements.txt in the created docker container. Ray is not installed, but doesn't seem to be the only problem. Output: