aforadi closed this issue 1 year ago
Quick update: I was able to run it as a job by removing the version restrictions on the packages below in the environment file:
azureml-mlflow, azureml-defaults
Interactive mode still remains a challenge, though.
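For anyone wanting to reproduce the unpinning step, here is a minimal sketch of dropping version specifiers from selected entries in a conda environment file. The file contents, environment name, and version numbers in the example are hypothetical (the actual ray_conda_env.yml is not shown in this thread), and `unpin` is an illustrative helper, not part of ray-on-aml or Azure ML.

```python
import re

# Packages to unpin, following the comment above
# (azureml-defaults is the name the package carries on PyPI).
UNPIN = {"azureml-mlflow", "azureml-defaults"}

# Matches a YAML list item such as "  - azureml-mlflow==1.43.0":
# group 1 is the indentation and dash, group 2 the package name,
# group 3 whatever follows the name (for example a version specifier).
_ITEM = re.compile(r"^(\s*-\s*)([A-Za-z0-9_.\-]+)(.*)$")


def unpin(env_text, packages=UNPIN):
    """Return env_text with version specifiers dropped for `packages`."""
    out = []
    for line in env_text.splitlines():
        m = _ITEM.match(line)
        if m and m.group(2).lower() in packages:
            # Keep only the list marker and the bare package name.
            line = m.group(1) + m.group(2)
        out.append(line)
    return "\n".join(out)


# Hypothetical environment file; names and versions are made up.
example = """\
name: ray-env
dependencies:
  - python=3.8
  - pip:
      - ray-on-aml
      - azureml-mlflow==1.43.0
      - azureml-defaults==1.43.0
"""

print(unpin(example))
```

Entries not listed in `UNPIN`, such as the Python version, are left untouched, so the rest of the environment stays as it was.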
aforadi, the error in interactive mode may have something to do with your VNET configuration. Can you check the NSG on the subnet for any special restrictions?
For interactive use, you could try downgrading ray-on-aml and pinning the protobuf package version.
I'll investigate the root cause.
pip install ray-on-aml==0.1.8
ray_on_aml = Ray_On_AML(ws=ws, compute_cluster="d15-v2", additional_pip_packages=['protobuf==3.20.1'], maxnode=2)
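After a downgrade like this, it can help to confirm which versions the notebook kernel actually sees. The following is a small sketch using only the Python standard library (`importlib.metadata`, available from Python 3.8); `installed_versions` is an illustrative helper, and the package list is an assumption based on the names mentioned in this thread. It only inspects the local kernel, not the packages installed on the cluster nodes.

```python
from importlib.metadata import PackageNotFoundError, version

# Distributions mentioned in this thread; adjust as needed.
PACKAGES = ["ray-on-aml", "protobuf", "azureml-mlflow", "azureml-defaults"]


def installed_versions(names=PACKAGES):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            found[name] = None
    return found


for name, ver in installed_versions().items():
    print(f"{name}: {ver or 'not installed'}")
```

Run in the same kernel as the notebook, this shows whether the pinned ray-on-aml version took effect there before the cluster is started.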
Please use the new ray-on-aml version.
Hi @james-tn ,
Copying the issue from: https://github.com/james-tn/ray-on-aml/issues/24 with some modifications.
Thank you for this library. We are trying to use it with the example code (https://github.com/microsoft/ray-on-aml/blob/master/examples/quick_start_examples.ipynb) in an interactive environment in Azure ML. The Jupyter notebook is a Python 3.8 Azure ML notebook. We are using the latest version of ray-on-aml, 0.2.1.
The image builds correctly on Azure ML. However, the cluster doesn't start. Below is what we see in the notebook:
And the following error inside the ray_on_aml experiment:
This error occurs with ci_is_head set to either True or False. All machines are inside the same VNET.
We are also facing an error while running as a job. Scripts below:
ray_test.py
ray_trigger.py
ray_conda_env.yml
We get the following error:
Let me know if anything is wrong with our setup or if this is an issue with the library.
Thanks a lot!