Closed Ericxgao closed 1 year ago
Hi, I'm a bot from the Ray team :)
To help human contributors to focus on more relevant issues, I will automatically add the stale label to issues that have had no activity for more than 4 months.
If there is no further activity in the 14 days, the issue will be closed!
You can always ask for help on our discussion forum or Ray's public slack channel.
Hi again! The issue will be closed because there has been no more activity in the 14 days since the last message.
Please feel free to reopen or open a new issue if you'd still like it to be addressed.
Again, you can always ask for help on our discussion forum or Ray's public slack channel.
Thanks again for opening the issue!
What happened + What you expected to happen
I have the following deploy yaml:
When I run this, or if I manually start the head with the autoscaling_config, the worker will continue to crash loop with the raylet error:
Everything works if I manually start the servers myself and use ray start without specifying an autoscaling config - however, this is quite tedious if I want to scale up to more servers.
Versions / Dependencies
Ray 2.1.0 Python 3.9.12 Ubuntu 20.04
Reproduction script
ray bootstrap config:
Issue Severity
High: It blocks me from completing my task.