Open achsvg opened 3 months ago
Placeholders like instance_id in the cloudwatch alarm config are not replaced.
instance_id
I sent a PR here but looks like it fell through the cracks.
python 3.11.x ray 2.30.0
[ { "EvaluationPeriods":1, "ComparisonOperator":"GreaterThanThreshold", "AlarmActions":[ "arn:aws:sns:us-west-2:xxx:yyy" ], "Namespace":"ray-CWAgent-{cluster_name}", "AlarmDescription":"Memory used exceeds 90 percent for 5 minutes", "Period":300, "Threshold":90.0, "AlarmName":"{cluster_name} high mem_used_percent {instance_id}", "Dimensions":[ { "Name":"InstanceId", "Value":"{instance_id}" } ], "Statistic":"Average", "InsufficientDataActions":[ ], "OKActions":[ ], "ActionsEnabled":true, "MetricName":"mem_used_percent" } ]
High: It blocks me from completing my task.
thanks ... add some labels we'll take a look at it. feels pretty straightforward.
Hi @achsvg, could you also share your Ray cluster YAML config file?
Hi @achsvg, added my comment in this PR
What happened + What you expected to happen
Placeholders like
instance_id
in the cloudwatch alarm config are not replaced.I sent a PR here but looks like it fell through the cracks.
Versions / Dependencies
python 3.11.x ray 2.30.0
Reproduction script
Issue Severity
High: It blocks me from completing my task.