You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I had searched in the issues and found no similar feature requirement.
Description
In production, ensuring that Ray cluster creation adheres to strict SLAs is crucial. Any unexpected delays in cluster creation can stem from various factors, such as prolonged image pull times, node creation delays, or suboptimal KubeRay settings. Currently, this process requires manual monitoring, which is cumbersome and prone to missing critical events or edge cases. Automating this monitoring is essential to improve reliability and efficiency.
Use case
Deeper insights and observability
Related issues
No response
Are you willing to submit a PR?
Yes I am willing to submit a PR!
The text was updated successfully, but these errors were encountered:
Search before asking
Description
In production, ensuring that Ray cluster creation adheres to strict SLAs is crucial. Any unexpected delays in cluster creation can stem from various factors, such as prolonged image pull times, node creation delays, or suboptimal KubeRay settings. Currently, this process requires manual monitoring, which is cumbersome and prone to missing critical events or edge cases. Automating this monitoring is essential to improve reliability and efficiency.
Use case
Deeper insights and observability
Related issues
No response
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: