You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Some issues occur only at large scale. In such cases, most computations have already been completed, and deadlines are sometimes stringent, leaving users with no opportunity to rerun the entire job. Therefore, we should provide CLIs to enable users to terminate specific Ray tasks or actor processes, triggering retries of failed tasks so that the job will not hang forever due to some corner cases.
This is useful when users need to babysit some workloads.