
Manage and delete a cluster

Rename, check status, reconnect, and delete Slurm and Kubernetes clusters.

This guide covers common management tasks for Slurm and Kubernetes clusters after creation: checking status, updating details, reconnecting on-premises agents, and deleting a cluster.

Prerequisites

Check cluster status

  1. Click Clusters in the left sidebar and select the cluster type tab.
  2. The list shows each cluster with a status badge:

     | Status     | Meaning                             |
     |------------|-------------------------------------|
     | preparing  | Provisioning in progress            |
     | ready      | Healthy and accepting workloads     |
     | failed     | Provisioning or health check failed |
     | destroying | Teardown in progress                |

  3. Click a cluster name to open the detail page. The Status Details section shows timestamps and error messages for the current state.
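
If you handle these status values in your own scripts or notes, it can help to keep the four statuses and their meanings in one place. The following is a minimal sketch, not part of any Vantage tooling: the names `ClusterStatus`, `STATUS_MEANING`, `TRANSITIONAL`, and `describe` are hypothetical, and only the status strings and meanings come from the list above.

```python
from enum import Enum


class ClusterStatus(str, Enum):
    """Status badge values listed in this guide."""

    PREPARING = "preparing"
    READY = "ready"
    FAILED = "failed"
    DESTROYING = "destroying"


# Meanings copied from the status list in this guide.
STATUS_MEANING = {
    ClusterStatus.PREPARING: "Provisioning in progress",
    ClusterStatus.READY: "Healthy and accepting workloads",
    ClusterStatus.FAILED: "Provisioning or health check failed",
    ClusterStatus.DESTROYING: "Teardown in progress",
}

# Statuses that describe work in progress, so the state is expected to change.
TRANSITIONAL = {ClusterStatus.PREPARING, ClusterStatus.DESTROYING}


def describe(status_text: str) -> str:
    """Return a one-line description; raises ValueError for an unknown status."""
    status = ClusterStatus(status_text.strip().lower())
    suffix = " (still changing, check again later)" if status in TRANSITIONAL else ""
    return f"{status.value}: {STATUS_MEANING[status]}{suffix}"
```

For example, `describe("ready")` returns `"ready: Healthy and accepting workloads"`, while an unrecognized string raises `ValueError` rather than being silently accepted.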

Update cluster details

  1. Open the cluster detail page.
  2. Click Edit to modify the cluster name or description.
  3. Click Save.

Reconnect an on-premises agent

If an on-premises cluster shows a stale or disconnected status:

  1. Open the cluster detail page.
  2. Click Reconnect in the actions menu.
  3. Verify the agent process is running on your infrastructure. Check the agent logs for connectivity errors.

If the agent cannot reach Vantage, confirm outbound HTTPS (port 443) is open from your infrastructure.
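
One way to test outbound connectivity from the agent host is a short script that attempts a TCP connection on port 443. This is a minimal sketch using only the Python standard library. The function name `can_open_tcp` is hypothetical, and this guide does not state the Vantage hostname, so take the host value from your own agent configuration. A successful TCP connection shows only that the port is reachable; it does not validate TLS certificates or proxy settings.

```python
import socket


def can_open_tcp(host: str, port: int = 443, timeout: float = 5.0) -> tuple[bool, str]:
    """Try to open a TCP connection and report whether it succeeded."""
    try:
        # create_connection resolves the hostname and connects within the timeout.
        with socket.create_connection((host, port), timeout=timeout):
            return True, f"TCP connection to {host}:{port} succeeded"
    except OSError as exc:
        # Covers DNS failures, refused connections, and timeouts.
        return False, f"TCP connection to {host}:{port} failed: {exc}"


# Example usage (replace the placeholder with the endpoint your agent uses):
#   ok, message = can_open_tcp("<your-vantage-endpoint>", 443)
#   print(message)
```

If the check fails from the agent host but succeeds from another network, a firewall or egress rule between the agent and Vantage is a likely cause.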

Delete a cluster

  1. Open the cluster detail page.
  2. Click Delete in the actions menu.
  3. Confirm the deletion in the dialog.

Warning: Deleting a cluster is permanent. All running jobs are cancelled, all compute resources are deprovisioned, and any associated storage may be removed depending on the provider. Active clusters incur billing until deleted.

For cloud clusters (AWS, Azure, GCP), deletion triggers infrastructure teardown, which may take several minutes. The cluster status shows destroying until teardown completes.
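
If you script around deletion, the wait for teardown can be expressed as a simple polling loop. The sketch below is illustrative only: this guide does not describe a programmatic way to read cluster status, so `get_status` is a hypothetical callable that you would supply, and it is assumed to return the current status string, or `None` once the cluster no longer exists.

```python
import time
from typing import Callable, Optional


def wait_for_teardown(
    get_status: Callable[[], Optional[str]],
    poll_seconds: float = 30.0,
    timeout_seconds: float = 1800.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until the cluster is gone; return False if the timeout expires.

    get_status is a hypothetical, caller-supplied function (an assumption,
    not a Vantage API). It should return None once the cluster is deleted.
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        status = get_status()
        if status is None:
            # Assumed signal that teardown has finished.
            return True
        if time.monotonic() >= deadline:
            # Cluster still exists (for example, still "destroying").
            return False
        sleep(poll_seconds)
```

The `sleep` parameter exists so the loop can be exercised without real delays, for example by passing `sleep=lambda seconds: None` together with a stub `get_status`.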

What to do when a cluster fails

If a cluster is stuck in failed status:

  1. Check the Status Details section on the detail page for error messages.
  2. Verify your cloud account credentials are valid (see Manage cloud accounts).
  3. See Troubleshoot clusters for common failure patterns.