Create a Workbench compute pool
Compute pools provide the infrastructure that Workbench sessions, training jobs, and endpoints run on. This guide walks through creating a new pool from the Workbench interface.
Prerequisites
- A Kubernetes cluster in ready status
- Admin permissions on the workspace
Create a pool
- Click Workbench in the left sidebar, then click Pools.
- Click
New Pool. - Select the Kubernetes cluster the pool will be created on.
- Configure the pool:
- Name -- a descriptive identifier (for example,
gpu-trainingorcpu-general) - Role --
control(cluster management) orworker(compute workloads) - Instance types (AWS) -- click to open the instance browser and select one or more EC2 types
- Profile (non-AWS) -- choose Small (4 vCPU, 8 GiB), Medium (8 vCPU, 16 GiB), or Large (16 vCPU, 32 GiB)
- Min size -- minimum number of nodes
- Max size -- maximum number of nodes for autoscaling
- Name -- a descriptive identifier (for example,
- Optionally add Labels (key-value pairs for scheduling) and Taints (to restrict which workloads run on this pool).
- Click
Create.
tip
Set min=0 for scale-to-zero behaviour. Nodes are provisioned on demand and terminated when idle. This reduces costs for bursty workloads but adds 2--5 minutes of startup latency.
Verify the pool
The new pool appears in the Pools list with a node count and status. Sessions and training jobs can now target this pool when selecting compute resources.
Related
- Manage compute profiles -- configure named sizing options
- Manage compute pools -- edit and delete pools from the cluster detail page