A description of resources that are used for performing batch operations, are dedicated to a Model, and need manual configuration.
Required. Immutable. The specification of a single machine.
startingReplicaCount integer
Immutable. The number of machine replicas used at the start of the batch operation. If not set, Vertex AI chooses the starting number, which will not exceed maxReplicaCount.
maxReplicaCount integer
Immutable. The maximum number of machine replicas the batch operation may be scaled to. The default value is 10.
Optional. Immutable. If set, use Dynamic Workload Scheduler (DWS) resources to schedule the deployment workload. Reference: https://cloud.google.com/blog/products/compute/introducing-dynamic-workload-scheduler
spot boolean
Optional. If true, schedule the deployment workload on spot VMs.
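The fields above can be assembled into a request fragment along the following lines. This is a minimal sketch, not an official client: the helper name `build_batch_dedicated_resources` is hypothetical, the `machineType` key and the value `n1-standard-4` inside `machineSpec` are illustrative assumptions (the machine specification's own fields are not described here), and the client-side check that an explicit startingReplicaCount does not exceed maxReplicaCount is an assumption extrapolated from the field descriptions, not documented server behavior.

```python
from typing import Any, Dict, Optional

# Default stated in the maxReplicaCount field description.
DEFAULT_MAX_REPLICA_COUNT = 10


def build_batch_dedicated_resources(
    machine_spec: Dict[str, Any],
    starting_replica_count: Optional[int] = None,
    max_replica_count: Optional[int] = None,
    spot: Optional[bool] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: build a dedicated-resources dict for a batch job."""
    # machineSpec is marked Required, so reject an empty specification.
    if not machine_spec:
        raise ValueError("machineSpec is required")

    # Use the documented default when maxReplicaCount is omitted.
    effective_max = (
        DEFAULT_MAX_REPLICA_COUNT if max_replica_count is None else max_replica_count
    )

    # Assumption: an explicit starting count should not exceed the maximum.
    if starting_replica_count is not None and starting_replica_count > effective_max:
        raise ValueError(
            f"startingReplicaCount ({starting_replica_count}) exceeds "
            f"maxReplicaCount ({effective_max})"
        )

    body: Dict[str, Any] = {"machineSpec": dict(machine_spec)}
    # Only emit optional fields that the caller actually set, so that
    # unset fields fall back to the service-side behavior.
    if starting_replica_count is not None:
        body["startingReplicaCount"] = starting_replica_count
    if max_replica_count is not None:
        body["maxReplicaCount"] = max_replica_count
    if spot is not None:
        body["spot"] = spot
    return body


if __name__ == "__main__":
    # "machineType" is an assumed key used for illustration only.
    resources = build_batch_dedicated_resources(
        {"machineType": "n1-standard-4"},
        starting_replica_count=2,
        max_replica_count=5,
        spot=True,
    )
    print(resources)
```

Because the fields are immutable, validating them before submission avoids having to recreate the batch operation to correct a misconfigured replica range.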
| JSON representation |
|---|
{ "machineSpec": { object ( |