Class MachineSpec (1.151.0)

MachineSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Specification of a single machine.

Attributes

Name Description
machine_type str
Immutable. The type of the machine. See the `list of machine types supported for prediction
accelerator_type google.cloud.aiplatform_v1beta1.types.AcceleratorType
Immutable. The type of accelerator(s) that may be attached to the machine as per accelerator_count.
accelerator_count int
The number of accelerators to attach to the machine.
gpu_partition_size str
Optional. Immutable. The Nvidia GPU partition size. When specified, the requested accelerators will be partitioned into smaller GPU partitions. For example, if the request is for 8 units of NVIDIA A100 GPUs, and gpu_partition_size="1g.10gb", the service will create 8 \* 7 = 56 partitioned MIG instances. The partition size must be a value supported by the requested accelerator. Refer to `Nvidia GPU Partitioning
tpu_topology str
Immutable. The topology of the TPUs. Corresponds to the TPU topologies available from GKE. (Example: tpu_topology: "2x2x1").
multihost_gpu_node_count int
Optional. Immutable. The number of nodes per replica for multihost GPU deployments.
reservation_affinity google.cloud.aiplatform_v1beta1.types.ReservationAffinity
Optional. Immutable. Configuration controlling how this resource pool consumes reservation.
min_gpu_driver_version str
Optional. Immutable. The minimum GPU driver version that this machine requires. For example, "535.104.06". If not specified, the default GPU driver version will be used by the underlying infrastructure.

Methods

MachineSpec

MachineSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Specification of a single machine.