MachineSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)Specification of a single machine.
Attributes |
|
|---|---|
| Name | Description |
machine_type |
str
Immutable. The type of the machine. See the `list of machine types supported for prediction |
accelerator_type |
google.cloud.aiplatform_v1beta1.types.AcceleratorType
Immutable. The type of accelerator(s) that may be attached to the machine as per accelerator_count. |
accelerator_count |
int
The number of accelerators to attach to the machine. |
gpu_partition_size |
str
Optional. Immutable. The Nvidia GPU partition size. When specified, the requested accelerators will be partitioned into smaller GPU partitions. For example, if the request is for 8 units of NVIDIA A100 GPUs, and gpu_partition_size="1g.10gb", the service will create 8 \* 7 = 56 partitioned MIG instances. The partition size must be a value supported by the requested accelerator. Refer to `Nvidia GPU Partitioning |
tpu_topology |
str
Immutable. The topology of the TPUs. Corresponds to the TPU topologies available from GKE. (Example: tpu_topology: "2x2x1"). |
multihost_gpu_node_count |
int
Optional. Immutable. The number of nodes per replica for multihost GPU deployments. |
reservation_affinity |
google.cloud.aiplatform_v1beta1.types.ReservationAffinity
Optional. Immutable. Configuration controlling how this resource pool consumes reservation. |
min_gpu_driver_version |
str
Optional. Immutable. The minimum GPU driver version that this machine requires. For example, "535.104.06". If not specified, the default GPU driver version will be used by the underlying infrastructure. |
Methods
MachineSpec
MachineSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)Specification of a single machine.