Vertex AI v1beta1 API - Class MachineSpec (1.0.0-beta62)

public sealed class MachineSpec : IMessage<MachineSpec>, IEquatable<MachineSpec>, IDeepCloneable<MachineSpec>, IBufferMessage, IMessage

Reference documentation and code samples for the Vertex AI v1beta1 API class MachineSpec.

Specification of a single machine.

Inheritance

object > MachineSpec

Namespace

Google.Cloud.AIPlatform.V1Beta1

Assembly

Google.Cloud.AIPlatform.V1Beta1.dll

Constructors

MachineSpec()

public MachineSpec()

MachineSpec(MachineSpec)

public MachineSpec(MachineSpec other)

Parameter
Name Description
other MachineSpec
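
The two constructors can be sketched as follows. This is a minimal illustration, assuming the Google.Cloud.AIPlatform.V1Beta1 package is referenced; the machine type, accelerator type, and count are example values chosen for the sketch, not recommendations from this reference.

```csharp
using System;
using Google.Cloud.AIPlatform.V1Beta1;

// Parameterless constructor combined with an object initializer.
// All field values here are illustrative assumptions.
var original = new MachineSpec
{
    MachineType = "n1-standard-4",
    AcceleratorType = AcceleratorType.NvidiaTeslaT4,
    AcceleratorCount = 1,
};

// Copy constructor: a separate instance that starts with the same field values.
var copy = new MachineSpec(original);

// Mutating the copy does not affect the original.
copy.AcceleratorCount = 2;

Console.WriteLine(original.AcceleratorCount);
Console.WriteLine(original.Equals(copy));
```

Because MachineSpec implements IEquatable<MachineSpec>, Equals compares field values, so the two instances stop comparing equal once the copy is modified.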

Properties

AcceleratorCount

public int AcceleratorCount { get; set; }

The number of accelerators to attach to the machine.

Property Value
Type Description
int

AcceleratorType

public AcceleratorType AcceleratorType { get; set; }

Immutable. The type of accelerator(s) that may be attached to the machine, as per accelerator_count (AcceleratorCount).

Property Value
Type Description
AcceleratorType

GpuPartitionSize

public string GpuPartitionSize { get; set; }

Optional. Immutable. The Nvidia GPU partition size.

When specified, the requested accelerators will be partitioned into smaller GPU partitions. For example, if the request is for 8 units of NVIDIA A100 GPUs, and gpu_partition_size="1g.10gb", the service will create 8 * 7 = 56 partitioned MIG instances.

The partition size must be a value supported by the requested accelerator. Refer to Nvidia GPU Partitioning for the available partition sizes.

If set, the accelerator_count should be set to 1.

Property Value
Type Description
string
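
A partitioned request might be expressed as in the sketch below. The specific values are assumptions made for illustration: "a2-highgpu-1g" as the machine type and "1g.5gb" as the partition size (an NVIDIA MIG profile name for the 40 GB A100). Whether the service accepts this combination should be checked against the GPU partitioning documentation referenced above.

```csharp
using System;
using Google.Cloud.AIPlatform.V1Beta1;

// Illustrative values only; verify supported combinations before use.
var partitioned = new MachineSpec
{
    MachineType = "a2-highgpu-1g",
    AcceleratorType = AcceleratorType.NvidiaTeslaA100,
    // The description above says the count should be 1 when a partition size is set.
    AcceleratorCount = 1,
    GpuPartitionSize = "1g.5gb",
};

// Protobuf messages print as JSON-like text, which is handy for inspection.
Console.WriteLine(partitioned);
```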

MachineType

public string MachineType { get; set; }

Immutable. The type of the machine.

See the list of machine types supported for prediction.

See the list of machine types supported for custom training.

For DeployedModel, this field is optional, and the default value is n1-standard-2. For BatchPredictionJob, or as part of WorkerPoolSpec, this field is required.

Property Value
Type Description
string
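
Since the field is required in some contexts and optional in others, a caller may want to check it before building a request. The sketch below assumes a hypothetical helper, RequireMachineType, which is not part of the library; the machine type and replica count are example values. An unset string field on a generated message reads as the empty string.

```csharp
using System;
using Google.Cloud.AIPlatform.V1Beta1;

// Hypothetical helper (not a library API): rejects a spec without a machine type,
// for contexts such as WorkerPoolSpec where the field is required.
static MachineSpec RequireMachineType(MachineSpec spec)
{
    if (string.IsNullOrEmpty(spec.MachineType))
    {
        throw new ArgumentException("MachineType must be set for this use.", nameof(spec));
    }
    return spec;
}

// An unset MachineType is the empty string, not null.
var unset = new MachineSpec();
bool unsetIsEmpty = unset.MachineType == "";

// Attach a validated spec to a worker pool. A real training job needs further
// fields on the pool (for example a container or Python package spec).
var pool = new WorkerPoolSpec
{
    MachineSpec = RequireMachineType(new MachineSpec { MachineType = "n1-standard-4" }),
    ReplicaCount = 1,
};

Console.WriteLine(pool.MachineSpec.MachineType);
```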

MinGpuDriverVersion

public string MinGpuDriverVersion { get; set; }

Optional. Immutable. The minimum GPU driver version that this machine requires. For example, "535.104.06". If not specified, the default GPU driver version will be used by the underlying infrastructure.

Property Value
Type Description
string

MultihostGpuNodeCount

public int MultihostGpuNodeCount { get; set; }

Optional. Immutable. The number of nodes per replica for multihost GPU deployments.

Property Value
Type Description
int

ReservationAffinity

public ReservationAffinity ReservationAffinity { get; set; }

Optional. Immutable. Configuration controlling how this resource pool consumes a reservation.

Property Value
Type Description
ReservationAffinity

TpuTopology

public string TpuTopology { get; set; }

Immutable. The topology of the TPUs. Corresponds to the TPU topologies available from GKE. For example, tpu_topology: "2x2x1".

Property Value
Type Description
string