AutoscalingMetricSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)
The metric specification that defines the target resource utilization (CPU utilization, accelerator's duty cycle, and so on) for calculating the desired replica count.
Attributes |
|
---|---|
Name | Description |
metric_name |
str
Required. The resource metric name. Supported metrics: - For Online Prediction: - aiplatform.googleapis.com/prediction/online/accelerator/duty_cycle
- aiplatform.googleapis.com/prediction/online/cpu/utilization
|
target |
int
The target resource utilization in percentage (1% - 100%) for the given metric; once the real usage deviates from the target by a certain percentage, the machine replicas change. The default value is 60 (representing 60%) if not provided. |