Interface StorageConfigOrBuilder (0.1.0)

public interface StorageConfigOrBuilder extends MessageOrBuilder

Implements

MessageOrBuilder

Methods

getModelBucketUri()

public abstract String getModelBucketUri()

Optional. The Google Cloud Storage bucket URI to load the model from. This URI must point to the directory containing the model's config file (config.json) and model weights. A tuned GCSFuse setup can improve LLM Pod startup time by more than 7x. Expected format: gs://<bucket-name>/<path-to-model>.

string model_bucket_uri = 1 [(.google.api.field_behavior) = OPTIONAL];

Returns
Type: String
Description: The modelBucketUri.
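
The expected format gs://<bucket-name>/<path-to-model> can be checked before the value is used. The following is a minimal sketch using only the Java standard library. The GcsUri class is a hypothetical helper, not part of this client library, and the rule that both the bucket and the path must be non-empty is an assumption drawn from the format string above.

```java
import java.util.Objects;

/** Hypothetical helper (not part of the client library) that splits a gs:// URI. */
final class GcsUri {
    private static final String SCHEME = "gs://";

    final String bucket;
    final String path;

    private GcsUri(String bucket, String path) {
        this.bucket = bucket;
        this.path = path;
    }

    /** Parses gs://<bucket-name>/<path-to-model>; throws IllegalArgumentException otherwise. */
    static GcsUri parse(String uri) {
        Objects.requireNonNull(uri, "uri");
        if (!uri.startsWith(SCHEME)) {
            throw new IllegalArgumentException("Expected a gs:// URI but got: " + uri);
        }
        String rest = uri.substring(SCHEME.length());
        int slash = rest.indexOf('/');
        // Assumption: require a non-empty bucket and a non-empty path after the first slash.
        if (slash <= 0 || slash == rest.length() - 1) {
            throw new IllegalArgumentException(
                "Expected gs://<bucket-name>/<path-to-model> but got: " + uri);
        }
        return new GcsUri(rest.substring(0, slash), rest.substring(slash + 1));
    }

    public static void main(String[] args) {
        // The URI below is a placeholder, not a real bucket.
        GcsUri parsed = GcsUri.parse("gs://example-bucket/models/example-model");
        System.out.println("bucket=" + parsed.bucket + ", path=" + parsed.path);
    }
}
```

With the real interface, the string passed to parse would presumably come from getModelBucketUri(). Because the field is optional, an empty value should be handled before parsing.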

getModelBucketUriBytes()

public abstract ByteString getModelBucketUriBytes()

Optional. The Google Cloud Storage bucket URI to load the model from. This URI must point to the directory containing the model's config file (config.json) and model weights. A tuned GCSFuse setup can improve LLM Pod startup time by more than 7x. Expected format: gs://<bucket-name>/<path-to-model>.

string model_bucket_uri = 1 [(.google.api.field_behavior) = OPTIONAL];

Returns
Type: ByteString
Description: The bytes for modelBucketUri.
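
For protobuf string fields, the Bytes accessor carries the UTF-8 encoding of the same value that the String accessor returns. The sketch below illustrates that relationship with plain byte arrays so that it runs without the protobuf dependency. UriBytesSketch and its methods are hypothetical names used for illustration only.

```java
import java.nio.charset.StandardCharsets;

/** Hypothetical illustration of the String/bytes relationship, using byte[] instead of ByteString. */
final class UriBytesSketch {
    /** Encodes the URI as UTF-8, the encoding protobuf uses for string fields. */
    static byte[] toUtf8(String uri) {
        return uri.getBytes(StandardCharsets.UTF_8);
    }

    /** Decodes UTF-8 bytes back into the original string. */
    static String fromUtf8(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Placeholder URI, not a real bucket.
        String uri = "gs://example-bucket/models/example-model";
        byte[] encoded = toUtf8(uri);
        System.out.println("round trip ok: " + fromUtf8(encoded).equals(uri));
    }
}
```

With the real types, ByteString.toStringUtf8() performs the decoding step. The String accessor is usually the more convenient choice unless the raw bytes are being forwarded without decoding.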

getXlaCacheBucketUri()

public abstract String getXlaCacheBucketUri()

Optional. The URI of the Google Cloud Storage (GCS) bucket containing the XLA compilation cache. If using TPUs, the XLA cache will be written to the same path as model_bucket_uri. Reusing the cache can speed up vLLM model preparation for repeated deployments.

string xla_cache_bucket_uri = 2 [(.google.api.field_behavior) = OPTIONAL];

Returns
Type: String
Description: The xlaCacheBucketUri.
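
Because both URI fields are optional strings, their getters return an empty string rather than null when the field is unset, which is standard behavior for protobuf-generated Java code. The sketch below shows one way to branch on that. StorageConfigView is a hypothetical stand-in that mirrors only the two String getters documented on this page, so that the example runs without the client library.

```java
/** Hypothetical sketch of handling unset (empty) optional URI fields. */
final class StorageUriDefaults {
    /** Stand-in for the two String getters documented on this page. */
    interface StorageConfigView {
        String getModelBucketUri();
        String getXlaCacheBucketUri();
    }

    /** Builds a fixed view; a real caller would receive a StorageConfigOrBuilder instead. */
    static StorageConfigView of(final String modelUri, final String xlaCacheUri) {
        return new StorageConfigView() {
            @Override public String getModelBucketUri() { return modelUri; }
            @Override public String getXlaCacheBucketUri() { return xlaCacheUri; }
        };
    }

    /** Describes a URI field, treating the empty string as "not set". */
    static String describe(String label, String uri) {
        return uri.isEmpty() ? label + ": not set" : label + ": " + uri;
    }

    public static void main(String[] args) {
        // Placeholder model URI; the XLA cache URI is left unset (empty).
        StorageConfigView config = of("gs://example-bucket/models/example-model", "");
        System.out.println(describe("model", config.getModelBucketUri()));
        System.out.println(describe("xla cache", config.getXlaCacheBucketUri()));
    }
}
```

Checking isEmpty() rather than comparing against null matches how generated getters behave for string fields.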

getXlaCacheBucketUriBytes()

public abstract ByteString getXlaCacheBucketUriBytes()

Optional. The URI of the Google Cloud Storage (GCS) bucket containing the XLA compilation cache. If using TPUs, the XLA cache will be written to the same path as model_bucket_uri. Reusing the cache can speed up vLLM model preparation for repeated deployments.

string xla_cache_bucket_uri = 2 [(.google.api.field_behavior) = OPTIONAL];

Returns
Type: ByteString
Description: The bytes for xlaCacheBucketUri.