FetchModelServerVersionsRequest(
mapping=None, *, ignore_unknown_fields=False, **kwargs
)Request message for GkeInferenceQuickstart.FetchModelServerVersions.
.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
Attributes |
|
|---|---|
| Name | Description |
model |
str
Required. The model for which to list model server versions. Open-source models follow the Huggingface Hub owner/model_name format. Use
GkeInferenceQuickstart.FetchModels
to find available models.
|
model_server |
str
Required. The model server for which to list versions. Open-source model servers use simplified, lowercase names (e.g., vllm). Use
GkeInferenceQuickstart.FetchModelServers
to find available model servers.
|
page_size |
int
Optional. The target number of results to return in a single response. If not specified, a default value will be chosen by the service. Note that the response may include a partial list and a caller should only rely on the response's next_page_token to determine if there are more instances left to be queried. This field is a member of oneof_ _page_size.
|
page_token |
str
Optional. The value of next_page_token received from a previous FetchModelServerVersionsRequest
call. Provide this to retrieve the subsequent page in a
multi-page list of results. When paginating, all other
parameters provided to FetchModelServerVersionsRequest
must match the call that provided the page token.
This field is a member of oneof_ _page_token.
|