Summary of entries of Classes for google-cloud-gkerecommender.
Classes
GkeInferenceQuickstartAsyncClient
GKE Inference Quickstart (GIQ) service provides profiles with performance metrics for popular models and model servers across multiple accelerators. These profiles help generate optimized best practices for running inference on GKE.
GkeInferenceQuickstartClient
GKE Inference Quickstart (GIQ) service provides profiles with performance metrics for popular models and model servers across multiple accelerators. These profiles help generate optimized best practices for running inference on GKE.
FetchModelServerVersionsAsyncPager
A pager for iterating through fetch_model_server_versions requests.
This class thinly wraps an initial
FetchModelServerVersionsResponse object, and
provides an __aiter__ method to iterate through its
model_server_versions field.
If there are more pages, the __aiter__ method will make additional
FetchModelServerVersions requests and continue to iterate
through the model_server_versions field on the
corresponding responses.
All the usual FetchModelServerVersionsResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
FetchModelServerVersionsPager
A pager for iterating through fetch_model_server_versions requests.
This class thinly wraps an initial
FetchModelServerVersionsResponse object, and
provides an __iter__ method to iterate through its
model_server_versions field.
If there are more pages, the __iter__ method will make additional
FetchModelServerVersions requests and continue to iterate
through the model_server_versions field on the
corresponding responses.
All the usual FetchModelServerVersionsResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
FetchModelServersAsyncPager
A pager for iterating through fetch_model_servers requests.
This class thinly wraps an initial
FetchModelServersResponse object, and
provides an __aiter__ method to iterate through its
model_servers field.
If there are more pages, the __aiter__ method will make additional
FetchModelServers requests and continue to iterate
through the model_servers field on the
corresponding responses.
All the usual FetchModelServersResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
FetchModelServersPager
A pager for iterating through fetch_model_servers requests.
This class thinly wraps an initial
FetchModelServersResponse object, and
provides an __iter__ method to iterate through its
model_servers field.
If there are more pages, the __iter__ method will make additional
FetchModelServers requests and continue to iterate
through the model_servers field on the
corresponding responses.
All the usual FetchModelServersResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
FetchModelsAsyncPager
A pager for iterating through fetch_models requests.
This class thinly wraps an initial
FetchModelsResponse object, and
provides an __aiter__ method to iterate through its
models field.
If there are more pages, the __aiter__ method will make additional
FetchModels requests and continue to iterate
through the models field on the
corresponding responses.
All the usual FetchModelsResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
FetchModelsPager
A pager for iterating through fetch_models requests.
This class thinly wraps an initial
FetchModelsResponse object, and
provides an __iter__ method to iterate through its
models field.
If there are more pages, the __iter__ method will make additional
FetchModels requests and continue to iterate
through the models field on the
corresponding responses.
All the usual FetchModelsResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
FetchProfilesAsyncPager
A pager for iterating through fetch_profiles requests.
This class thinly wraps an initial
FetchProfilesResponse object, and
provides an __aiter__ method to iterate through its
profile field.
If there are more pages, the __aiter__ method will make additional
FetchProfiles requests and continue to iterate
through the profile field on the
corresponding responses.
All the usual FetchProfilesResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
FetchProfilesPager
A pager for iterating through fetch_profiles requests.
This class thinly wraps an initial
FetchProfilesResponse object, and
provides an __iter__ method to iterate through its
profile field.
If there are more pages, the __iter__ method will make additional
FetchProfiles requests and continue to iterate
through the profile field on the
corresponding responses.
All the usual FetchProfilesResponse attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.
Amount
Represents an amount of money in a specific currency.
Cost
Cost for running a model deployment on a given instance type. Currently, only USD currency code is supported.
.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
FetchBenchmarkingDataRequest
Request message for GkeInferenceQuickstart.FetchBenchmarkingData.
FetchBenchmarkingDataResponse
Response message for GkeInferenceQuickstart.FetchBenchmarkingData.
FetchModelServerVersionsRequest
Request message for GkeInferenceQuickstart.FetchModelServerVersions.
.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
FetchModelServerVersionsResponse
Response message for GkeInferenceQuickstart.FetchModelServerVersions.
FetchModelServersRequest
Request message for GkeInferenceQuickstart.FetchModelServers.
.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
FetchModelServersResponse
Response message for GkeInferenceQuickstart.FetchModelServers.
FetchModelsRequest
Request message for GkeInferenceQuickstart.FetchModels.
.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
FetchModelsResponse
Response message for GkeInferenceQuickstart.FetchModels.
FetchProfilesRequest
Request message for GkeInferenceQuickstart.FetchProfiles.
.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
FetchProfilesResponse
Response message for GkeInferenceQuickstart.FetchProfiles.
GenerateOptimizedManifestRequest
Request message for GkeInferenceQuickstart.GenerateOptimizedManifest.
GenerateOptimizedManifestResponse
Response message for GkeInferenceQuickstart.GenerateOptimizedManifest.
KubernetesManifest
A Kubernetes manifest.
MillisecondRange
Represents a range of latency values in milliseconds.
ModelServerInfo
Model server information gives. Valid model server info combinations can be found using GkeInferenceQuickstart.FetchProfiles.
PerformanceRange
Performance range for a model deployment.
PerformanceRequirements
Performance requirements for a profile and or model deployment.
.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
PerformanceStats
Performance statistics for a model deployment.
Profile
A profile containing information about a model deployment.
ResourcesUsed
Resources used by a model deployment.
StorageConfig
Storage configuration for a model deployment.
TokensPerSecondRange
Represents a range of throughput values in tokens per second.
Modules
pagers
API documentation for gkerecommender_v1.services.gke_inference_quickstart.pagers module.