- NAME
- 
- gcloud ai endpoints stream-direct-predict - run Vertex AI online stream direct prediction
 
- SYNOPSIS
- 
- 
gcloud ai endpoints stream-direct-predict (ENDPOINT : --region=REGION) --json-request=JSON_REQUEST [GCLOUD_WIDE_FLAG …]
 
- 
- DESCRIPTION
- 
gcloud ai endpoints stream-direct-predict sends a stream direct prediction request to a Vertex AI endpoint for the given inputs. The request limit is 10MB.
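Because requests over the limit are not accepted, it can help to measure a request body before sending it. The following is a minimal sketch, not part of gcloud: the names `MAX_REQUEST_BYTES`, `request_size_bytes`, and `fits_request_limit` are hypothetical, and the reading of "10MB" as 10,000,000 bytes is an assumption (the page does not say whether the limit is decimal or binary, so the stricter value is used).

```python
import json

# Assumption: "10MB" is read as 10,000,000 bytes, the stricter of the
# decimal and binary interpretations. The page does not specify which applies.
MAX_REQUEST_BYTES = 10_000_000


def request_size_bytes(body: dict) -> int:
    """Return the size in bytes of the UTF-8 encoded JSON form of body."""
    return len(json.dumps(body).encode("utf-8"))


def fits_request_limit(body: dict, limit: int = MAX_REQUEST_BYTES) -> bool:
    """Return True if the serialized body is no larger than the assumed limit."""
    return request_size_bytes(body) <= limit
```

The size measured here is that of the local serialization only; the service may count bytes differently, so treat the check as an early warning rather than a guarantee.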
- EXAMPLES
- 
To stream direct predict against an endpoint 123 on project example in region us-central1, run:
gcloud ai endpoints stream-direct-predict 123 --project=example --region=us-central1 --json-request=input.json
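When the command is driven from a script, the argument list can be assembled programmatically. The sketch below builds the same invocation as the example using only the flags documented on this page; the helper name `build_stream_direct_predict_argv` is hypothetical, and the sketch only constructs and prints the command without running it.

```python
import shlex
from typing import List, Optional


def build_stream_direct_predict_argv(
    endpoint: str,
    region: str,
    json_request: str,
    project: Optional[str] = None,
) -> List[str]:
    """Assemble the argv list for gcloud ai endpoints stream-direct-predict."""
    argv = ["gcloud", "ai", "endpoints", "stream-direct-predict", endpoint]
    if project is not None:
        # --project is a gcloud-wide flag, so it is optional here.
        argv.append(f"--project={project}")
    argv.append(f"--region={region}")
    argv.append(f"--json-request={json_request}")
    return argv


argv = build_stream_direct_predict_argv(
    "123", "us-central1", "input.json", project="example"
)
# Print a shell-safe rendering of the command for inspection.
print(shlex.join(argv))
```

Passing the list form to `subprocess.run(argv, check=True)` avoids shell quoting issues, assuming gcloud is installed and authenticated on the machine.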
- POSITIONAL ARGUMENTS
- 
- 
Endpoint resource - The endpoint to do online stream direct prediction. The
arguments in this group can be used to specify the attributes of this resource.
(NOTE) Some attributes are not given arguments in this group but can be set in
other ways.
To set the project attribute:
  - provide the argument endpoint on the command line with a fully specified name;
  - provide the argument --project on the command line;
  - set the property core/project.
This must be specified.
- ENDPOINT
- 
ID of the endpoint or fully qualified identifier for the endpoint.
To set the name attribute:
  - provide the argument endpoint on the command line.
This positional argument must be specified if any of the other arguments in this group are specified.
- --region=REGION
- 
Cloud region for the endpoint.
To set the region attribute:
  - provide the argument endpoint on the command line with a fully specified name;
  - provide the argument --region on the command line;
  - set the property ai/region;
  - choose one from the prompted list of available regions.
 
- REQUIRED FLAGS
- 
- --json-request=JSON_REQUEST
- 
Path to a local file containing the body of a JSON request.
An example of a JSON request:
    {
      "inputs": [
        {"dtype": "STRING", "shape": [1], "string_val": ["hello world"]},
        {"dtype": "INT32", "shape": [1], "int_val": [42]}
      ]
    }
This flag accepts "-" for stdin.
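A request file of the shape described for --json-request can also be generated rather than written by hand. The sketch below is illustrative only: the helpers `string_tensor`, `int32_tensor`, and `write_request` are hypothetical names, and they assume one-dimensional tensors whose shape is simply the number of values, which matches the example request but is not stated as a general rule on this page.

```python
import json
import os
import tempfile


def string_tensor(values):
    """Build a 1-D STRING tensor entry (assumes shape is [len(values)])."""
    return {"dtype": "STRING", "shape": [len(values)], "string_val": list(values)}


def int32_tensor(values):
    """Build a 1-D INT32 tensor entry (assumes shape is [len(values)])."""
    return {"dtype": "INT32", "shape": [len(values)], "int_val": list(values)}


def write_request(path, body):
    """Write the request body to path as JSON, for use with --json-request."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(body, f)


# Same content as the example request on this page.
request = {"inputs": [string_tensor(["hello world"]), int32_tensor([42])]}

# Write the body to a temporary file and read it back to confirm it is valid JSON.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "input.json")
    write_request(path, request)
    with open(path, encoding="utf-8") as f:
        round_tripped = json.load(f)
```

Since the flag accepts "-" for stdin, the output of `json.dumps(request)` could instead be piped to the command with --json-request=- so that no intermediate file is needed.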
 
- GCLOUD WIDE FLAGS
- 
These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.
Run $ gcloud help for details.
- NOTES
- 
These variants are also available:
gcloud alpha ai endpoints stream-direct-predict
gcloud beta ai endpoints stream-direct-predict
  Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-07-22 UTC.