Gemini Enterprise Agent Platform Vision (formerly Vertex AI Vision) is an AI-powered platform to ingest, analyze and store video data. Agent Platform Vision lets users build and deploy applications with a simplified user interface.
Using Agent Platform Vision you can build end-to-end computer image solutions by leveraging Agent Platform Vision's integration with other major components, namely Live Video Analytics, data streams, and Vision Warehouse. The Gemini Enterprise Agent Platform Vision API lets you build a high level app from low level APIs, and create and update a high level workflow that combines multiple individual API calls. You can then execute your workflow as a unit by making a single deploy request to the Agent Platform Vision platform server.
Using Agent Platform Vision, you can:
- Ingest real-time video data
- Analyze data for insights using general and custom vision AI models
- Store insights in Vision Warehouse for simplified querying and metadata information
Agent Platform Vision workflow
The steps you complete to use Agent Platform Vision are as follows:
Ingest real-time data
Agent Platform Vision's architecture lets you quickly and conveniently stream real-time video ingestion infrastructure in a public Cloud.
Analyze data
After data is ingested, Agent Platform Vision's framework provides you with straightforward access and orchestration of a large and growing portfolio of general, custom, & specialized analysis models.
Store and query output
After your app analyzes your data you can send this information to a storage destination (Vision Warehouse or BigQuery), or receive the data live. With Vision Warehouse you can send your app output to a warehouse that generalizes your search work and serves multiple data types and use cases.
A note on Responsible AI
At Google Cloud, we prioritize helping customers safely develop and implement solutions using Gemini Enterprise Agent Platform Vision. For Gemini Enterprise Agent Platform Vision, we've worked to develop fair and equitable performance in accordance with Google's AI Principles.
This work includes testing for bias during development, for example looking at performance across different skin tones, and developing product features to enhance privacy and limit personal identification, like person and face blur. We are committed to iterating and improving, and we will continue to incorporate best practices and lessons learned into our Vertex AI products.
When Gemini Enterprise Agent Platform Vision is integrated into a customer's unique organizational context, there are likely to be additional responsible AI considerations. We encourage customers to leverage fairness, interpretability, privacy and security best practices when implementing Gemini Enterprise Agent Platform Vision, especially when building custom or AutoML trained models. Throughout this technical documentation, we have provided additional guidance and resources to support this work. To learn more, read about Google's recommendations for Responsible AI practices.
What's next
- Read more in the blog post "Vertex AI Vision: Easily build and deploy computer vision applications at scale".
- Learn details about specific models in the Occupancy analytics guide, Person blur guide, Person/vehicle detector guide, or Motion filtering guide.
- Try Agent Platform Vision in the Google Cloud console by reading the Build an app in the console quickstart.
- Set up your local environment to use Agent Platform Vision.