Generative AI use case: Generate solutions for customer-support questions

Last reviewed 2025-12-16 UTC

This document provides a high-level architecture for an application that uses AI to generate solutions for support questions from customers.

The intended audience for this document includes architects, developers, and administrators who build and manage generative AI applications in the cloud. The document assumes that you have a foundational understanding of generative AI.

The Deployment section of this document provides code samples for AI-assisted customer support use cases.

Architecture

The following diagram shows an architecture for an AI-assisted support desk application in Google Cloud. The application receives questions from customers, retrieves relevant resources from a knowledge base, and then it generates solutions for the questions. The architecture is an implementation of the retrieval-augmented generation (RAG) approach.

Architecture for an application that uses AI to generate responses to support requests from customers.

The application in this architecture consists of containerized services that are deployed in a Google Kubernetes Engine (GKE) cluster. The architecture shows the following flow:

A customer submits a question to the support desk application.
The support desk application passes the customer's question to the knowledge retriever service.
The knowledge retriever service constructs and sends a prompt to Gemini API in Vertex AI to retrieve resources that are relevant to the customer's question.
Gemini identifies relevant resources from a support knowledge base that's stored in Cloud Storage.
Gemini returns the IDs of the relevant resources to the knowledge retriever service.
The knowledge retriever service retrieves the relevant resources from Cloud Storage.
The knowledge retriever service sends the customer's question and relevant resources to the solution generator service.
The solution generator service sends the resources to Gemini API in Vertex AI, with a prompt to generate a detailed solution for the customer's question.
Gemini generates a solution, such as step-by-step instructions or a video walkthrough.
The solution generator service provides the solution to the customer through the support desk application.

Products used

This example architecture uses the following Google Cloud products:

Google Kubernetes Engine (GKE): A Kubernetes service that you can use to deploy and operate containerized applications at scale using Google's infrastructure.
Vertex AI: An ML platform that lets you train and deploy ML models and AI applications, and customize LLMs for use in AI-powered applications.
Cloud Storage: A low-cost, no-limit object store for diverse data types. Data can be accessed from within and outside Google Cloud, and it's replicated across locations for redundancy.

Deployment

To experiment with AI-assisted customer support applications in Google Cloud, use the following code samples:

What's next

Build a customer support agent using Gemini.
Build AI agents for customer support use cases by using playbooks, flows, and data stores in Dialogflow CX.
Explore more generative AI architecture guides.
For an overview of architectural principles and recommendations that are specific to AI and ML workloads in Google Cloud, see the AI and ML perspective in the Well-Architected Framework.
For more reference architectures, diagrams, and best practices, explore the Cloud Architecture Center.

Contributors

Author: Kumar Dhanagopal | Cross-Product Solution Developer

Other contributors:

Amina Mansour | Head of Cloud Platform Evaluations Team
Megan O'Keefe | Developer Advocate
Samantha He | Technical Writer
Shir Meir Lador | Developer Relations Engineering Manager

Generative AI use case: Generate solutions for customer-support questions Stay organized with collections Save and categorize content based on your preferences.