Customer Experience Insights audio redaction

Customer Experience Insights audio redaction is an open-source framework that automatically detects and redacts sensitive personally identifiable information (PII) from audio recordings. It provides a secure, scalable, and customizable solution for organizations that need to process audio data while maintaining compliance with privacy regulations.

This solution acts as middleware between your raw audio ingestion points and your long-term storage or analytics platforms. It automatically processes audio files to remove sensitive data such as:

Credit card numbers
Social Security numbers
Phone numbers
Email addresses
Custom entities defined by your organization

By redacting this information before permanent storage, you keep sensitive data out of your data lakes and analytics tools, which reduces compliance risk and data exposure.
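As an illustration, the categories listed above can be expressed as a Sensitive Data Protection inspection configuration. The following is a minimal sketch, not the framework's actual configuration: the built-in infoType names are real Sensitive Data Protection identifiers, but the `build_inspect_config` helper, the `ACCOUNT_ID` custom entity, and its regular expression are hypothetical and shown only to suggest how an organization-specific entity might be added.

```python
# Built-in Sensitive Data Protection infoTypes matching the categories above.
BUILT_IN_INFO_TYPES = [
    "CREDIT_CARD_NUMBER",
    "US_SOCIAL_SECURITY_NUMBER",
    "PHONE_NUMBER",
    "EMAIL_ADDRESS",
]


def build_inspect_config(custom_regexes=None):
    """Return an inspect-config dict covering built-in and custom entities.

    custom_regexes maps a custom infoType name to a regex pattern.
    This helper is hypothetical and not part of the framework.
    """
    config = {
        "info_types": [{"name": name} for name in BUILT_IN_INFO_TYPES],
        "include_quote": True,
    }
    if custom_regexes:
        config["custom_info_types"] = [
            {"info_type": {"name": name}, "regex": {"pattern": pattern}}
            for name, pattern in custom_regexes.items()
        ]
    return config


# Assumed example: an organization-specific account identifier.
inspect_config = build_inspect_config({"ACCOUNT_ID": r"ACC-\d{8}"})
```

A dictionary in this shape can be passed as the inspection configuration when calling Sensitive Data Protection from a Python client. How the framework itself stores or loads its entity list is described in its repository.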

Key features

Automated redaction: Uses Speech-to-Text and Sensitive Data Protection to transcribe, identify, and redact sensitive segments from audio files.
Dual trigger support: Flexible deployment options let you trigger redaction either immediately upon file upload to Cloud Storage or downstream, after processing by Customer Experience Insights.
Enterprise security: Designed with a "secure by default" architecture, Customer Experience Insights audio redaction supports private VPCs, VPC Service Controls, and supply-chain security checks to prevent unauthorized access and code tampering.
Scalability: Built on Dataflow, the pipeline automatically scales to handle massive volumes of audio data, making it suitable for high-throughput contact centers.
Verifiable output: Produces both the redacted audio file and a redaction transcript, enabling verification and audit trails.

How it works

The solution follows a linear processing pipeline:

Ingestion and triggering. The pipeline starts in one of two ways:
- Option A (Direct): An audio file is uploaded to a "raw" Cloud Storage bucket. A Cloud Run trigger immediately launches the pipeline.
- Option B (Insights): Customer Experience Insights processes a conversation. A Pub/Sub message triggers the Cloud Run service, which then launches the pipeline.
Processing (Dataflow). The Dataflow worker does the following:
1. Retrieves the audio file.
2. Sends the audio to Speech-to-Text to generate a time-stamped transcript.
3. Sends the transcript to Sensitive Data Protection to identify the time offsets of sensitive information.
4. Uses ffmpeg to replace the audio segments that correspond to the identified PII with silence or a tone.
Output. A Cloud Run function does the following:
1. Optional: Moves the original raw audio to a secure "archive" bucket.
2. Writes the redacted audio back to the original location or a specified output bucket, replacing the sensitive segments with silence or a tone.
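The core of the processing stage, turning text findings into muted audio ranges, can be sketched as follows. This is a simplified illustration under stated assumptions, not the framework's implementation: it assumes that the transcript is the words joined by single spaces, that each finding is a pair of character offsets into that transcript, and that silencing uses ffmpeg's `volume` filter with timeline editing. The `Word` type, all function names, and the padding value are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # start time in seconds
    end: float    # end time in seconds


def char_spans(words):
    """Yield (char_start, char_end, word) for words joined by single spaces."""
    pos = 0
    for w in words:
        yield pos, pos + len(w.text), w
        pos += len(w.text) + 1


def pii_time_ranges(words, findings, pad=0.1):
    """Map character-offset findings to merged (start, end) time ranges."""
    ranges = []
    for f_start, f_end in findings:
        hit = [w for s, e, w in char_spans(words) if s < f_end and e > f_start]
        if hit:
            ranges.append((max(0.0, hit[0].start - pad), hit[-1].end + pad))
    ranges.sort()
    merged = []
    for start, end in ranges:
        if merged and start <= merged[-1][1]:
            # Overlapping or touching ranges collapse into one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def silence_filter(ranges):
    """Build an ffmpeg audio filter that mutes every given time range."""
    cond = "+".join(f"between(t,{s:.2f},{e:.2f})" for s, e in ranges)
    return f"volume=enable='{cond}':volume=0"


def ffmpeg_command(src, dst, ranges):
    """Return the ffmpeg argument list; copy the audio if nothing to redact."""
    if not ranges:
        return ["ffmpeg", "-i", src, "-c", "copy", dst]
    return ["ffmpeg", "-i", src, "-af", silence_filter(ranges), dst]


words = [
    Word("my", 0.0, 0.3), Word("card", 0.3, 0.7), Word("is", 0.7, 0.9),
    Word("4111", 1.0, 1.8), Word("1111", 1.8, 2.6), Word("thanks", 2.8, 3.2),
]
# One finding covering "4111 1111" (characters 11 to 20 of the transcript).
ranges = pii_time_ranges(words, [(11, 20)])
command = ffmpeg_command("raw.wav", "redacted.wav", ranges)
```

The argument list can be passed to `subprocess.run` without a shell, which avoids quoting problems with the filter expression. The small padding around each range is a design choice that guards against imprecise word timestamps, at the cost of muting slightly more audio.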

Get started

To use this solution, you need a Google Cloud project with billing enabled.

Prerequisites

Google Cloud project
gcloud CLI installed and authenticated
Dataflow, Speech-to-Text, and Sensitive Data Protection enabled in the project, with permission to use each service

Access and deployment

The full source code and deployment instructions are available in the GitHub repository.