Base64 編碼音訊內容

將音訊資料傳送至 Speech-to-Text API 時，您可以直接傳送資料 (在要求的 content 欄位中)，也可以在要求的 uri 欄位中指定儲存物件，讓 API 對儲存在 Cloud Storage 值區的資料執行遠端辨識。

HTTP 要求中 content 欄位的音訊資料必須採用 Base64 格式。本頁說明如何將二進位檔案中的音訊轉換為 base64 編碼資料。

使用指令列

在 gRPC 要求內，您只需直接寫出二進位資料即可；但是，當提出 REST 要求時，會使用 JSON。JSON 是文字格式，不直接支援二進位資料，因此您需要使用 Base64 編碼，將這類二進位資料轉換為文字。

大多數開發環境都包含原生 base64 公用程式，可將二進位編碼為 ASCII 文字資料。如要編碼檔案：

Linux

使用 base64 指令列工具對檔案進行編碼，並使用 -w 0 標記確保不會換行：

base64 INPUT_FILE -w 0 > OUTPUT_FILE

macOS

使用 base64 指令列工具編碼檔案：

base64 -i INPUT_FILE -o OUTPUT_FILE

Windows

使用 Base64.exe 工具編碼檔案：

Base64.exe -e INPUT_FILE > OUTPUT_FILE

PowerShell

使用 Convert.ToBase64String 方法對檔案進行編碼：

[Convert]::ToBase64String([IO.File]::ReadAllBytes("./INPUT_FILE")) > OUTPUT_FILE

建立 JSON 要求檔案，並內嵌 Base64 編碼資料：

JSON

{
  "recognizer": "projects/PROJECT_ID/locations/global/recognizers/_",
  "content": "ZkxhQwAAACIQABAAAAUJABtAA+gA8AB+W8FZndQvQAyjv..."
}

使用用戶端程式庫

透過文字編輯器將二進位資料嵌入要求中，既不理想也不實用。實際上，您會將 base64 編碼的檔案嵌入用戶端程式碼內。所有支援的程式設計語言都擁有適用於 base64 編碼內容的內建機制。

Python

在 Python 中，會以下列方式對音訊檔案進行 base64 編碼：

# Import the base64 encoding library.
import base64

# Pass the audio data to an encoding function.
def encode_audio(audio_file):
    with open(audio_file, "rb") as f:
        encoded_content = base64.b64encode(f.read())
    return encoded_content

Node.js

// Read the file into memory.
var fs = require('fs');
var audioFile = fs.readFileSync('/full/path/to/audio/file.wav');

// Convert the audio data to a Buffer and base64 encode it.
var encoded = Buffer.from(audioFile).toString('base64');

Java

// Import the Base64 encoding library.
import org.apache.commons.codec.binary.Base64;

// Encode the audio.
byte[] audioData = Base64.encodeBase64(audioFile.getBytes());
String encodedString = Base64.getEncoder().encodeToString(audioData);

Go

import (
    "bufio"
    "encoding/base64"
    "io"
    "os"
)

// Open audio file.
f, _ := os.Open("/full/path/to/audio/file.wav")

// Read entire audio into byte slice.
reader := bufio.NewReader(f)
content, _ := io.ReadAll(reader)

// Encode audio as base64.
base64.StdEncoding.EncodeToString(content)