Reference documentation and code samples for the Cloud Speech-to-Text V2 API module Google::Cloud::Speech::V2::ExplicitDecodingConfig::AudioEncoding.
Supported audio data encodings.
Constants
AUDIO_ENCODING_UNSPECIFIED
value: 0
Default value. This value is unused.
LINEAR16
value: 1
Headerless 16-bit signed little-endian PCM samples.
MULAW
value: 2
Headerless 8-bit companded mu-law samples.
ALAW
value: 3
Headerless 8-bit companded A-law samples.
AMR
value: 4
AMR frames with the header defined in RFC 4867, Section 5.
AMR_WB
value: 5
AMR-WB frames with the header defined in RFC 4867, Section 5.
FLAC
value: 6
FLAC frames in the "native FLAC" container format.
MP3
value: 7
MPEG audio frames with optional (ignored) ID3 metadata.
OGG_OPUS
value: 8
Opus audio frames in an Ogg container.
WEBM_OPUS
value: 9
Opus audio frames in a WebM container.
MP4_AAC
value: 10
AAC audio frames in an MP4 container.
M4A_AAC
value: 11
AAC audio frames in an M4A container.
MOV_AAC
value: 12
AAC audio frames in a MOV container.
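
As an illustration of how the constants above might be selected in practice, the following minimal sketch guesses an encoding from a file extension. It uses only the Ruby standard library: `AUDIO_ENCODINGS` is a hypothetical local mirror of the names and values listed on this page, and `EXTENSION_TO_ENCODING` and `guess_audio_encoding` are assumptions made for this example, not part of the client library. A file extension is only a hint, since a container can hold a different codec than its name suggests, and the headerless formats (LINEAR16, MULAW, ALAW) are left out because they cannot be identified this way.

```ruby
# Hypothetical local mirror of the constants documented above. The
# authoritative definitions live in
# Google::Cloud::Speech::V2::ExplicitDecodingConfig::AudioEncoding.
AUDIO_ENCODINGS = {
  AUDIO_ENCODING_UNSPECIFIED: 0,
  LINEAR16: 1,
  MULAW: 2,
  ALAW: 3,
  AMR: 4,
  AMR_WB: 5,
  FLAC: 6,
  MP3: 7,
  OGG_OPUS: 8,
  WEBM_OPUS: 9,
  MP4_AAC: 10,
  M4A_AAC: 11,
  MOV_AAC: 12
}.freeze

# Assumed extension-to-encoding hints, chosen for this example only.
# Headerless formats are omitted because an extension says nothing
# reliable about raw sample data.
EXTENSION_TO_ENCODING = {
  ".amr"  => :AMR,
  ".awb"  => :AMR_WB,
  ".flac" => :FLAC,
  ".mp3"  => :MP3,
  ".opus" => :OGG_OPUS,
  ".webm" => :WEBM_OPUS,
  ".mp4"  => :MP4_AAC,
  ".m4a"  => :M4A_AAC,
  ".mov"  => :MOV_AAC
}.freeze

# Returns the encoding name guessed from the file extension, or
# :AUDIO_ENCODING_UNSPECIFIED when the extension is not in the table.
def guess_audio_encoding(path)
  extension = File.extname(path).downcase
  EXTENSION_TO_ENCODING.fetch(extension, :AUDIO_ENCODING_UNSPECIFIED)
end

encoding = guess_audio_encoding("interview.FLAC")
puts "#{encoding} (value: #{AUDIO_ENCODINGS[encoding]})"
```

The sketch returns a symbol rather than the integer value because generated protobuf classes in Ruby generally accept enum fields as symbols, so the result could plausibly be passed as the `encoding` field when building an `ExplicitDecodingConfig`. Treat an `AUDIO_ENCODING_UNSPECIFIED` result as a signal to determine the encoding some other way, not as a usable setting.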