Skip to main content

Supported Input Audio

SpatialReal services currently accept mono 16-bit PCM (s16le) audio as raw PCM bytes.

Format

Raw PCM bytes in s16le format

Channels

1 channel (mono)

Bit Depth

16-bit samples

Sample Rates

8000 to 48000 Hz from the supported set below

Specification

PropertyValue
Sample RateOne of 8000, 16000, 22050, 24000, 32000, 44100, 48000
Channels1 (mono)
Bit Depth16-bit
FormatRaw PCM bytes
PCM Encodings16le
If your source audio uses stereo channels, floating point samples, compressed codecs, or an unsupported sample rate, convert it before sending it to SpatialReal.

What s16le Means

  • s16 means each audio sample is a signed 16-bit integer.
  • le means the bytes are stored in little-endian order.
  • mono means the stream contains a single channel only.

Common Conversion Checklist

Before sending audio to the SDK or service, make sure your pipeline outputs:
  • a supported sample rate
  • mono audio
  • 16-bit signed PCM samples
  • raw PCM bytes rather than WAV headers or compressed audio frames
For a practical overview of common audio encodings and container formats, see the FFmpeg audio types reference.