Skip to main content
SpatialReal currently provides avatar-only services, focusing on generating and rendering real-time avatar animations based on audio input. Voice conversation logic, speech synthesis, and other agent functionalities are managed by your application or third-party services.

1) Compare Integration Modes

ModeCharacteristicLatencyDev EffortIdeal Scenario
SDK ModeClient-centricModerate ⏱️Low 🟢Easy client-centric integration.
Framework PluginVoice agent framework pluginUltra-Low ⚡Low 🟢Use LiveKit Agents or TEN Framework.
RTC ModeTransport via Agora/LiveKitUltra-Low ⚡Medium 🟡You want to optimize latency, and use LiveKit or Agora as transport layer
Host ModeUse your own transport layerLow 💨High 🔴Full control over data transport.

2) Start from Runnable Examples

AvatarKit Voice Agent Demo

A collection of example projects that can work out of the box after API keys are filled in .env.
git clone https://github.com/spatialwalk/avatarkit-voice-agent-demo.git
cd avatarkit-voice-agent-demo

3) Guides

RTC Mode & Framework Plugin Guide

Use LiveKit or Agora for real-time delivery, including LiveKit Agents and TEN Framework integration.

4) Want to Learn More?

Read the full Integration Modes doc to understand the technical details, service architecture, and best practices for each integration mode.