EXTENSION

3D Spatial Audio

Add dynamic, immersive audio to your real-time experience
A woman meditating in a living room wearing a virtual reality headset, surrounded by holographic jellyfish and immersed in spatial audio effects.
Supported Platforms
Android
iOS
Windows
EXTENSION

3D Spatial Audio

Add dynamic, immersive audio to your real-time experience
Supported Platforms
Android
iOS
Windows
Customers building with
Agora and OpenAI
grepp logoWYZE logokileon logokumu logoScaler logoParallel logoJorJin logoAnotherBall logoEllie logozigbang logo
grepp logoWYZE logokileon logokumu logoScaler logoParallel logoJorJin logoAnotherBall logoEllie logozigbang logo

Features

Natural listening experience icon
Natural listening experience 
Support for high-quality audio range, audio playback, background blur, air attenuation and more, perfectly simulating a natural listening experience.
Highest fidelity 3D audio icon
Highest fidelity 3D audio
Supports 48kHz full-band sampling and allows listeners to pinpoint both the direction and distance of a voice coming from the speaker.
Low Latency icon
Low latency
Low latency, low power consumption, and efficient processing modes preserve the real-time experience.
Cross-platform support icon
Cross-platform support
Agora’s streaming 3D audio API has support for Web, iOS, Android, Mac, Windows, Unity, React Native, and Electron.
Global scalability icon
Global scalability
Scale from 1:1 to millions of users on the network that annually powers hundreds of billions of minutes of real-time video to users in over 200 countries and regions.
Talk to a voice agent powered by the Conversational AI Engine
Try it now
One real-time view for the metrics that matter the most
Use a single dashboard to monitor every active session around the world. Track the metrics that are most important to you, from concurrent users and channels to network latency and so much more.

Your vision, unrestricted.

With Interactive Whiteboard, you can build a collaborative app fast—with custom branding and full of features. Our platform makes it easy to create a customized and engaging learning environment.
  • Flexible APIs support custom branding and extensive digital whiteboard features.
  • Easily integrate real-time voice and video calling, interactive streaming and signaling.
  • Save users’ bandwidth by preloading, sharing, and annotating files, and retain all the dynamic content.
And have peace of mind with HIPAA, GDPR, and CCPA compliance.

See OpenAI's Realtime API in action

Deliver a more natural audio experience

Make your product stand out with Agora’s 3D Spatial Audio API that boosts user engagement.
Deliver a more realistic audio experience icon

Deliver a more realistic audio experience

Replicate how we hear sound in the real world for a more natural experience that makes users feel like they are in the same room.
Deliver a more realistic audio experience icon

Deliver a more realistic audio experience

Integrate quickly and easily icon

Integrate quickly and easily

Quickly make your user experience more immersive by activating Agora’s 3D Spatial Audio extension that works seamlessly with our video, voice, and streaming products.
Integrate quickly and easily icon

Integrate quickly and easily

Give users the best audio quality icon

Give users the best audio quality

Allow your audience to hear deeper nuances of music and spoken word with superior audio that elevates the quality of the user’s entire experience.
Give users the best audio quality icon

Give users the best audio quality

Recording options for:

Cloud recording
Store, retrieve and share recordings in the cloud.
Go to Docs
On-premise recording
Store on a local server for security and confidentiality.
Go to Docs
Webpage recording
Record the entire web browser screen experience.
Go to Docs

Agora Media Services

Recording icon
Recording
Record audio streams, video streams and web pages for archive, review, or distribution.
Live icon
Media Gateway
Directly push media streams into Agora voice and video channels using the RTMP/SRT protocol and enable advanced transcoding processing on media streams to facilitate distribution.
Download icon
Media Pull
Add additional engagement to your Agora sessions by  pulling live or recorded video and audio content and ingesting directly into your Agora channel.
Media Push
Expand your audience with hybrid engagement experiences by pushing audio and video streams from Agora channels to Content Delivery Networks (CDN).

Made for developers

Quickstart guide

View the quickstart guide to get up and running with Agora and Open AI.
How the Conversational AI Engine works

Made for developers

Your Code

Agora SDK

Customize your experience from the start with our flexible SDK.
Your Code

Agora SDK

Build and integrate real-time video into your app with the most flexibility and  customization using Agora's Video SDK.
NO CODE

App Builder

Agora’s App Builder is the fastest and easiest way to real-time video into your product using our no-code visual designer.
Go to Docs
low code

Agora UI Kit

Add real-time video to your app with only a few lines of code using low-code UI Kit libraries.
Go to Docs
your code

Agora SDK

Customize your experience from the start with our flexible SDK.
Android
iOS
Windows
Go to Docs
low code

Agora UI Kit

Integrate real-time communication and streaming using only a few lines of code with low-code UIKit libraries.
Go to Docs

Documentation

Documentation

This project presents you a set of API examples to help you understand how to use Agora APIs.
View documentation on how to set up 3D Spatial Audio.
Android
iOS
Windows
Go to Docs

Activate Extension

Activate the AI Noise Suppression extension on the Agora Console.

Activate the 3D Spatial Audio extension in the Agora Console.

Go to Console
your code

Agora SDK

Build and integrate Voice Calling with the most flexibility and full customization using Agora's Voice SDK.
Android
iOS
Windows
Go to Docs
NO code

App Builder

Agora’s App Builder is the fastest and easiest way to add real-time voice chat, video chat, and live streaming into your product.
Go to Docs
your code

Agora SDK

Build and integrate real-time visual collaboration features into your application with the most flexibility and full customization using Agora's Interactive Whiteboard SDK.
Android
iOS
Windows
Go to Docs
LOW code

Fastboard

Build real-time visual collaboration faster with a pre-built UI and the ability to include custom plug ins.
Try it Now
Security, privacy and compliance
Agora is certified to the ISO/IEC 27001, 27017, 27018, 27701 and SOC 2 security standards and meets privacy regulations like GDPR, CCAP, COPPA, and HIPAA. Agora doesn’t collect or store any end-user data aside from Internet Protocol (IP) addresses and operational information necessary for providing our services.
ISO 27001:2022
ISO 27017:2015
ISO 27018:2019
ISO 27701:2019
HIPAA
GDPR
SOC2 Type1&2
CCPA
COPPA
HOW TO INTEGRATE?
Streamlined 3-step integration process:
01
Activate Agora Conversational AI Engine
Unlock real-time Speech-to-Text (STT) and Text-to-Speech (TTS) capabilities, enabling seamless conversational interactions. 
02
Integrate Agora Edge Chip on Hardware
Optimize microphone, speaker, and system efficiency to ensure ultra-low-latency and high-fidelity conversations.
03
Deploy AI Voice Agents
Enable interactive, multilingual, and user-customized conversations for a wide range of IoT applications.
By building our Conversational AI technology into Beken’s high-performance IoT chip modules, the turnkey solution makes it easy to integrate voice AI into any connected toy. 
“With Agora’s conversational AI technology and our optimized AI hardware, we’re enabling the next generation of toys to think, respond, and interact naturally. We are excited to usher in the future of robotics and toys, ones that can react to the environment around them and interact fluently with users.” 
Pengfei Zhang
CEO, BEKEN
Use cases

Provide an exceptional immersive sound experience

A livecast of a gaming session with three players.
Livecasting
Create a more personal environment, as if friends are sharing same physical space.
A man is a on conference call next to several others, powered by 3D spatial audio which allows participants to hear him clearly.
Meetings / Conference calls
Make meetings more productive by allowing participants to focus on the main speaker—not background noises
A young child on a live video call on a laptop with his teacher and immersed in the lesson.
Education
Enrich the learning experience by making it more personal and memorable—as if the teacher is sitting next to the student.
A video of a live musical concert, providing an immersive experience and allowing listeners to enjoy the nuances in every note.
Music Streaming
Provide a fully immersive experience allowing listeners to enjoy the nuances in every note.
Robopoet's Fuzzoo, an AI companion robot, leverages Agora's ConvoAI Device Kit to deliver real-time emotional support and personalized interaction.
"Agora’s AI technology enables toys and robots to interact in a way that feels natural and engaging. With real-time voice processing, emotional AI, and advanced speech capabilities, Agora makes seamless human-machine interaction possible and ensures exceptional performance and reliability." 
Yuna Pan
Co-Founder and CTO
Mouse cursor illustration

Fastboard

Easily build and integrate Agora’s Interactive Whiteboard with our newest Fastboard SDK that delivers all the same whiteboard features with a pre-built UI and the ability to include custom plug ins.
Try it Now
No items found.
Request more information
Connect with our experts to answer your questions, discuss requirements, and provide more detail on the ConvoAI Device Kit

Frequently asked questions

How does Agora improve the experience in comparison with other solutions for voice interaction with AI?

Agora enables more natural voice conversations with AI, thanks to low-latency responses and real-time interruption handling. Agora’s built-in background noise suppression, echo cancelation, and selective attention locking allow AI to hear the user clearly in any environment. Agora’s global real-time network ensures connectivity and performance in any location.

What LLMs can be connected to Agora’s Conversational AI Engine?

You can connect to any LLM that is OpenAI compatible, including OpenAI’s GPT models, Google Gemini, DeepSeek or any custom model that is OpenAI-compatible. Support for additional LLMs coming soon!

What additional technology is required to implement a voice AI agent?

To implement a voice AI agent, you need to connect an LLM and a text-to-speech service to Agora’s Conversational AI Engine. This enables full customization of the experience, with the LLM and voice of your choice.

What is a "chained model" in relation to conversational voice AI?

The chained model refers to the flow of the user’s voice being processed by speech-to-text technology, then that text being processed by the LLM, then the LLM’s response being processed by text-to-speech technology and ultimately outputting the AI agent’s voice response.

Does Agora’s Conversational AI Engine enable the creation of an AI model or LLM?

No, Agora’s Conversational AI Engine requires an existing AI model or LLM. The Engine enables customized voice interaction with the LLM but is not capable of creating or training an LLM.