Cloud-based transcription converts audio to text for active or selected hosts in real time. Text can be distributed as live captions to all participants in the channel.
LLM integration
Integrate speech to text with LLMs for further processing, without impacting RTC performance. Upload transcription text as .vtt files to LLMs like GPT to generate summaries, notes, and more.
Transcribing and labeling simultaneous speakers
Easily label who said what—even with up to 3 simultaneous speakers. Separate transcription for each host ensures accuracy and allows you to choose to transcribe for one specific host.
Captioning for cloud recordings
Transcribe audio to text on video or audio recordings to enable closed captions (CC) on playback or review important discussion items in the transcript.
Multi-language support
Real-time transcription supports all major languages and dialects, and each channel can support audio-to-text transcription for up to two languages simultaneously.
Enterprise-grade security and compliance
Agora is ISO and SOC 2 certified and meets compliance standards for regional privacy laws and industry regulations, including GDPR, CCPA, and HIPAA. Live captions and transcription can be encrypted in the same way as encrypted RTC audio or video.
One real-time view
for the metrics that
matter the most
Use a single dashboard to monitor every active session around the world. Track the metrics that are most
important to you, from concurrent users and channels to network latency and so much more.
With Interactive Whiteboard, you can build a collaborative app fast—with custom branding and full of features. Our platform makes it easy to create a customized and engaging learning environment.
Flexible APIs support custom branding and extensive digital whiteboard features.
Easily integrate real-time voice and video calling, interactive streaming and signaling.
Save users’ bandwidth by preloading, sharing, and annotating files, and retain all the dynamic content.
And have peace of mind with HIPAA, GDPR, and CCPA compliance.
See OpenAI's Realtime API in action
Instantly transcribe speech to text for live audio and video
Agora’s Real-Time Speech to Text provides accurate live transcription and subtitling services at a low cost.
Reduce cost and increase efficiency
More efficient and cost-effective than traditional client-side live transcription, Agora’s solution by uses advanced technology to remove silence, reduce Word Error Rate (WER), and distribute live captions to all participants in a channel.
Reduce cost and increase efficiency
Get the most accurate results at scale
Cutting-edge AI ensures the highest accuracy even with overlapping speech, regional accents, and poor network conditions. Scale from one-to-one meetings to up to millions of participants with the same accuracy.
Get the most accurate results at scale
Integrate with ease
Agora’s Real-Time Speech to Text is highly integrated with Agora’s network (SD-RTN™), providing global user transcription and real-time text distribution even in poor network environments.
Integrate with ease
Recording options for:
Cloud recording
Store, retrieve and share recordings in the cloud.
Directly push media streams into Agora voice and video channels using the RTMP/SRT protocol and enable advanced transcoding processing on media streams to facilitate distribution.
Build and integrate real-time visual collaboration features into your application with the most flexibility and full customization using Agora's Interactive Whiteboard SDK.
Agora is certified to the ISO/IEC 27001, 27017, 27018, 27701 and SOC 2 security standards and meets privacy regulations like GDPR, CCAP, COPPA, and HIPAA. Agora doesn’t collect or store any end-user data aside from Internet Protocol (IP) addresses and operational information necessary for providing our services.
ISO 27001:2022
ISO 27017:2015
ISO 27018:2019
ISO 27701:2019
HIPAA
GDPR
SOC2 Type1&2
CCPA
COPPA
Use cases
Transcribe speech to text for any real-time application
Securely transcribe and record real-time audio or video and organize recordings and transcripts to speed up workflows.
Give faculty and students real-time captions and analyze them with an LLM to provide lesson summaries and suggestions for further learning.
Telehealth
Keep secure records of virtual appointments for Minimum Effective Response (MER) and cross-reference telehealth knowledge bases.
Events
Empower your event with real-time, accurate notes, ensuring a more accessible, searchable, and engaging event experience.
Live shopping
Use virtual assistants to improve accessibility and reach a wider audience by offering detailed product information, personalized recommendations, and guiding customers through the purchasing process.
Virtual meetings
Provide real-time automated notes in meetings and document outstanding questions and action items via an LLM.
Social & metaverse
Eliminate communication barriers for people with different languages or disabilities. Extract conversation for business optimization, advertising, and moderation.
Fastboard
Easily build and integrate Agora’s Interactive Whiteboard with our newest Fastboard SDK that delivers all the same whiteboard features with a pre-built UI and the ability to include custom plug ins.
“Agora’s Real-Time Speech to Text enabled us to integrate with AI to automate translation and feedback, providing substantial improvements in the overall language learning experience.”