Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Azure Speech Services vs Gladia: Enterprise SLA, data residency & compliance comparison
Azure Speech Services vs Gladia: Compare enterprise SLA, compliance, pricing, and data residency for speech to text infrastructure. Both platforms meet SOC 2 Type 2 and GDPR requirements, but differ on cost structure and integration speed for product teams building at scale.
Best real-time STT models for meeting assistants 2026
Best real-time STT models for meeting assistants in 2026 compared on latency, diarization, and multilingual accuracy for live calls. Gladia Solaria-1 delivers 103ms partial latency with bundled diarization and native code-switching across 100+ languages at $0.55 per hour, all features included.
How to transcribe Google Meet calls: Complete implementation guide for async meeting transcription
How to transcribe Google Meet calls using bots, browser extensions, or the Meet Media API with production grade STT backends. Choose the right audio capture architecture and STT provider to ship accurate, multilingual transcription with speaker diarization in under 24 hours.
How to build a voice-to-text Discord bot with Gladia real-time transcription API
Published on Sep 21, 2023
Discord, the leading communication platform for gamers and communities, is designed for seamless communication with other users, be it through text channels, DMs, 1-1 calls or even collective voice channels.
Based on multiple request from our Discord members, we’ve built a custom JavaScript bot that makes use of Gladia’s live transcription API to transcribe speech in real time directly on the Discord server.
What can you do with Discord bot?
First, you can transcribe voice in real time directly on Discord’s voice channels. Ex. you’re streaming a game on Discord and want to access some learnings and tips received during the sessions. Or, you’re having your group gathers on the platform and want to be able to review the talking points after – just like with any other virtual meeting platform.
Beyond that, a bot like this could be used for real-time moderation to flag hate speech and ban users. With additional tools like ChatGPT, you could also create command-based notes to provide meeting summaries and helps you catch up with meetings you may have missed.
How to implement the Discord.js v14 bot + Gladia real-time transcription
Step 1: Register your bot
Create a Discord bot that you'd like to use for transcription. If you’ve never built one before, here’s a useful resource to help.
First, install all the required package by running:
npm install
Then, you will to setup the index.js script with your Discord keys, guild ID (Server ID), and the Voice Channel ID.
Step 2: Retrieve API key
Sign up for our speech-to-text API at app.gladia.io and obtain your API key. Documentation for Gladia live transcription can be found here.
Step 3: Code integration
Once everything is set up properly, simply run:
npm run start YOUR_GLADIA_TOKEN
Your bot should then join the channel corresponding to the channel ID you configured in the index.js file.
Step 4: Configure Discord permissions
Make sure your bot is invited on the server;
Give the bot the required voice permissions.
Bear in mind that the current v1 implementation of the bot is not fully optimized, so you might experiences inaccuracy regarding language changes & words.
We hope you enjoyed this short tutorial. Given how much audio data still goes to wasted, we’re always curious to explore the many ways in which transcription tech can be used to remedy that. Let us know if you went on to build a bot or used our API for others apps on Discord or beyond, we’d love to hear from you.
About Gladia
At Gladia, we built an optimized version of Whisper in the form of an API, adapted to real-life use cases and distinguished by exceptional accuracy, speed, extended multilingual capabilities and state-of-the-art features, including speaker diarization and word-level timestamps.
Contact us
Your request has been registered
A problem occurred while submitting the form.
Read more
Speech-To-Text
Azure Speech Services vs Gladia: SLA & Pricing Review
Speech-To-Text
Best STT API for Meeting Assistants 2026 Comparison
Speech-To-Text
How to Transcribe Google Meet Calls: Complete Implementation Guide for Real-Time & Post-Meeting Transcription
From audio to knowledge
Subscribe to receive latest news, product updates and curated AI content.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.