Blog

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Speech-To-Text

Best speech-to-text APIs in 2023

Speech-to-text (STT), also known as automatic speech or voice recognition, is a type of AI technology that recognizes human speech in audio or video and transcribes it into written output. In the form of an API, it can power a variety of applications, ranging from call bots to voice assistants to AI-powered virtual meeting platforms.

Tutorials

How to build a voice-to-text Discord bot with Gladia real-time transcription API

Discord, the leading communication platform for gamers and communities, is designed for seamless communication with other users, be it through text channels, DMs, 1-1 calls or even collective voice channels.

Product News

Here’s how we optimized Whisper ASR for enterprise scale

In this article, we give you a breakdown of features and parameters that distinguish Gladia API from both open-source and API versions of OpenAI’s Whisper ASR model. 

Speech-To-Text

Introduction to speech-to-text AI

Speech-to-text (STT), also known as Automatic Speech Recognition (ASR), is an AI technology that transcribes spoken language into written text. Previously reserved for the privileged few, STT is becoming increasingly leveraged by companies worldwide to embed new audio features in existing apps and create smart assistants for a range of use cases.

Speech-To-Text

How to build a Google Meet transcription bot with Python, React and Gladia API

In today's fast-paced world, effective communication and collaboration are essential. Tools like Google Meet have revolutionized how we connect and conduct meetings remotely. However, it can be very challenging to keep track of all action items and key insights shared during long meetings.

Speech-To-Text

Thinking of using open-source Whisper ASR? Here are the main factors to consider

Perhaps you’re a developer looking for an Automatic Speech Recognition (ASR) solution for the first time. Or an executive looking for more affordable, faster, more accurate alternatives to the mainstream speech-to-text solutions for your business. Where do you turn to?