Blog

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Speech-To-Text

Introduction to speech-to-text AI

Speech-to-text (STT), also known as Automatic Speech Recognition (ASR), is an AI technology that transcribes spoken language into written text. Previously reserved for the privileged few, STT is becoming increasingly leveraged by companies worldwide to embed new audio features in existing apps and create smart assistants for a range of use cases.

Speech-To-Text

How much does it really cost to host Whisper AI transcription?

Open-source ASR models are often presented as the most cost-effective solution to embedding Language AI into your applications. But is that always the case? Here's our take.

Speech-To-Text

Thinking of using open-source Whisper ASR? Here are the main factors to consider

Perhaps you’re a developer looking for an Automatic Speech Recognition (ASR) solution for the first time. Or an executive looking for more affordable, faster, more accurate alternatives to the mainstream speech-to-text solutions for your business. Where do you turn to?

Speech-To-Text

Here’s how to pick the right speech-to-text provider for your Speech AI journey

Until recently, AI speech-to-text has been reserved for the happy few. But commodification is on its way. As prices dropped while the accuracy and speed of transcription increased, there has been an explosion of speech-to-text providers catering to a broader range of companies and use cases. In this article, we give you a bird's-eye view of the market and introduce you to the speed-accuracy-cost tradeoff in audio transcription to help you pick the best Automatic Speech Recognition (ASR) provider for your use case and budget. 

Case Studies

Powering virtual meetings with Speech to Text AI: Claap's success story with Gladia

A case study showcasing the benefits of Gladia's AI API for Claap, an all-in-one video workspace that implemented our solution to provide its international users with advanced video transcription capabilities.

Product News

From Speech to Knowledge: Gladia’s Audio Intelligence API

Gladia is proud to announce the general availability of its groundbreaking Speech-to-Text API, previously in alpha. The revamped enterprise-grade API supports transcription, speaker diarization, word-level timestamp, code-switching, and beta translation in 99 languages.