Blog

Technical guides, customer stories, and product updates
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Product News

What is speaker diarization?

One of the major obstacles for speech-to-text AI has been identifying individual speakers in a multi-speaker audio stream before transcribing the speech. This is where speaker separation, also known as diarization, comes into play.

Product News

March 2023 Roadmap its Speech-to-Text API: Speaker Diarization, Word-Level Timestamps and more

A glimpse into Gladia's roadmap for its Speech-to-Text API, starting with speaker diarization. We’re incredibly excited to be building our Audio Intelligence product in a community-led way, delivering a holistic final product adapted to the many needs and use cases brought to our attention.

Speech-To-Text

Here’s how speech-to-text AI can benefit your business today

Speech-to-text AI is entering an exciting phase and becoming a commodity. By powering Audio intelligence, products like Gladia's Audio Transcription API create value for all businesses, from collaboration platforms to content studios to media companies to call centers.

Speech-To-Text

Prompt injection in speech recognition explained

Following the release of ChatGPT, prompt engineering for LLMs became one of the most widely-discussed fields in AI. Prompt injection in Speech Recognition in particular, used to guide the underlying model to produce more accurate results, is a fascinating NLP technique worth exploring in more detail.

Product News

Redefining what’s possible with speech-to-text AI

Note: This article was originally published on Medium in February 2023.