Gladia

Product News

What is speaker diarization?

One of the major obstacles for speech-to-text AI has been identifying individual speakers in a multi-speaker audio stream before transcribing the speech. This is where speaker separation, also known as diarization, comes into play.

Product News

March 2023 Roadmap for Gladia's Speech-to-Text API: Speaker Diarization, Word-Level Timestamps and more

A glimpse into Gladia's roadmap for its Speech-to-Text API, starting with speaker diarization. We’re incredibly excited to be building our Audio Intelligence product in a community-led way, delivering a holistic final product adapted to the many needs and use cases brought to our attention.

Speech-To-Text

Here’s how speech-to-text AI can benefit your business today

Speech-to-text AI is entering an exciting phase and becoming a commodity. By powering audio intelligence, products like Gladia's Audio Transcription API create value for all businesses, from collaboration platforms and content studios to media companies and call centers.

Speech-To-Text

Prompt injection in speech recognition explained

Following the release of ChatGPT, prompt engineering for LLMs became one of the most widely discussed fields in AI. Prompt injection in speech recognition in particular, used to guide the underlying model toward more accurate results, is a fascinating NLP technique worth exploring in more detail.

Product News

Redefining what’s possible with speech-to-text AI

Note: This article was originally published on Medium in February 2023.
