Blog

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Speech-To-Text

An introduction to ASR speaker recognition: identification, verification and diarization

Due to individual differences in physical attributes like vocal tract shapes, every person possesses a distinct voice pattern. In automatic speech recognition (ASR), this uniqueness is harnessed to identify and analyze speakers by extracting and analyzing voice features such as pitch and frequencies.

Tutorials

Building a Whisper YouTube transcription generator for automated captioning

With over 500 hours of video uploaded to YouTube every minute, providing accurate captions and transcripts is essential for creators to make their content engaging and accessible. However, manually transcribing long videos is tedious and time-consuming.

Tutorials

How to summarize audio using Whisper ASR and GPT 3.5

From online meetings to voice memos and media content, the amount of audio data generated by companies daily is as vast as it is valuable.

Speech-To-Text

Best network architecture for speech recognition software

Building high-quality speech recognition software for your businesses has never been easier. But one needs the right infrastructure to make the most out of AI transcription at an enterprise scale.

Speech-To-Text

Best prompts for summarizing online meetings with large language models

Online meetings quickly generate copious amounts of information, often including quite a bit of “noise” and with important takeaways interspersed with a multitude of less relevant details and dead-end discussions. The sheer abundance of information deliberated during online meetings can be overwhelming, and your users may want a solution to that.

Product News

Recall and Gladia join forces to power online meetings transcription

Today, we are thrilled to announce a partnership aimed at empowering businesses and developers worldwide to fully leverage data from online meetings.