How STT Handles Disfluencies and Filler Words
How STT Handles Disfluencies and Filler Words. A practical, vendor-neutral guide for teams building or buying voice AI agents.
This article is being written.
We're publishing every couple of days through 2026. In the meantime, browse other articles in Speech Technology.

Tyler Weitzman is co-founder and Head of AI at Speechify. He has spent the past decade building the speech-synthesis stack that powers millions of users. Tyler writes about the engineering of real-time conversational systems โ text-to-speech, speech recognition, latency budgets, model serving, and the architectural choices that separate prototypes from production-grade voice agents.
Related reading
Whisper vs Deepgram vs ElevenLabs STT
Whisper vs Deepgram vs ElevenLabs STT. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Background Noise Affects Voice Agent Accuracy
How Background Noise Affects Voice Agent Accuracy. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Speech-to-Text Word Error Rate Explained
Speech-to-Text Word Error Rate Explained. A practical, vendor-neutral guide for teams building or buying voice AI agents.