🔊

Speech Technology

TTS, STT, voice cloning, latency engineering, and the hard parts of making AI sound human.

25 articles

🔊 Speech Tech

How to Benchmark a Voice Agent's End-to-End Latency

How to Benchmark a Voice Agent's End-to-End Latency. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 21, 2026 · 4 min
🔊 Speech Tech

Whisper vs Deepgram vs ElevenLabs STT

Whisper vs Deepgram vs ElevenLabs STT. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 20, 2026 · 4 min
🔊 Speech Tech

Streaming Audio Over WebRTC for Voice Agents

Streaming Audio Over WebRTC for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 20, 2026 · 4 min
🔊 Speech Tech

Comparing Neural TTS Architectures

Comparing Neural TTS Architectures. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 19, 2026 · 4 min
🔊 Speech Tech

Phoneme-Level Tuning for Voice Agents

Phoneme-Level Tuning for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 19, 2026 · 4 min
🔊 Speech Tech

Why Some Voices Sound Robotic Even in 2026

Why Some Voices Sound Robotic Even in 2026. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 18, 2026 · 4 min
🔊 Speech Tech

Voice Cloning for Customer Brands: A Buyer's Guide

Voice Cloning for Customer Brands: A Buyer's Guide. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Cliff Weitzman · Mar 18, 2026 · 4 min
🔊 Speech Tech

Echo Cancellation in Real-Time Voice AI

Echo Cancellation in Real-Time Voice AI. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 17, 2026 · 4 min
🔊 Speech Tech

How Sample Rate Affects Voice Agent Quality

How Sample Rate Affects Voice Agent Quality. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 17, 2026 · 4 min
🔊 Speech Tech

How Background Noise Affects Voice Agent Accuracy

How Background Noise Affects Voice Agent Accuracy. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 16, 2026 · 4 min
🔊 Speech Tech

Audio Codecs for Voice Agents: Opus, PCMU, and More

Audio Codecs for Voice Agents: Opus, PCMU, and More. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 16, 2026 · 4 min
🔊 Speech Tech

Diarization: Knowing Who's Speaking in a Voice Conversation

Diarization: Knowing Who's Speaking in a Voice Conversation. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 15, 2026 · 4 min
🔊 Speech Tech

Voice Activity Detection in Production Voice Agents

Voice Activity Detection in Production Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 14, 2026 · 4 min
🔊 Speech Tech

The Engineering Behind Sub-Second Voice Agents

The Engineering Behind Sub-Second Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 14, 2026 · 4 min
🔊 Speech Tech

How STT Handles Disfluencies and Filler Words

How STT Handles Disfluencies and Filler Words. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 13, 2026 · 4 min
🔊 Speech Tech

Multilingual TTS: Choosing a Voice Model

Multilingual TTS: Choosing a Voice Model. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 13, 2026 · 4 min
🔊 Speech Tech

Why TTS Quality Plateaus and How to Push Past It

Why TTS Quality Plateaus and How to Push Past It. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 12, 2026 · 4 min
🔊 Speech Tech

How TTS Models Handle Numbers, Dates, and Acronyms

How TTS Models Handle Numbers, Dates, and Acronyms. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 12, 2026 · 4 min
🔊 Speech Tech

Streaming STT: How to Cut Recognition Latency

Streaming STT: How to Cut Recognition Latency. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 11, 2026 · 4 min
🔊 Speech Tech

Streaming TTS: How to Cut First-Audio Latency

Streaming TTS: How to Cut First-Audio Latency. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 11, 2026 · 4 min
🔊 Speech Tech

Latency Engineering for Real-Time Voice Agents

Latency Engineering for Real-Time Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 10, 2026 · 4 min
🔊 Speech Tech

Voice Cloning Ethics: A Practical Framework

Voice Cloning Ethics: A Practical Framework. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Cliff Weitzman · Mar 10, 2026 · 4 min
🔊 Speech Tech

Voice Cloning: How It Works and Why It Matters

Voice Cloning: How It Works and Why It Matters. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 9, 2026 · 4 min
🔊 Speech Tech

Speech-to-Text Word Error Rate Explained

Speech-to-Text Word Error Rate Explained. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 9, 2026 · 4 min
🔊 Speech Tech

Text-to-Speech in 2026: The State of the Art

Text-to-Speech in 2026: The State of the Art. A practical, vendor-neutral guide for teams building or buying voice AI agents.

Tyler Weitzman · Mar 8, 2026 · 4 min