
Tyler Weitzman is co-founder and Head of AI at Speechify. He has spent the past decade building the speech-synthesis stack that powers millions of users. Tyler writes about the engineering of real-time conversational systems — text-to-speech, speech recognition, latency budgets, model serving, and the architectural choices that separate prototypes from production-grade voice agents.
Articles by Tyler Weitzman (73)
Open-Source vs Proprietary Voice Agent Stacks
Open-Source vs Proprietary Voice Agent Stacks. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Build vs Buy: When to Build Your Own Voice Agent
Build vs Buy: When to Build Your Own Voice Agent. A practical, vendor-neutral guide for teams building or buying voice AI agents.
ElevenLabs vs Vapi vs Retell: A Technical Comparison
ElevenLabs vs Vapi vs Retell: A Technical Comparison. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to Integrate Voice Agents with a Custom REST API
How to Integrate Voice Agents with a Custom REST API. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Connecting Voice Agents to Snowflake or BigQuery
Connecting Voice Agents to Snowflake or BigQuery. A practical, vendor-neutral guide for teams building or buying voice AI agents.
SIP vs WebRTC for Voice Agents
SIP vs WebRTC for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Bring Your Own Twilio: Pros, Cons, and Setup
Bring Your Own Twilio: Pros, Cons, and Setup. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to Use Twilio Studio with AI Voice Agents
How to Use Twilio Studio with AI Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Connecting Voice Agents to Stripe for Payments
Connecting Voice Agents to Stripe for Payments. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Webhooks 101 for Voice Agents
Webhooks 101 for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
SIP Trunking 101 for Voice Agent Builders
SIP Trunking 101 for Voice Agent Builders. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Twilio + Voice Agents: A Complete Guide
Twilio + Voice Agents: A Complete Guide. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to Benchmark a Voice Agent's End-to-End Latency
How to Benchmark a Voice Agent's End-to-End Latency. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Whisper vs Deepgram vs ElevenLabs STT
Whisper vs Deepgram vs ElevenLabs STT. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Streaming Audio Over WebRTC for Voice Agents
Streaming Audio Over WebRTC for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Comparing Neural TTS Architectures
Comparing Neural TTS Architectures. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Phoneme-Level Tuning for Voice Agents
Phoneme-Level Tuning for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Why Some Voices Sound Robotic Even in 2026
Why Some Voices Sound Robotic Even in 2026. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Echo Cancellation in Real-Time Voice AI
Echo Cancellation in Real-Time Voice AI. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Sample Rate Affects Voice Agent Quality
How Sample Rate Affects Voice Agent Quality. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Background Noise Affects Voice Agent Accuracy
How Background Noise Affects Voice Agent Accuracy. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Audio Codecs for Voice Agents: Opus, PCMU, and More
Audio Codecs for Voice Agents: Opus, PCMU, and More. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Diarization: Knowing Who's Speaking in a Voice Conversation
Diarization: Knowing Who's Speaking in a Voice Conversation. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Voice Activity Detection in Production Voice Agents
Voice Activity Detection in Production Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
The Engineering Behind Sub-Second Voice Agents
The Engineering Behind Sub-Second Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How STT Handles Disfluencies and Filler Words
How STT Handles Disfluencies and Filler Words. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Multilingual TTS: Choosing a Voice Model
Multilingual TTS: Choosing a Voice Model. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Why TTS Quality Plateaus and How to Push Past It
Why TTS Quality Plateaus and How to Push Past It. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How TTS Models Handle Numbers, Dates, and Acronyms
How TTS Models Handle Numbers, Dates, and Acronyms. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Streaming STT: How to Cut Recognition Latency
Streaming STT: How to Cut Recognition Latency. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Streaming TTS: How to Cut First-Audio Latency
Streaming TTS: How to Cut First-Audio Latency. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Latency Engineering for Real-Time Voice Agents
Latency Engineering for Real-Time Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Voice Cloning: How It Works and Why It Matters
Voice Cloning: How It Works and Why It Matters. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Speech-to-Text Word Error Rate Explained
Speech-to-Text Word Error Rate Explained. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Text-to-Speech in 2026: The State of the Art
Text-to-Speech in 2026: The State of the Art. A practical, vendor-neutral guide for teams building or buying voice AI agents.
DTMF and IVR Navigation for Outbound Voice Agents
DTMF and IVR Navigation for Outbound Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to Handle Personally Identifiable Information in Voice Agents
How to Handle Personally Identifiable Information in Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Open-Source vs Closed-Source LLMs for Voice Agents
Open-Source vs Closed-Source LLMs for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How LLMs Decide What to Say Next in a Voice Conversation
How LLMs Decide What to Say Next in a Voice Conversation. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Red-Teaming Your Voice Agent
Red-Teaming Your Voice Agent. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Building a Conversation Memory Layer for Voice Agents
Building a Conversation Memory Layer for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Why Context Windows Matter Less Than You Think for Voice
Why Context Windows Matter Less Than You Think for Voice. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to A/B Test Voice Agent Prompts
How to A/B Test Voice Agent Prompts. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Streaming LLM Outputs to Voice: The Engineering
Streaming LLM Outputs to Voice: The Engineering. A practical, vendor-neutral guide for teams building or buying voice AI agents.
The Role of Embeddings in Voice Agent Knowledge
The Role of Embeddings in Voice Agent Knowledge. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to Stop a Voice Agent from Hallucinating
How to Stop a Voice Agent from Hallucinating. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Multi-Agent Architectures for Customer Service
Multi-Agent Architectures for Customer Service. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Designing System Prompts for Multi-Turn Voice Conversations
Designing System Prompts for Multi-Turn Voice Conversations. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Tool Use vs Function Calling: What's the Difference?
Tool Use vs Function Calling: What's the Difference?. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Why Smaller LLMs Often Win for Voice Agents
Why Smaller LLMs Often Win for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Guardrails for Voice Agents: A Pragmatic Take
Guardrails for Voice Agents: A Pragmatic Take. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Retrieval-Augmented Generation for Voice Agents
Retrieval-Augmented Generation for Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
LLM Evaluation for Conversational Agents
LLM Evaluation for Conversational Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to Give a Voice Agent Long-Term Memory
How to Give a Voice Agent Long-Term Memory. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Prompt Engineering for Voice (vs Text) Agents
Prompt Engineering for Voice (vs Text) Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Function Calling for Voice Agents: A Practical Guide
Function Calling for Voice Agents: A Practical Guide. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Large Language Models Power Voice Agents
How Large Language Models Power Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
The Hidden Complexity of Numbers in Voice Agents
The Hidden Complexity of Numbers in Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Voice Agents Handle Accents and Dialects
How Voice Agents Handle Accents and Dialects. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How to Measure Voice Agent Quality
How to Measure Voice Agent Quality. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Voice Agents Recover from Misunderstandings
How Voice Agents Recover from Misunderstandings. A practical, vendor-neutral guide for teams building or buying voice AI agents.
The Difference Between Streaming and Non-Streaming Voice Agents
The Difference Between Streaming and Non-Streaming Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Voice Agents Decide When to Stop Talking
How Voice Agents Decide When to Stop Talking. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Synchronous vs Asynchronous Voice Agents
Synchronous vs Asynchronous Voice Agents. A practical, vendor-neutral guide for teams building or buying voice AI agents.
What Makes a Voice Agent "Production Ready"
What Makes a Voice Agent "Production Ready". A practical, vendor-neutral guide for teams building or buying voice AI agents.
Why Voice Agents Sound More Human Every Year
Why Voice Agents Sound More Human Every Year. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Voice Agents Differ from Voice Assistants
How Voice Agents Differ from Voice Assistants. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How Voice Agents Handle Interruptions Gracefully
How Voice Agents Handle Interruptions Gracefully. A practical, vendor-neutral guide for teams building or buying voice AI agents.
The Anatomy of a Voice Agent Pipeline
The Anatomy of a Voice Agent Pipeline. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Turn-Taking and Barge-In: The Mechanics of Natural Conversation
Turn-Taking and Barge-In: The Mechanics of Natural Conversation. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Latency in Voice AI: Why Sub-500ms Matters
Latency in Voice AI: Why Sub-500ms Matters. A practical, vendor-neutral guide for teams building or buying voice AI agents.
Voice Agents vs Chatbots: When to Use Which
Voice Agents vs Chatbots: When to Use Which. A practical, vendor-neutral guide for teams building or buying voice AI agents.
How a Conversational Voice Agent Actually Works (Under the Hood)
How a Conversational Voice Agent Actually Works (Under the Hood). A practical, vendor-neutral guide for teams building or buying voice AI agents.