Designing Escalation Paths Between AI and Human Agents
The handoff between AI and human is where most "AI customer support" projects succeed or fail. A clean handoff makes the AI feel like a productive teammate. A bad handoff makes the customer repeat themselves to a human who has no context, which is worse than no AI at all.
The handoff between AI and human is where most "AI customer support" projects succeed or fail. A clean handoff makes the AI feel like a productive teammate. A bad handoff makes the customer repeat themselves to a human who has no context, which is worse than no AI at all. The discipline of escalation design matters more than the AI itself.
TL;DR
- The escalation path is its own product. Design it deliberately.
- Hand off with context: what the agent tried, what the customer wants, what's at stake.
- Make escalation fast and obvious β don't make customers fight for it.
- Track escalation appropriateness, not just escalation rate.
When to escalate
Six common triggers:
- Caller explicitly asks. "Can I talk to a human?" β escalate immediately, no second-guessing.
- Caller is upset. Sentiment cues: raised voice, profanity, repeat frustration.
- Multiple failed attempts. Agent has tried 2β3 times to clarify or resolve and is stuck.
- Out of scope. The request falls outside the agent's defined intents.
- High stakes. Refund above policy cap, complaint about a serious issue, sensitive medical/legal context.
- Compliance. Some interactions legally require a human (collections in some jurisdictions, certain healthcare contexts).
What makes a good handoff
Three things every escalation should include:
Context. A 2β3 sentence summary of why the call is escalating and what was tried.
Intent. What the customer actually wants.
State. Any relevant facts captured (account ID, order number, dates discussed).
This goes to the receiving human via:
- Spoken summary if the call transfers immediately.
- Written summary in the ticketing system.
- Both, ideally.
What the customer should hear
The transition language matters:
Good:
"Got it β I'm going to get you to one of our team members who can help with that. One moment."
Bad:
"Transferring..."
(Silence)
Or worse:
"I'm sorry, I can't help with that. click"
The transition should feel intentional and respectful. The customer shouldn't feel like they got hung up on or like the agent gave up.
What the receiving human should hear
If the call transfers warm (immediate to a live human), the human should get a brief audio handoff:
(In the human's headset, before the customer is connected:) "This is Sarah Chen, calling about order 4521 β package delivered but missing. Tried tracking lookup. She'd like a refund or replacement. Refund would be over agent authority."
This takes 3β5 seconds. Worth every one.
If the handoff is asynchronous (customer is told a human will call back), the same context goes into the ticket:
Customer: Sarah Chen, +14155550199
Account: #1976432
Issue: Missing delivery for order #4521 (marked delivered 2 days ago)
Caller wants: Refund of $145 or replacement
What I tried: Looked up tracking, confirmed delivery scan
Why escalating: Refund amount above agent authority
Suggested action: Approve refund or replacement per policy
The "warm vs cold" decision
Warm transfer (live to live) is best when:
- The customer is already engaged.
- The wait for a human is short (under 2 minutes).
- The issue is complex enough that re-explaining would frustrate.
Cold transfer (callback or queued) is acceptable when:
- Wait times are long.
- The customer prefers a callback.
- The issue can wait without urgency.
Bad: dropping the call entirely. Don't do this.
Agent-side: how to invoke escalation
The function:
function transfer_to_human({ reason, summary, urgency }) {
// ...
}
The prompt:
When you need to escalate, call transfer_to_human with:
- reason: a 1-line description of why
- summary: 2-3 sentence summary of the call so far
- urgency: 'immediate' for warm transfer, 'callback' for async
Before calling transfer_to_human, briefly tell the caller:
"I'm getting you to someone who can help with that β one moment."
The "let me get you to someone" choreography
Before calling the function:
- Brief acknowledgment to the caller.
- Function call:
transfer_to_human(...). - Wait for the function to return (it sets up the transfer).
- Confirm to the caller: "OK β connecting you now."
The whole thing takes 3β4 seconds. Done well, it feels intentional.
Tracking escalation quality
Beyond raw rate, track:
Appropriateness. Of escalations, what percentage were correct? (Catch with eval grading.)
Late escalations. Calls where the agent should have escalated earlier but kept trying.
Premature escalations. Calls where the agent escalated easy stuff that should have been resolved.
Customer satisfaction post-escalation. Did the escalated calls end well?
If your escalation rate is "right" (say, 25%) but appropriateness is 60%, you have a problem. The right rate with good appropriateness is the metric.
Common escalation bugs
The "I'll have someone call you" lie. The agent promises a callback but doesn't actually queue one. Always verify the function executed successfully before promising the action.
The summary the human can't use. "Customer needs help." Useless. The summary needs the specifics.
The transfer to a queue with no SLA. Customer waits 30 minutes; gives up. Either escalate to a real human in seconds or don't promise it.
The escalation that escalates too fast. Agent transfers because the caller said "hmm" once. Define the criteria more precisely.
Compliance considerations
Some regulated industries require:
- Documented reason for AI-to-human handoff.
- Same-channel escalation (can't transfer voice to chat).
- Specific timing (must reach a human within N seconds).
Know your regulatory context.
For more on the broader operational pattern, see the definitive guide to AI customer support in 2026.
Related reading
- Building a Tier-1 AI Support Agent Step by Step
- Why "Human-in-the-Loop" Beats "Fully Autonomous" for Most Teams
- How to Calculate ROI for AI Customer Support
- How AI Support Agents Should Handle Account Verification
- Designing AI Agents That Cancel Subscriptions Honestly
FAQ
What's a typical escalation rate? 20β40% for tier-1 support. Higher for complex use cases.
Should the agent always say "transferring to a human"? Not always β some teams use "specialist" or "advisor" as gentler language. Avoid "manager" unless that's literally true.
Can escalation reverse? Rarely. If the human agent decides AI should re-handle, they manually queue. Mostly one-direction.
What about chat-to-voice escalation? Yes, and increasingly common. "Want me to call you?" is a powerful escalation when chat hits its limit.
How do I train the human team to handle escalated calls? Include the AI summary in their workflow. Teach them to verify key facts before continuing.

Rohan Pavuluri builds SIMBA Voice Agents at Speechify. Previously, he founded and led Upsolve, the largest nonprofit in the United States serving low-income Americans through technology. He writes about real-world voice-agent deployments β customer support, outbound sales, AI receptionists β and the practical product, design, and operational lessons that actually move the needle.
More from Rohan Pavuluri
View all βSIMBA vs Avoca: Which AI Voice Agent Platform Is Right for Your Service Business?
Avoca raised $125M at a $1B valuation for home services voice AI. SIMBA takes a different approach β horizontal platform, published pricing, IVR navigation, and a dedicated engineer for every customer.
Voice AI for Commercial Real Estate: Leasing, Tenant Services, and Property Operations
Commercial real estate has distinct communication patterns from residential. Voice AI handles leasing inquiries, building ops, CAM questions, and broker qualification across office, retail, and industrial.
Voice Agents for Tenant Communication: Maintenance, Rent, and Lease Management at Scale
Managing tenant communication at scale breaks at about 200 units per property manager. Voice agents handle the entire lifecycle β inquiries, applications, maintenance, rent, renewals, and move-outs.
Related reading
Why "Human-in-the-Loop" Beats "Fully Autonomous" for Most Teams
The fully autonomous AI customer service agent is the AI industry's preferred fantasy. The reality in 2026 is that the best-performing deployments are hybrid: AI handles most volume, humans handle the edge cases and provide supervision, and the line between them is carefullyβ¦
How to Calculate ROI for AI Customer Support
ROI calculations for AI customer support often use the wrong baselines and the wrong metrics. The result: numbers that look great in a deck but don't match reality once deployed. The right model captures the full cost and benefit stack, including second-order effects.
Designing AI Agents That Cancel Subscriptions Honestly
Subscription cancellation is a legally loaded support interaction. Several jurisdictions now require cancellation to be as easy as signup ("click-to-cancel" laws).
Voice AI, twice a month.
Get the best of the SIMBA resources hub β new articles, trend notes, and operator guides. No spam.
