If you can only track one metric for AI customer support, it should be First-Contact Resolution (FCR). Not deflection. Not handle time. Not even CSAT. FCR is the metric that captures what customers actually want — their issue, fixed, in one go — and it's the metric most resistant to gaming. This is why it matters and how to measure it honestly.

TL;DR

FCR is the percentage of contacts where the customer's issue is fully resolved on the first interaction.
It's the most customer-centric support metric — what they care about most.
AI can move FCR up or down. Most teams see modest gains; a few see big gains.
Don't tune AI for any metric in isolation. FCR plus CSAT plus cost is the right triangle.

What FCR actually measures

A contact is "first-contact resolved" if:

The customer reaches out about an issue.
The interaction concludes with the issue resolved.
The customer doesn't return for the same issue within a defined window (typically 7-14 days).

If the customer has to call back, chat, email, or escalate within the window, it's not FCR.

Why FCR is the right north star

Three reasons:

1. It captures customer experience. Customers don't care about your handle time. They care about whether their problem got solved.

2. It's hard to game. Most metrics can be gamed (deflection by hanging up on people, AHT by rushing). FCR is harder — you can't fake a customer not calling back.

3. It correlates with everything else. High FCR usually means low repeat-contact rate, high CSAT, lower cost per resolved issue. Low FCR usually drags everything down.

How AI affects FCR

Two directions:

AI can lift FCR by:

Resolving more bounded issues without escalation.
Faster lookups (data the human would have to search for).
Fewer transfers (single AI agent vs human-to-human handoff).
Better post-call follow-through (automated SMS confirmations).

AI can hurt FCR by:

Resolving "wrong" — customer thinks issue is fixed but it isn't.
Premature escalation that loses context.
Half-resolutions ("we'll get back to you").

The net effect depends on implementation quality. Good AI deployments lift FCR by 5-15 percentage points. Bad ones drag it down.

How to measure FCR for AI

The basic computation:

FCR = (Contacts that were not followed by a same-issue
       contact from the same customer within N days)
    / Total contacts

For AI specifically, segment:

AI-only resolved (no human involved)
AI + human (escalated to human, then resolved)
Human-only (skipped AI for some reason)

Compare AI-only FCR to human-only FCR. The gap tells you whether AI is actually helping or just shifting work.

Same-issue matching

The hard part: identifying which follow-up contacts are about the "same issue."

Approaches:

Topic clustering. Use intent tags or LLM-based classification to cluster.

Exact ticket matching. If the AI's first contact created a ticket, look for callbacks referencing the same ticket.

Customer-driven. Ask the customer at the start: "Are you calling about the same issue as last time?"

In practice, most teams use a combination of intent tags + a 14-day window.

What pushes FCR up

Specific tactics that move the needle:

Resolution checklists. The agent ensures all common follow-up needs are addressed before ending the call. ("I see you also have an upcoming appointment — want me to send you the calendar invite while we're talking?")

Explicit confirmation. "Just to confirm, we've resolved both items — the refund is processing and your address is updated. Anything else?" Catches half-resolutions.

Post-call follow-up. Automated SMS or email confirming what was done. Reduces "did this actually happen?" callbacks.

Knowledge base completeness. Agent has the answers to questions that previously required transfer.

What pushes FCR down

Common AI failures that hurt FCR:

Premature resolution. Agent thinks it solved the issue; it didn't.

Lost context on escalation. Customer has to re-explain to the human; problem-solving restarts; sometimes incomplete.

Over-deflection. Agent handles the call but customer leaves dissatisfied; calls back via different channel.

Action without follow-through. Agent says "we'll process this within 24 hours" — doesn't.

Setting FCR targets

Reasonable benchmarks:

Cohort	Typical FCR
Best-in-class human contact center	75-85%
Average human contact center	65-75%
Mature AI deployment	70-85%
New AI deployment	50-65%
Broken AI deployment	under 50%

Target: AI FCR within 5 points of human FCR (and ideally above).

FCR as a leading indicator

Watch FCR weekly. It moves before CSAT.

If FCR drops:

Customers are experiencing un-resolved interactions.
They'll express it in CSAT 2-4 weeks later.
And in churn 1-3 months later.

Catching FCR drops early lets you fix the root cause before downstream metrics decline.

What FCR doesn't capture

Three blind spots:

Long-tail high-value issues. A 30-minute call that resolves a complex issue is technically lower FCR by call count but high value.

Channel-shifting. Customer calls AI, gets unsatisfied, goes to chat with a human. Counts as not-FCR but the journey eventually resolved.

Complex new-customer onboarding. Multiple interactions are expected. Don't penalize the AI for this.

For these cases, segment FCR by intent type.

The FCR + CSAT + cost triangle

Don't optimize for FCR alone. The right north star is the trio:

FCR up
CSAT stable or up
Cost per resolved issue down

If two move the right direction and one doesn't, investigate. If you're sacrificing one for another, you're probably making a bad trade.

For more on broader metrics, see how to measure voice agent quality.

FAQ

What's the right callback window for FCR? 14 days for general support. 7 days for time-sensitive. 30 days for less urgent.

Should I measure FCR per channel or aggregated? Both. Per-channel for diagnostics; aggregated for executive reporting.

How does FCR interact with proactive outreach? Outbound contacts (the agent calls the customer) shouldn't count toward FCR — different concept.

Can FCR be too high? Suspiciously yes. 95%+ FCR usually means escalation isn't happening when it should.

What about FCR for issues spanning multiple departments? Complex. Some teams compute "first-team resolution" instead — first team that touched the issue resolved it, even if multiple touch.

Why First-Contact Resolution Is the North Star for AI Support

TL;DR

What FCR actually measures

Why FCR is the right north star

How AI affects FCR

How to measure FCR for AI

Same-issue matching

What pushes FCR up

What pushes FCR down

Setting FCR targets

FCR as a leading indicator

What FCR doesn't capture

The FCR + CSAT + cost triangle

FAQ

More from Tyler Weitzman

Open-Source vs Proprietary Voice Agent Stacks

Build vs Buy: When to Build Your Own Voice Agent

Voice Agents for Developer Support

Related reading

How to Calculate ROI for AI Customer Support

How to Tag and Categorize AI Conversations

Cutting Average Handle Time with Voice Agents

Voice AI, twice a month.