Does our FRT metric count AI responses or only human responses?

It should count the first substantive response, which includes AI replies that actually resolve the ticket. If your reporting only counts human responses, it's hiding most of your support performance and optimizing the wrong number. Reconfigure it to log AI-resolved contacts at their real first response time, then segment AI-resolved from escalated.

What's a realistic FRT target for escalated tickets during business hours?

For chat escalations, under 5 minutes is achievable and expected. For email, under 1 hour is the ecommerce standard. After hours, set a specific window tied to your next shift rather than a speed target — customers accept after-hours waits when the commitment is concrete, like 'a specialist will reply by 9 AM Monday.'

We're a small team. How do we keep FRT low when volume spikes?

This is where an AI agent has the most leverage. During a spike, the agent absorbs the deflectable majority — order status, returns, FAQs — instantly, so your human team only sees the smaller share that needs judgment. Their queue never balloons, which means human FRT can actually improve during peak rather than collapse.

Can instant responses be wrong, and does that hurt FRT?

Yes, and it's the trap to avoid. A fast wrong answer just generates a second contact, which raises FRT for the next customer in line. Speed and accuracy are the same project: keep help docs current and your catalog connected so the agent answers both instantly and correctly, and track repeat-contact rate as your early warning.

How fast can an AI agent realistically respond?

For tickets it resolves autonomously, an AI agent typically responds in seconds, every hour of every day, with no queue. The main thing that slows it down is latency in data connections or a thin knowledge base, both of which are fixable. The remaining 30% or so that escalate to a human follow the wait-time and handoff rules above.

Playbooks

Cutting First Response Time to Zero with AI Support

Customers have been conditioned by instant experiences everywhere in digital life. Support is the last holdout. An AI agent that takes real actions closes the gap.

The Bookbag Team·June 2026· 14 min read

In this article

Why first response time matters
How fast should first response be?
What 'zero FRT' actually means
What a useful instant response looks like
Which tickets hit zero first
When customers still need a human
After-hours: set a real commitment
Mistakes that quietly inflate FRT
How to measure and maintain it
How Bookbag gets FRT near zero

Why first response time is worth obsessing over

First response time is the gap between when a customer contacts support and when they get a real, relevant reply. Of all the support metrics, it's the one customers feel most directly. Resolution time, handle time, backlog age — those live on your dashboard. First response time lives in the customer's gut. Someone who sends a message about a missing order and hears nothing for six hours spends those six hours forming an opinion about your brand, and it isn't a good one.

Here's the part operators underrate: speed changes the emotional arc of the whole interaction, not just the opening. Studies of support queues consistently find that a fast first response lifts CSAT even when the final resolution takes exactly as long. The customer who gets answered in seconds and resolved in a day rates the experience higher than the customer who waited six hours for an identical resolution. The acknowledgment does work the resolution can't — it tells the customer they were heard.

For ecommerce specifically, the stakes are sharper because so many contacts are time-sensitive by nature. 'Where is my order,' 'cancel before it ships,' 'does this come in a medium' — these aren't idle questions. A slow answer doesn't just annoy; it loses the sale, triggers a chargeback, or sends the buyer to a competitor's tab that's already open.

There's also a compounding cost most teams never put a number on. A customer who waits too long doesn't just sit quietly — they re-contact. They send a second email, open a chat, then DM you on Instagram, and now one question has become three tickets your team has to reconcile. Slow first response manufactures volume. Fast first response is the cheapest deflection lever you have, because the question that gets answered immediately never spawns its three impatient siblings.

Definition

First response time (FRT) measures how long a customer waits for the first substantive reply after reaching out. Critically, that reply can come from an AI agent. If the agent resolves the question in one exchange, FRT for that contact is effectively zero — and so is resolution time.

How fast should first response actually be?

The honest answer is that the bar depends entirely on the channel, and customer expectations have tightened every year. Live chat is judged in seconds; email is judged in hours; social DMs sit somewhere in between but are trending toward chat-speed. The benchmarks below are drawn from industry studies of customer support response times, not Bookbag's own data — use them to set targets, not to grade yourself against a single vanity average.

Two findings cut through the noise. First, expectation badly outruns reality: surveys find roughly 89% of customers expect a reply within an hour, while the cross-industry email average still hovers around 12 hours. Second, speed and retention move together — companies that respond to email within an hour see materially higher retention than those that take a full day. The gap between what customers want and what most stores deliver is exactly the space an AI agent occupies.

One nuance gets lost in the averages: a median FRT can look respectable while your worst-case FRT quietly does the damage. If you answer most chats in 90 seconds but your overnight and weekend tickets sit for ten hours, your customers experience the ten hours, not the median. Those are precisely the contacts an always-on agent erases. When you benchmark yourself, look at the tail — your slowest 20% of responses — not just the comfortable middle of the distribution.

Channel	Typical industry FRT	What customers expect	Realistic AI-agent FRT
Live chat	1.5-3 min during hours, none after	Under 2 min; CSAT peaks at 5-10 sec	Seconds, 24/7
Email	~5-12 hours	Within 1 hour (most prefer faster)	Seconds to minutes, 24/7
WhatsApp / SMS	Minutes to hours	Near-instant, chat-like	Seconds, 24/7
Instagram / Messenger	Hours, often next day	Within an hour	Seconds, 24/7
Phone / voice	Queue-dependent	Under 2 min hold	Instant pickup on supported tiers

Benchmark

Industry studies report live-chat CSAT peaking near 85% when the first reply lands within 5-10 seconds, and roughly 57% of customers abandoning a chat after a 3-minute wait. The window for 'fast' is narrower than most teams assume — and after hours, the human window is zero.

What 'zero first response time' actually means

Zero FRT doesn't mean a robot fires off a canned 'Thanks, we got your message' the instant a ticket lands. That's a fast non-answer, and customers see straight through it. Zero FRT means the customer asks a real question and gets a real, resolving answer in the same breath — no queue, no ticket number, no waiting for a human to wake up.

This is the difference between a scripted chatbot and an AI agent. A scripted bot matches keywords to canned flows and, when the flow runs out, dumps the customer into a queue — so its 'instant response' is usually just a faster handoff. An AI agent reasons over your help docs and live store data, looks up the actual order, and takes the actual action. When a buyer asks where their package is, the agent reads the order, pulls the live tracking, and answers with the carrier, last scan, and delivery date. First response and resolution happen in one message.

Because an agent has no business hours, that capability runs around the clock at identical quality. The 11 PM Saturday order-status question gets the same instant, data-backed answer as the 2 PM Tuesday one. For the share of contacts an agent resolves on its own — commonly up to ~70% for ecommerce stores with clean catalog and order data — FRT stops being a metric you defend and becomes one you can advertise.

It's worth being precise about what drives that number, because it isn't magic. An agent hits near-zero FRT only when it can reach the data the answer depends on. WISMO needs a live order and carrier connection; returns need your policy and a label provider; product questions need a catalog the agent can actually read. The stores that get to instant fastest are the ones that connect those sources up front. The stores that struggle usually have a thin knowledge base or a disconnected catalog, which forces the agent to punt questions it could otherwise close on the first reply.

Instant, every hour: no overnight backlog forming while your team sleeps.
Consistent: the same answer quality at peak as at 3 AM, with no fatigue or rushing.
Single-touch: response and resolution collapse into one exchange for deflectable tickets.
Self-funding speed: faster answers on pre-sale questions recover carts the slow queue would have lost.

What a useful instant response actually looks like

Speed without substance is just a faster brush-off. For an instant response to count, it has to do three things at once: answer the specific question with real information, take any action the answer implies, and leave a clean path to a human if the customer needs one. Miss the first and you've sent boilerplate. Miss the second and you've created a second ticket. Miss the third and you've trapped someone in a loop.

Compare the two columns below. The left is what a keyword bot or an auto-acknowledgment produces. The right is what an agent connected to your store and policies produces. Same response time on the clock; completely different outcome for the customer and your queue.

Ticket	Fast non-answer (avoid)	Useful instant response (aim for)
Order status	"Thanks! We'll look into it."	"Order #4821 shipped via UPS, last scanned in Memphis, arriving Thursday June 4."
Return request	"Please see our returns page."	Eligibility checked, return approved, prepaid label emailed in the same chat.
Product fit	"Check the product description."	"The Trail Jacket runs true to size; for layering, size up. Medium is in stock."
Shipping timeline	"Ships in 3-5 business days."	"Order before 2 PM ET today and it ships today; delivery to 10001 is ~2 days."

Rule of thumb

If the customer has to take a second action after your 'instant' reply — click a link, re-explain, wait for a human to do the lookup — it wasn't a resolution. It was a deferral wearing a fast timestamp.

Which tickets hit zero FRT first

Not every contact is equally easy to answer instantly, and pretending otherwise is how teams get burned. The smart sequence is to push the high-volume, data-backed, repetitive tickets to zero FRT first — they're the bulk of your queue and the lowest risk to automate — then expand into judgment-heavier cases with clear guardrails.

WISMO (where-is-my-order) is almost always the single largest bucket and the easiest win: it's a lookup, not a judgment call. Returns, refunds, and exchanges follow once you've set the rules and caps the agent must respect. Pre-sale product questions are pure upside, because a fast, accurate answer there often converts a hesitant browser into a buyer. The ranking below reflects what most ecommerce stores see.

Sequencing this way also protects your team's trust in the system. Start the agent on the high-confidence, low-risk buckets where instant answers are unambiguously correct, prove the FRT win there, then widen the scope. Trying to automate complaints and goodwill decisions on day one is how stores end up yanking the agent back and reverting to a slow human queue — the opposite of the outcome you want.

Ticket type	Share of queue (typical)	Instant-resolution fit
WISMO / order tracking	30-50%	Very high — live order + carrier lookup
Returns / exchanges	15-25%	High — within merchant rules and caps
Refund status (WISMR)	5-10%	High — status pulled from order record
Pre-sale product questions	10-20%	High — answered from catalog, drives revenue
Discounts / promo questions	5-10%	Medium — policy-dependent
Complaints / damaged items	5-10%	Lower — often needs human judgment or goodwill

WISMO reduction playbook Automate returns and exchanges

When customers still need a human

For the contacts that escalate — typically the harder 30% an agent shouldn't resolve alone — FRT still matters, but the standard shifts. The agent should give a genuine, instant first response and then set an honest expectation for the human follow-up. The customer should never feel dropped into silence between the AI answer and the human pickup.

The worst experience in a hybrid setup isn't the wait itself; it's the unannounced wait. A customer who gets an instant agent reply and then sits for 45 minutes with no word feels more abandoned than one who was told upfront, 'a specialist will reply within the hour.' Proactive, specific wait communication is what keeps an escalation from curdling into a complaint.

1When escalating, state the expected wait based on the live queue. Say '4 minutes' or 'under 30 minutes,' never 'shortly.'
2If the wait will exceed 15 minutes during business hours, offer an async option: 'I can have a specialist email you within the hour instead.'
3At the 15-minute mark in queue, auto-send a status update so the customer knows they weren't forgotten — this alone cuts abandonment and repeat contacts.
4Pass full context to the human: the customer's question, the order, and what the agent already tried, so the human doesn't re-interrogate the customer.
5If a human picks up faster than promised, that's a positive surprise. Always under-promise the wait, never under-state it.

Why context transfer is the hidden FRT

A 90-second human pickup feels slow if the agent makes the customer repeat everything. Handing the human the full conversation and order detail means the human's first reply is also their resolving reply — the second FRT in the interaction effectively drops to zero too.

After-hours: set a real commitment, not 'soon'

After hours is where an AI agent earns its keep, because the human alternative is a void. Autonomous resolutions keep happening instantly — the 11 PM order-status question is answered, the return label goes out, the refund status is read back — exactly as during the day. The only thing that changes is what happens on escalations, and that's purely a matter of expectation-setting.

When the agent can't resolve something at night and no human is on, the message should do four things: acknowledge the request, confirm it's logged, state precisely when a human will reply, and confirm the contact details. 'A specialist will reply by 9 AM ET Monday, June 30' beats 'we'll get back to you tomorrow' every time, because it converts an open-ended wait into a kept promise. Vague commitments are how after-hours tickets become next-day complaints.

The strongest demonstration of after-hours value is the multi-step action the agent completes while everyone sleeps. A customer starts a return at 11 PM; the agent checks eligibility, approves it, and emails the prepaid label. The customer wakes up done — no wait, no human, no follow-up. That single moment does more for CSAT and repeat purchase than any apology email ever will.

Offer 24/7 support without hiring

Mistakes that quietly inflate first response time

Most FRT problems aren't dramatic outages — they're small structural choices that add minutes and hours nobody is watching. The four below are the ones we see most often when a store's reported FRT looks fine but customers still complain about slowness.

The sneakiest is a reporting blind spot: counting only human replies in your FRT metric. If your dashboard ignores AI-resolved contacts, you're hiding the majority of your support performance and optimizing the wrong number. Fix the measurement before you chase the metric.

Counting only human responses in FRT, so instant AI resolutions never show up — your real FRT is far better than the dashboard says, or far worse, and you can't tell which.
An auto-acknowledgment that resets the FRT clock without answering anything, letting you 'hit' a target while customers still wait for substance.
Stale or thin help docs and a disconnected catalog, which force the agent to escalate questions it should have answered — turning instant tickets into queued ones.
No queue-aware wait messaging, so escalated customers re-contact across email and chat, inflating volume and pushing FRT up for everyone behind them.

The accuracy-speed link

Speed and accuracy aren't a trade-off here — they're the same project. An agent that answers wrong fast just generates a second contact, which raises FRT for the next person in line. Good knowledge coverage is what lets an agent be both instant and right.

How to measure and maintain near-zero FRT

What gets measured gets defended, so report FRT weekly and split it by where the response came from. A single blended average hides the story; segmenting it tells you exactly where to act. The targets below are practical starting points, not Bookbag guarantees — tune them to your channels and volume.

The compounding benefit is worth naming: when an agent resolves a large share of tickets instantly, your human team's FRT on the remainder improves too, because there's far less volume competing for their attention. Deflection and speed reinforce each other. During a BFCM-style spike, the agent absorbs the deflectable surge so the human queue never balloons in the first place.

Separate AI-resolved FRT from escalated FRT so instant wins don't mask a slow human queue (or vice versa).
Watch repeat-contact rate alongside FRT — if speed rises but repeats rise too, you're answering fast and wrong.
Review the agent's escalation reasons monthly; recurring escalations point to a knowledge gap you can close to push more tickets to zero.
Retrain the agent after catalog changes, new policies, or peak season so its instant answers stay accurate.

Metric to track	Why it matters	Practical target
AI-resolved FRT	Catches latency in data connections or config	Under 10 seconds
Escalated FRT (in hours)	The human experience, isolated from AI	Under 5 min chat / under 1 hr email
After-hours follow-through	SLA compliance, not raw speed	100% within committed window
FRT by channel	Each channel has its own baseline	Manage chat, email, social separately
Repeat-contact rate	Catches fast-but-wrong answers	Trending down quarter over quarter

How Bookbag drives first response time toward zero

Bookbag is an AI customer support agent built for Shopify and ecommerce, and near-zero FRT is the natural result of how it's wired, not a separate feature you toggle on. You connect your store, import your help docs and website, and drop a one-line widget on your site. From that point the agent answers from live order data and your real policies — so the first response to a WISMO, return, or product question is also the resolution, around the clock.

It works the same way across channels: website chat, email, WhatsApp, Instagram DM, Facebook Messenger, and Slack, with voice on higher tiers. When a ticket genuinely needs a person, the agent hands off to your shared inbox with the full conversation and order context attached, so the human's first reply can resolve instead of re-interrogate. Pricing is flat monthly plans with message-credit allowances and a spend cap you set — no per-resolution fees, so getting faster never quietly raises your bill.

Bookbag isn't the cheapest help desk on the market, and a tiny store fielding a handful of tickets a week may not need it yet. But for any store where customers contact you after hours, during spikes, or faster than a human queue can keep up, an agent that resolves the deflectable majority instantly is the most direct way to take FRT off your list of problems.

See Bookbag pricing Compare Bookbag vs Gorgias

Key takeaways

First response time shapes the emotional arc of the whole interaction — a fast first reply lifts CSAT even when final resolution takes the same time.
For up to ~70% of ecommerce tickets, an AI agent collapses first response and resolution into one instant exchange, 24/7.
Instant only counts if it's useful — a specific answer from live order and catalog data, not a boilerplate acknowledgment.
On escalations, state a specific wait time and pass full context to the human so their first reply also resolves.
After hours, commit to a real time ('by 9 AM Monday'), and let the agent complete multi-step actions like return labels overnight.
Measure AI-resolved FRT and escalated FRT separately, and watch repeat-contact rate so fast answers don't mask wrong ones.

Cutting First Response Time to Zero with AI Support

Why first response time is worth obsessing over

How fast should first response actually be?

What 'zero first response time' actually means

What a useful instant response actually looks like

Which tickets hit zero FRT first

When customers still need a human

After-hours: set a real commitment, not 'soon'

Mistakes that quietly inflate first response time

How to measure and maintain near-zero FRT

How Bookbag drives first response time toward zero

Key takeaways

Frequently Asked Questions

Keep reading

First Response Time Benchmarks by Channel (2026)

The Ticket Deflection Playbook for Ecommerce

How to Offer 24/7 Customer Support Without Hiring More Staff

Customer Service Response Time Expectations by Channel (2026)

How to Improve CSAT with AI Support: A Practical Playbook

Turn support into your competitive edge