The pitch is always the same. Connect your CRM, upload a lead list, let AI write personalised outreach at scale. Results in days. Pipeline in weeks. Your SDR quota finally within reach.

Then 60 days in, the reply rate is 0.4%. The spam complaints are up. The tool rep is asking if you've "tried a different sequence." You cancel and tell your team it wasn't the right fit.

This is not a rare outcome. It is the modal outcome. Somewhere between 60 and 75% of AI SDR tool subscriptions get cancelled within the first 90 days. The churn isn't because the tools are fraudulent. Most of them work exactly as described. The problem is that what they do — send AI-generated outbound at volume — is the wrong answer to the right question.

Why Volume-First Fails

Here is what "AI SDR at scale" actually does in production:

You export a list of 2,000 contacts that match your ICP criteria — industry, company size, job title. The tool generates emails for all 2,000. They go out. Some are personalised with the company name and maybe a recent news item the AI scraped. The rest are variations on the same template.

The contacts receiving these emails are also receiving emails from the 47 other companies that bought the same contact list, ran the same ICP filter, and are using the same three AI SDR platforms. The personalisation that felt differentiated when you previewed it is identical in structure to the three emails they got from competitors last Tuesday.

Low engagement is the predictable result. And low engagement is a problem that compounds.

The spam filter feedback loop Gmail and Outlook's deliverability algorithms are engagement-based. Low open rates → low reply rates → low click rates → spam classification. Once your domain starts landing in spam, your reply rates drop further. Which produces even lower engagement scores. Which deepens the spam classification. AI made it cheaper to trigger this loop at 10× the previous scale.

The teams that churn didn't fail to use the tool correctly. They used it exactly as marketed. The tool just scaled an approach that was already broken.

What Volume-First vs. Research-First Looks Like

Dimension Volume-first Research-first
List size 2,000+ contacts 50–200 qualified prospects
Qualification criteria ICP filter (title + industry + size) ICP + behavioural signals (hiring, funding, tech, timing)
AI role Email generation Research, qualification, draft — human approves
First touch basis You match our ICP You just raised Series B + are hiring SDRs = relevant right now
Reply rate (typical) 0.3–0.8% 3–8%
Deliverability trajectory Degrades over time Stable (low volume, high engagement)

The numbers in that last row are not aspirational. They reflect what happens when you contact fewer people more relevantly. The arithmetic of outbound hasn't changed. What AI can change is how much of the research burden falls on a human SDR before the first email goes out.

The 3 Signals That Separate Tools People Keep From Tools They Cancel

After watching a lot of teams set this up, the pattern is consistent. The 30% who don't churn do three things differently from the start:

What AI Can and Can't Do in Outbound (Honest Take)

AI is genuinely good at three things in outbound: finding structured signals at scale, generating first drafts that incorporate those signals, and doing it faster than any human could. A tool that monitors 500 companies for funding events, role changes, and hiring patterns — and surfaces the relevant ones for review each morning — is doing something valuable that wasn't economically possible before.

AI is bad at one thing that matters enormously: knowing whether to send the email. The signal is real. The contact is real. The draft is coherent. But is this the right week? Is this person overextended right now? Is their company in the middle of a restructure that makes new vendor conversations a non-starter? Does the email's opening line feel accurate or will it read as AI-generated to someone who knows their own situation better than any model does?

The teams that churn treat AI as an autonomous outbound system. The teams that don't treat it as a research and drafting layer that makes a human SDR faster and better-informed. The difference sounds subtle. The outcomes are not.

The tool isn't the problem If your AI SDR tool has a 0.4% reply rate, you didn't buy a bad tool. You bought a tool that scales outbound and then used it on a strategy that doesn't produce replies at scale. That's a strategy problem. The tool is doing what it promised.

Why This Keeps Happening

The demo always shows the best case. A researched email to a relevant prospect that happens to reply. The pricing is per-seat or per-send, which creates an incentive to send more, not better. The onboarding is built around getting to your first send quickly, not getting to your first qualified prospect correctly.

Nobody in the sales cycle for an AI SDR tool is incentivised to tell you to send fewer emails. The whole model is built on volume. When you churn at 90 days, they've already made two or three months of revenue and they'll sell to the next team with the same pitch.

The 30% who stick around figured this out — usually by accident, after their first send underperformed and they slowed down to ask why. Slowing down to qualify better before sending is the move. It's just not the default.


Drumroll is built on the assumption that the problem isn't insufficient volume — it's insufficient qualification before the first email goes out. Research-first means using AI to validate fit and surface signals, letting a human make the call, and then sending fewer emails to people who are actually relevant right now. The reply rates are better. The domain health is better. The pipeline is better. And you don't cancel at 90 days wondering what went wrong.

Research first. Send second.

Drumroll qualifies before it emails. Your SDRs approve every send. Free during beta.

You're in — we'll be in touch.

No spam · Just a heads-up when your spot is ready