nothing like a six-week experiment that cost $50k to teach you your 'brilliant' idea is actually garbage. Performance marketer at a B2B SaaS - we thought aggressive personalization in cold outbound would be a slam dunk. More data points = more relevance = better replies. standard logic, right?
set up a split test: control group got our usual 1-2 personalisation points per email, experiment group got 4-6 - subject line, opener, middle, close, all referencing specific prospect or company data. Sourced from Apollo and LinkedIn signal tools, all human-curated, no AI slop. Expected reply rate to jump from 6% to 9%.
What actually happened? Reply rate dropped to 4%. And the quality of those replies was worse - more "wait, how do you know that about me?" and fewer actual conversations. positive reply rate halved.
We dug into the data and talked to the people who replied negatively. Three patterns emerged:
Creepy threshold. There's a line between "you did your homework" and "you're surveilling me." "You posted about X last Tuesday" is fine. "You posted about X, commented on Y, and your company just hired for Z" feels like stalking. We crossed that line.
Pattern recognition. B2B buyers have seen a million AI-personalised emails by now. even though our personalisation was 100% human, the density of references triggered their "this is AI garbage" detector.
Reduced trust in individual claims. five personalisation points and one is slightly off? That one error becomes everything. with one point, they trust it more.
So the takeaway: personalisation has a U-shape relationship with reply rate. Zero is bad, too much is worse. We pulled back to 1-2 points, made them hyper-specific rather than numerous, and killed the deep personalisation pipeline.
cost us about $40-60k all in. Painful, but educational. if you're testing personalisation depth, for the love of everything test against a control. look at reply quality, not just volume. And talk to the people who told you to bugger off - they'll tell you where you went wrong.
the growth community loves to say "more personalisation always wins." No. It depends on your prospect's AI-outbound fatigue, how specific your points are, and the trust level in your category. Not intuitive, but real.
Negative results are where the learning's at. tired of seeing only success stories on here - failures teach you way more.