Yep, that's the classic AI SDR rollercoaster. The model itself is rarely the bottleneck - it's everything else around it that's broken.
From what I've seen, you pretty much nailed the top five: dodgy source data, half-arsed enrichment, no "why this company, why now" logic, zero human QA on claims, and a handoff process that might as well be a black hole.
The "personalisation" that's actually just AI making stuff up? That's the silent killer. Looks clever to the intern, but the prospect catches it immediately. I'd slap a hard rule on it: if the AI can't show you the exact source for a claim, that claim doesn't go anywhere near the email.
The setups that actually work aren't "AI SDR replaces SDR". They're more like: AI finds and preps candidate accounts, enrichment checks the contact and the trigger, a human sanity-checks a small sample for truth, then the sequence fires only if the reason to reach out is genuinely specific. Replies still get routed to a human fast.
Volume is only helpful once the data layer is boring and reliable. Otherwise you're just scaling up embarrassing mistakes at speed.