Most of what I'm seeing in the thread misses the mark on methodology. Worth digging back into the study design.
They didn't test a single site. The tracked group was nearly 1,900 pages that added JSON-LD over a period, matched against roughly 4,000 control pages pulled from different domains with similar pre-existing citation counts. That's the opposite of a single-site experiment. They started with millions of URLs to find the ones that actually made the transition from no-schema to schema.
As for the "only part of a site has schema" argument - the study isn't measuring whether the site as a whole is better understood. It's comparing each page's AI citations over a 30-day window, accounting for broader platform trends with a difference-in-differences approach. Whether schema lives on half the pages or all of them is irrelevant to that design.
The disambiguation point also falls apart when you look at their parallel experiment. A separate test ran on multiple AI systems - ChatGPT, Claude, Perplexity, Gemini, AI Mode - and none of them read schema markup during real-time page fetch. Every single one pulled only the visible HTML. JSON-LD, hidden Microdata, hidden RDFa - all ignored at retrieval. If the AI isn't consuming the schema at the moment it decides what to cite, then writing it to clarify "Sandy Shores" is like whispering directions to someone wearing noise-cancelled headphones.
The "bad schema hurts" claim has no real data behind it. They observed a decline of roughly 4.6% in AIO citations for treated pages. But the absolute number is tiny - around a dozen daily citations out of pages that already get hundreds. Both groups were declining together before the change. They can't pin it on schema with any confidence. Honest reading: schema didn't help, might have slightly hurt, but the data isn't strong enough to say either way.
That 53% correlation - schema being present on cited pages - is just a reflection that sites investing in structured data also invest in everything that actually drives citations: solid content, authority, relevance. Strip the schema out and those other signals still carry the page. It's a vanity metric, not a causal lever