Skip to content
AEOeye
All articles
Technical

Structured Data for AI: The Schema That Makes You Quotable

Jun 25, 2026·8 min read
Detailed view of Ruby on Rails code highlighting software development intricacies.
Photo by Digital Buggu on Pexels

There's a stubborn myth going around: that large language models don't read your schema markup, so structured data is dead weight for AEO. It's half true, and the half that's false is the half that costs you citations.

Here's the actual mechanism. The reasoning model answering a question in ChatGPT or Perplexity usually never sees your raw JSON-LD. But the retrieval layer that decides which pages get fed to that model often does. Google's AI Overviews lean directly on the same structured-data pipeline that powers rich results. Perplexity's crawler parses JSON-LD to build its index. Bing — which underpins Copilot and a chunk of ChatGPT's web tool — has consumed schema for a decade. Structured data isn't what makes the model smart. It's what makes you findable, parseable, and unambiguous at the exact moment a fact gets extracted.

So the question isn't "do LLMs read schema." It's "when an answer engine tries to state a fact about my brand, can it pull a clean, machine-verified value, or does it have to guess from prose?" Schema is how you stop it guessing.

Why structured data changes extraction, not ranking

Think about what an answer engine does differently from a blue-link search engine. Old Google ranked pages. Answer engines extract claims and then attribute them. To say "Acme's starter plan is $29/month," the system needs to (a) be confident that's the current price and (b) trust the source enough to repeat it.

Unstructured prose forces probabilistic reading. "Plans start at just $29" buried in a paragraph might be current, might be a 2022 blog post, might be a competitor's quote. A Product with an Offer carrying price: "29.00" and priceCurrency: "USD" is a typed assertion. One is a hint. The other is a fact with a data type attached.

The brands winning citations in 2026 aren't the ones with the most content. They're the ones whose key facts — price, founding date, what the product actually does — exist in a form a machine can lift without interpretation. Ambiguity is the tax you pay for skipping schema.

That's the whole game. Structured data converts your marketing copy into facts that survive the trip from your page into a generated answer with their meaning intact.

The four schema types that earn their keep

Most schema guides list forty types. For AEO, four do the heavy lifting. Implement these well before you touch anything else.

Organization — your identity anchor

This is the most underrated type for AI visibility, because it's the one that tells answer engines who you even are. When someone asks "is Acme legit" or "who makes Acme," the model is assembling an entity profile. Organization schema is where you hand it the canonical version instead of letting it stitch one together from scattered mentions.

The fields that matter:

  • name and legalName — the exact string you want quoted, plus the formal one
  • url, logo — identity basics
  • sameAs — links to your LinkedIn, Crunchbase, Wikipedia, Wikidata, X profile. This is the single highest-leverage field in the whole spec.
  • foundingDate, founder, numberOfEmployees — the biographical facts that show up in "tell me about X" answers
  • description — a tight, factual one-sentence summary

sameAs deserves its own paragraph. Entity resolution is the process by which an AI decides that the "Acme" on your site, the "Acme Inc." on LinkedIn, and the "Acme" in a news article are the same thing. sameAs is you drawing that line explicitly. Without it, the model does the matching itself and sometimes gets it wrong — merging you with a similarly named company, or splitting your reputation across two phantom entities. Point sameAs at your authoritative profiles and you collapse that ambiguity.

{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "Acme Analytics",
  "legalName": "Acme Analytics Inc.",
  "url": "https://acme.com",
  "logo": "https://acme.com/logo.png",
  "foundingDate": "2019-03-01",
  "description": "Acme Analytics is a product analytics platform for B2B SaaS teams.",
  "sameAs": [
    "https://www.linkedin.com/company/acme-analytics",
    "https://www.crunchbase.com/organization/acme-analytics",
    "https://en.wikipedia.org/wiki/Acme_Analytics"
  ]
}

Product — the facts buyers ask for

When a comparison query fires ("best X for Y"), the answer engine wants structured attributes it can line up in a table: price, rating, availability, the specific thing the product does. Product markup hands those over directly.

Use Offer for pricing — and keep it synced with your actual price, because a stale price value is worse than none. An AI confidently quoting your 2023 pricing erodes trust the moment a user clicks through. Include aggregateRating only if it reflects real reviews; fabricated ratings are both an ethics problem and a manual-action risk.

The field people skip and shouldn't: description and additionalProperty. A query like "does Acme integrate with Snowflake" gets answered from explicit capability statements. Encode meaningful attributes as additionalProperty (PropertyValue pairs) rather than burying them in a feature wall.

FAQPage — pre-chewed Q&A the model loves

FAQPage is the most AEO-native schema type, full stop. It's literally question-and-answer pairs — the exact shape an answer engine is trying to produce. You're not hoping the model extracts a Q&A; you're handing it one.

Two rules. First, write the question the way a user would phrase it to an AI, not the way your marketing team would: "How much does Acme cost?" not "Pricing & Plans." Match the conversational query. Second, make the answer self-contained and quotable — one or two sentences that stand on their own without the surrounding page. If the answer only makes sense with context the model won't have, it won't get used.

Don't stuff twenty FAQs on every page. Five to eight genuinely-asked questions per page, each answering one thing cleanly, beats a wall of keyword-bait questions that read like they were generated for robots. (They were. Everyone can tell.)

Review — the social proof AI surfaces

When someone asks "is Acme any good," the model reaches for sentiment and specifics. Properly marked-up Review content — with author, reviewRating, reviewBody, and a clear itemReviewed — gives it quotable, attributable opinion. The attribution matters: an AI is far more comfortable repeating "rated 4.6 by 200 users on G2" than an anonymous claim of greatness. Mark up reviews you legitimately host, and make sure third-party review presence (G2, Capterra, Trustpilot) is strong too, because that's where a lot of this gets corroborated.

Implementation that doesn't bite you later

A few hard-won specifics.

Use JSON-LD, not microdata. Inline microdata tangles your markup into your HTML and breaks every time a designer touches the template. JSON-LD lives in a single <script type="application/ld+json"> block, is trivial to template, and is what Google explicitly prefers. There's no real argument here in 2026.

Make schema match the visible page. This is non-negotiable. If your Offer says $29 but the page shows $49, you're not just risking a penalty — you're feeding the answer engine a contradiction it will resolve against you by trusting neither. Schema describes what's on the page. It is not a side channel for claims you can't back up in the body.

Connect entities with @id. Give your Organization a stable @id (e.g. https://acme.com/#organization) and reference it from your Product's brand and your Article's publisher. This builds an entity graph across your site instead of isolated islands of markup. Answer engines doing entity resolution reward the graph.

Validate, then watch. Run every template through Schema.org's validator and Google's Rich Results Test. They catch the silent failures — a wrong enum, a date in the wrong format — that make a whole block get ignored. Validation tells you the syntax is right. It doesn't tell you whether AI assistants are actually citing you, which is a different question entirely. That gap — between "my markup is valid" and "ChatGPT recommends me" — is exactly what AEOeye's free audit was built to measure: it checks how AI assistants currently see and describe your brand, so you can tell whether the schema work is translating into real visibility.

A pragmatic priority order

If you're starting from zero, do it in this sequence. Each step compounds.

  1. Organization schema on your homepage, with a complete sameAs array. This is your identity foundation; nothing works as well without it.
  2. FAQPage on your highest-intent pages — pricing, product, the "how does it work" page. Phrased as real questions.
  3. Product schema with live, accurate pricing on every product/plan page.
  4. Review markup where you host legitimate reviews, plus a push to strengthen third-party review profiles.

Then close the loop. Markup is an input; the output is whether you show up in answers. Ship the schema, give the crawlers a few weeks, and re-audit how the major assistants describe you. If the facts they repeat now match the facts you encoded, the schema is doing its job. If they're still guessing — wrong founding date, vague description, a price you retired last year — you've found exactly where your structured data is thin, and exactly what to fix next.

Schema won't make a mediocre product get recommended. But it will make sure that when an answer engine reaches for a fact about you, it reaches for the right one — stated the way you'd state it, attributed to the source you control. In a world where AI is doing the quoting, being quotable is a technical property. Structured data is how you earn it.

FAQ

Do AI models like ChatGPT actually read schema markup?+

The reasoning model usually doesn't see your raw JSON-LD directly, but the retrieval and indexing layers that decide which pages get fed to it often parse schema heavily. Google AI Overviews, Perplexity's crawler, and Bing (which powers Copilot and part of ChatGPT's web tool) all consume structured data. Schema makes your facts findable and unambiguous at the moment of extraction, even if the LLM itself never parses the JSON.

Which schema.org types matter most for AEO?+

Four do the heavy lifting: Organization (your identity anchor, especially the sameAs field for entity resolution), FAQPage (pre-formatted Q&A that matches what answer engines produce), Product (price, ratings, and capabilities for comparison queries), and Review (attributable social proof). Implement these well before touching the dozens of other available types.

Why is the sameAs field so important?+

sameAs links your site to authoritative profiles like LinkedIn, Crunchbase, Wikipedia, and Wikidata. It explicitly tells answer engines that all these mentions refer to the same entity, which prevents the model from merging you with a similarly named company or splitting your reputation across phantom entities. It's the single highest-leverage field for AI entity resolution.

Should I use JSON-LD or microdata for structured data?+

Use JSON-LD. It lives in a single script block, is easy to template, survives design changes, and is what Google explicitly prefers. Microdata tangles markup into your HTML and breaks whenever the template changes. In 2026 there's no real argument for microdata over JSON-LD.

Can structured data backfire?+

Yes, if it contradicts your visible page. Schema that claims a price, rating, or fact not shown in the body can trigger manual penalties and, worse, feeds answer engines a contradiction they resolve by trusting neither value. Always keep markup synced with what's actually on the page, and never fabricate ratings or attributes.

How do I know if my schema is actually improving AI visibility?+

Validation tools confirm your syntax is correct but not whether AI assistants cite you. Those are different questions. Ship the schema, wait a few weeks for crawlers, then audit how assistants describe your brand. AEOeye's free audit measures exactly this gap between valid markup and real AI visibility, showing whether the facts assistants repeat match the ones you encoded.

Is AI recommending you?

Run a free AI visibility audit and find out in under a minute.

Free · No signup · Results in under a minute

Keep reading