Skip to content
AEOeye
All articles
Strategy

The Brand Visibility Metrics That Matter in the AI Era

Jun 25, 2026·8 min read
Abstract visualization of data analytics with graphs and charts showing dynamic growth.
Photo by Negative Space on Pexels

There's a moment most marketing teams hit in 2026. Someone asks ChatGPT to recommend a tool in your category, your brand doesn't come up, and a competitor you barely take seriously does. The room goes quiet. Then someone asks the question that should have been answered six months earlier: how would we even know if this is happening at scale?

That question is the whole game. AI assistants have become a recommendation layer between buyers and brands, and that layer is mostly opaque. You can't see it in Google Analytics. It doesn't show up cleanly in your CRM. And the old SEO instinct — check the ranking, check the traffic — measures the wrong thing entirely. You need a different scorecard.

This is a practical guide to that scorecard: the five metrics that actually tell you whether AI assistants know and trust your brand, how to track each one over time, and what a good number looks like.

Why your old metrics are lying to you

Keyword rankings tell you where a blue link sits on a results page. But when someone asks Perplexity "what's the best project management tool for a 12-person agency," there is no page of ten links. There's a synthesized answer that names two or three products, cites a handful of sources, and frames each one with an adjective. You are either in that answer or you are invisible. Position 7 doesn't exist.

Organic traffic is worse as a proxy, because the most valuable AI interactions never produce a click. The buyer reads the recommendation, forms an impression, and clicks through later via a branded search or types your URL directly. By the time the visit lands in your analytics, the attribution is gone. Your traffic can look flat while your brand is quietly being recommended — or quietly being dropped.

The single biggest measurement mistake I see: teams treat AI visibility as a traffic problem when it's a reputation problem. You're not optimizing for clicks anymore. You're optimizing for whether a machine vouches for you when a buyer asks. Measure the vouching.

The five metrics that matter

1. Mention rate

What it is: the percentage of relevant prompts where your brand appears in the answer at all. Run a fixed set of buyer questions across the assistants that matter to you, count how many name you, divide.

This is your foundation metric. Before share of voice, before sentiment, before anything — does the model know you exist in this context? Mention rate answers that.

The discipline is in the prompt set. Pick 30 to 50 questions a real buyer would ask: category questions ("best CRM for solar installers"), comparison questions ("Notion vs Coda for a small team"), problem questions ("how do I stop losing leads from my website"), and a few branded ones ("is Acme any good"). Lock that list. Run it on a schedule — weekly is plenty — across ChatGPT, Perplexity, Google AI Overviews, Gemini, and Claude. The list has to stay fixed, or you're measuring prompt noise instead of brand movement.

What good looks like: for a category you genuinely compete in, a healthy challenger sits around 30-50% mention rate. Category leaders hit 70%+. If you're under 15% on questions you should obviously appear in, you have a discovery problem — the models don't have enough corroborated information to surface you confidently.

2. Share of voice in AI answers

Mention rate tells you if you show up. Share of voice tells you how much room you take up relative to competitors. When an answer names five tools and you're one of them, you have 20% share of voice on that prompt. Average it across your prompt set and you get a single number you can put on a slide.

This is the metric that survives contact with an executive. "We appear in 38% of relevant AI answers and hold 22% share of voice versus the category leader's 41%" is a sentence a CMO can act on. It frames the gap as competitive, not abstract.

Track it as a stacked trend over time — your share against your top three named rivals. The shape of that chart tells you more than any single reading. Is your slice growing while a competitor's shrinks? You're winning the corpus. Are you flat while a new entrant climbs fast? Someone just published something the models love, and you should find out what.

3. Average rank within the answer

Order matters more than people expect. The first brand named in an AI answer carries the implicit weight of "the default choice." Being mentioned third, after two competitors, in a hedged sentence, is a materially weaker position than leading the list — even though both count as a "mention."

So track position. For every answer that names you, record where you fall in the sequence of brands. Average it. A brand that's always mentioned but always last has a positioning problem, not a discovery problem, and the fix is completely different.

What good looks like: consistent placement in the top two named options for your core category prompts. If your mention rate is high but your average rank is 4th or 5th, the models see you as a viable-but-secondary option. That's a content and authority gap — you need more sources framing you as a leader, not just a participant.

4. Sentiment

A mention is not automatically a good mention. Models attach framing: "Acme is a solid budget option" lands very differently from "Acme is the most reliable choice for enterprise teams," and "Acme has had reliability complaints" is actively costing you deals.

Classify the sentiment of each mention as positive, neutral, or negative, and watch the mix. Most mentions are neutral — that's normal. What you're hunting for is two things: the share of clearly positive framing (your asset) and any recurring negative framing (your emergency). Negative sentiment in AI answers is uniquely dangerous because it's stated with the calm authority of a neutral narrator, and buyers tend to believe it.

When you find a recurring negative — "limited integrations," "steep learning curve," "pricing isn't transparent" — trace it to its source. Usually it's a specific review roundup, a Reddit thread, or your own outdated docs that the model is leaning on. Fix the source material and the framing follows, though it lags by weeks.

5. Competitor gap

The first four metrics are about you. This one is about the distance between you and the brands beating you, measured the same way for everyone. Run your exact prompt set, score every significant competitor on mention rate, share of voice, rank, and sentiment, and put it in one grid.

The gap grid is where strategy comes from. It surfaces the uncomfortable, useful questions:

  • A rival has 2x your mention rate on problem-based prompts but you're even on branded prompts. They've won the top of funnel; you're only found by people already looking for you.
  • You lead on sentiment but trail on share of voice. The models like you but don't think of you first — an awareness problem, not a trust problem.
  • A brand you've never heard of keeps appearing. Go read what the models are citing about them. There's a playbook there.

This is the part of the work that's tedious to do by hand and the main reason tools exist. AEOeye's free audit runs a prompt set across the major assistants and returns exactly this grid — your mention rate, share of voice, and where you sit against named competitors — which is a faster way to get a baseline than building the harness yourself.

How to actually track this over time

Four rules separate a real measurement program from a one-off screenshot you forget about.

Freeze the prompt set. Movement only means something against a constant. Version your prompt list, and when you add questions, start a new tracked series rather than contaminating the old one.

Run each prompt several times per assistant. These models are non-deterministic — the same question can name different brands across runs. One run is an anecdote. Five runs per prompt, averaged, is a measurement. Without this, you'll mistake normal sampling variance for real change and chase ghosts.

Separate the assistants. Don't average ChatGPT, Perplexity, and Gemini into one blended number. They draw on different sources and behave differently — you might dominate Perplexity (which leans heavily on fresh, citable web content) while barely registering in ChatGPT. The per-engine breakdown tells you where to spend effort.

Set a cadence and a review ritual. Weekly automated runs, a monthly human review of the trend lines. The monthly review is where you ask the only question that matters: what changed, and what did we do that caused it? Most teams collect the data and never close that loop, which makes the whole exercise theater.

What to do with a bad baseline

Low mention rate means the models lack corroborated information to recommend you — invest in third-party presence: comparison content, review platforms, mentions on sites the models cite. Low rank with decent mention rate means you're known but not framed as a leader — build authority signals and get cited by sources that rank brands. Bad sentiment means there's poison in the well — find the source and neutralize it. A widening competitor gap on problem prompts means someone is out-publishing you on the questions buyers actually ask.

Pick the worst of your five metrics and fix that first. The mistake is trying to move all five at once; they don't respond at the same speed, and you won't learn what worked.

The brands that win the AI recommendation layer over the next two years won't be the ones with the best product. They'll be the ones who noticed the layer existed, started measuring it before their competitors did, and treated a drop in mention rate as the same kind of fire alarm a traffic crash used to be. Start with a baseline. You can't improve a number you've never looked at.

FAQ

What is mention rate and how do I calculate it?+

Mention rate is the percentage of relevant buyer prompts where your brand appears in the AI answer at all. Run a fixed set of 30-50 category, comparison, and problem questions across assistants like ChatGPT and Perplexity, count how many name your brand, and divide by the total. It's the foundation metric — it tells you whether the models even know you exist in a given context before you worry about rank or sentiment.

Why can't I just use Google Analytics or keyword rankings to measure AI visibility?+

Because both measure the wrong thing. Keyword rankings assume a page of blue links, but AI answers synthesize two or three named recommendations with no 'position 7.' And the most valuable AI interactions never produce a trackable click — a buyer reads a recommendation, forms an impression, and converts later via branded search, so the attribution is lost. AI visibility is a reputation metric, not a traffic metric.

What's a good mention rate for my brand?+

It depends on whether you're a leader or challenger in your category. A healthy challenger typically sits around 30-50% mention rate on relevant prompts, while category leaders hit 70% or more. If you're under 15% on questions you should obviously appear in, you have a discovery problem: the models lack enough corroborated third-party information to recommend you confidently.

How often should I track AI visibility metrics?+

Run automated measurement weekly with a frozen prompt set, executing each prompt several times per assistant to average out the models' non-deterministic output. Then hold a monthly human review of the trend lines to ask what changed and what you did that caused it. The weekly cadence catches movement; the monthly review closes the loop between action and result.

Why does sentiment in AI answers matter so much?+

Because AI assistants state framing with the calm authority of a neutral narrator, so buyers tend to believe it. A recurring negative like 'limited integrations' or 'pricing isn't transparent' actively costs deals. When you find negative framing, trace it to its source — often a review roundup, a forum thread, or your own outdated docs — and fix the source material, since the framing follows weeks later.

Is AI recommending you?

Run a free AI visibility audit and find out in under a minute.

Free · No signup · Results in under a minute

Keep reading