We analyzed millions of AI-generated citations across engines like ChatGPT, Microsoft Copilot, Google AI Overview, and Perplexity to understand what actually drives citation frequency.
The goal was simple: identify whether traditional metrics like web traffic correlate with visibility in AI responses - and uncover what brands can do to influence their presence.
This is a snapshot of a larger dataset we maintain at BrandLight, and it reveals some critical truths - and unexpected patterns - how visibility in AI systems really works.
Methodology: Mapping Citations to Visibility
We studied domains across three axes:
- Citation frequency: how often a domain is cited in AI responses
- Number of distinct sources: how many unique domains cite it
- Estimated web traffic: traditional visit volume
We then explored how these metrics relate - and don’t relate - to each other.
Key Findings
1. Website Traffic Does Not Predict AI Citations
Domains with massive traffic often had minimal citations. Meanwhile, some low-traffic sites appeared in tens of thousands of AI responses.
Example:
- A domain with only 8,500 visits appeared in 23,787 citations.
- Another with 15 billion visits was cited less.
Insight: AI engines value authority and contextual relevance over raw traffic numbers. Popularity doesn’t equal trust in the eyes of LLMs.
.png)
2. Citations Are Strongly Tied to Source Diversity
The strongest correlation was between citation frequency and the number of unique domains referencing a page.
Insight: AI engines seem to pick up signals based on how widely a brand or page is discussed, not just how often.

Data Snapshots
Domains with High Citations, Low Traffic
These domains appeared thousands of times in AI responses but had little web traffic:
- 23,787 citations, 8.5K visits
- 15,423 citations, 677K visits
- 12,552 citations, 16 visits (!)
Domains with High Traffic, Low Citations
These pages attracted millions or even billions of visits but were rarely cited:
- 1.5B visits, <5K citations
- 6.1B visits, minimal citation activity
Benchmark Performers: High Sources, High Citations
A small set of domains achieved both breadth (many sources) and frequency (high citations). These are likely reference hubs: well-structured, often cited, and trusted across contexts.

Visual Analysis
- Citations vs Visits: Virtually no correlation (r = 0.02)
- Citations vs Sources: Strong positive correlation (r = 0.71)
- Visits vs Sources: Weak correlation (r = 0.14)
These charts reinforce the point: traffic and visibility in AI are two different games.
Strategic Takeaways for Brands
- Don’t assume high traffic means high AI visibility
- Many high-citation pages have almost no direct audience
- Focus on ecosystem influence
- Get mentioned across diverse, trusted domains
- Prioritize citations in sources like Wikipedia, Reddit, and editorial media
- Track and shape citations
- Tools like BrandLight.ai help map what’s being cited, where, and how
- Then build or influence those touchpoints directly
Final Word: Visibility is the New Ranking
If you're still equating SEO traffic with impact in AI search, you're missing how LLMs actually surface answers.
Citation quality, contextual breadth, and source diversity are the new levers.
This is why we’re investing in AEO: AI Engine Optimization - helping brands move beyond keywords and start shaping their presence across the sources AI actually learns from.
Read our foundational research on AI source behavior
Want to know how often you're actually being cited in AI? Let’s talk.