75/100
The median B2B site scores 75/100 for AI readiness; entry to the top decile starts at 86.
Scored across crawl access, structured data, extractability, answerability, entity clarity, trust, freshness, and off-site presence.
n=636 sites ·
Benchmark report
We audited 636 B2B websites for how visible, structured, and citable they are to AI answer engines. The median scores 75/100. Here is where the average site wins, where it falls short, and what it means for being cited by ChatGPT, Perplexity, and Google AI Overviews.
Live data, refreshed weekly · · Methodology
636
B2B sites analyzed
75/100
Median AI-readiness score
47–91
Score range
86/100
Top-decile threshold (p90)
Quotable, anchored, datestamped
Every figure below is computed from the live dataset. Each card carries its own anchor and sample size, and the copy button gives you a ready-to-paste citation.
75/100
Scored across crawl access, structured data, extractability, answerability, entity clarity, trust, freshness, and off-site presence.
n=636 sites ·
67–82
Clearing the pack does not require excellence; it requires fixing what most sites leave broken.
n=636 sites ·
98/100
Differentiation has moved down the stack, from access to structure and evidence.
n=636 sites ·
+17
If you want to know what the best sites do differently, start here.
n=636 sites ·
91/100
AI readiness is a maintained property, not a finished project.
n=636 sites ·
Overall score distribution
The middle half of the field sits in the 67–82 band around a median of 75. The tail above 86 is where AI engines find sites they can parse, verify, and cite without guesswork.
By audit dimension, weakest first
Most sites have the basics covered: bot access & control plane averages 98/100. The gap is in the machine-readable layer: structured data averages just 49/100. Each bar shows the middle half of the field (band), the median (tick), and the 90th percentile (dot).
avg 49
The machine-readable layer. JSON-LD tells an engine who you are, what you sell, and which page answers what. The bottom quartile ships essentially none of it, which makes correct citation a coin flip.
avg 57
Datestamps, bylines, and article schema. Engines discount undated, unattributed content, and half the field shows almost no freshness signals at all. This is also where the top decile separates hardest.
avg 68
Whether a machine can tell who you are: organization schema, the brand name in title and H1, linked profiles. Ambiguity here is how engines mix you up with a competitor.
avg 73
About, contact, privacy and terms pages, security headers, no exposed secrets. Engines weigh accountability signals when deciding what is safe to recommend.
avg 78
p25 83 · median 83 · p75 92 · p90 100
What the rest of the web says about you: third-party mentions, source diversity, authority, recency. The hardest dimension to fake, and a large separator at the top of the field.
avg 79
Question-shaped headings, definitions, lists, and concrete data points an engine can lift verbatim. Decent on average; the gap between adequate and quotable is where citations are won.
avg 84
Clean titles, a single H1, sane text-to-markup ratio, alt text. Mostly competent across the field; failures here are self-inflicted and cheap to fix.
avg 89
p25 70 · median 100 · p75 100 · p90 100
HTTPS, fast responses, no redirect chains, and content that exists without running JavaScript. The median site passes outright; the bottom quartile pays a steep tax, often for JS-only rendering.
avg 98
p25 100 · median 100 · p75 100 · p90 100
robots.txt, sitemaps, and AI-crawler policy. Effectively solved: nearly everyone lets the engines in. Letting them in is not the same as giving them something to cite.
Gap between the median site and the 90th percentile
Where the spread between the median and the 90th percentile is widest, the best sites are doing something the rest are not. Where it is narrow, the dimension is either solved or uniformly neglected.
| Dimension | Median | p90 | Gap |
|---|---|---|---|
| Off-site Presence & Mentions | 83 | 100 | +17 |
| Fetch, Render, and URL Integrity | 100 | 100 | +0 |
| Bot Access & Control Plane | 100 | 100 | +0 |
How the data is collected
Figures aggregate automated AI-readiness audits of 636 public B2B websites, scored 0–100 across nine dimensions covering crawl access, structured data, extractability, answerability, entity clarity, trust, freshness, and off-site presence. Each domain contributes its most recent completed audit inside a 365-day window. The sample is self-selected: these are sites whose teams chose to run an audit, which likely skews it toward the AI-aware end of the market. Data as of .
The full scoring rubric, including every check, weight, and known limitation, is public: how the audit scores sites. The live, interactive view and the per-brand leaderboard live in the audit app: app.nyman.media/insights.