AI search visibility and AI SEO optimization. Activates when discussing AI Overviews, AI Mode, AEO, GEO, LLM SEO, ChatGPT visibility, Perplexity mentions, Gemini AI, Copilot, answer engine optimization, generative engine optimization, or any AI search presence topic. Phase 14. Output: {AUDIT_DIR}/ai-seo-findings.md
From local-seo-auditnpx claudepluginhub anthropics/claude-plugins-community --plugin local-seo-audit-systemThis skill uses the workspace's default tool permissions.
Designs and optimizes AI agent action spaces, tool definitions, observation formats, error recovery, and context for higher task completion rates.
Enables AI agents to execute x402 payments with per-task budgets, spending controls, and non-custodial wallets via MCP tools. Use when agents pay for APIs, services, or other agents.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
AI search is the fastest-growing traffic channel in 2025. Google AI Overviews appear for 20–35% of local service queries (SparkToro 2025). ChatGPT Search has 200M+ monthly active users. Perplexity processes 100M+ queries/month. Being cited in AI answers = significant visibility gain; being absent = invisible to AI-first searchers. This phase audits where the client appears (or doesn't) across all major AI platforms, identifies content gaps causing AI invisibility, and produces a prioritized action plan for AI citation dominance.
2025 AI search benchmarks:
| Tool | Purpose | Cost |
|---|---|---|
| Google Search (incognito) | Test AIO presence for top 20–30 target keywords | Free |
| ChatGPT (GPT-4o with Search) | LLM visibility test — local service queries | Free/Paid |
| Perplexity | AI search citations for local service queries | Free |
| Google Gemini | Gemini AI visibility test + source attribution | Free |
| Microsoft Copilot | Bing-powered AI visibility | Free |
| Google Rich Results Test | FAQPage, HowTo, Speakable schema validation | Free |
| Ahrefs | Featured snippet ownership — AIO pulls from snippets | Paid |
| SEMrush | Featured snippet tracking per target keyword | Paid |
| AlsoAsked.com | PAA question mapping — AIO uses PAA patterns | Freemium |
| Otterly.ai | Track AIO mentions over time (monitoring) | Paid |
AI search is no longer emerging — it IS the mainstream:
For local businesses, AI search drives both:
Read {AUDIT_DIR}/intake-data.md — business name, URL, services, location.
Read {AUDIT_DIR}/competitor-profiles.md — competitor AI visibility signals.
Read {AUDIT_DIR}/onpage-findings.md — content structure gaps.
Test top 20-30 target keywords in Google Search (incognito, desktop + mobile):
For each query, document:
| Query | AIO Shows? | Client Cited? | Competitors Cited? | Source URLs Used | AIO Format |
|---|---|---|---|---|---|
| [query] | Yes/No | Yes/No | [names] | [URLs] | Paragraph/List/Table/Steps |
To be cited in AI Overviews, content needs:
Google AI Mode (launched 2024, expanding 2025) enables fully conversational search.
| Query | Client Mentioned? | Accuracy | Competitor Priority | Source |
|---|
For each platform × query type:
Platforms to test:
Query types to test:
Best [service] in [city][Service] near [neighborhood/landmark][Business Name] — accuracy check[Business Name] reviews — reputationHow to find a good [service type] in [city][Specific service] cost in [city]For each result document:
AEO focuses on structuring content so AI can extract and use it as answers.
For each key service page and FAQ page:
Since AI Overviews pull from featured snippets:
GEO focuses on making content easily citable by AI systems.
Specific, quotable statements with data? Good: "Chicago homeowners saved an average of $2,400/year after our insulation upgrade" Bad: "We help homeowners save money on energy bills"
Original research, surveys, or case studies?
Named expert quotes with credentials?
Step-by-step instructions with clear numbered format?
Comparison tables with specific data?
Local statistics about the service area?
Industry benchmarks stated clearly?
AI systems cite authoritative sources. Check:
AI systems understand entities better than raw keyword text:
Schema types most valuable for AI citation:
| Schema Type | AI Benefit |
|---|---|
| LocalBusiness | Business entity establishment |
| FAQPage | Direct Q&A extraction |
| HowTo | Step-by-step answer extraction |
| Article + Author | E-E-A-T, content authority |
| AggregateRating | Trust signal for AI recommendations |
| Speakable | Marks content optimized for voice/AI reading |
| Service | Clear service entity definition |
| Review | Specific review content citation |
Validate all schema at: search.google.com/test/rich-results
| Platform | Client Score | Comp 1 | Comp 2 | Comp 3 |
|---|---|---|---|---|
| Google AIO | ||||
| ChatGPT Search | ||||
| Perplexity | ||||
| Gemini | ||||
| Copilot | ||||
| Overall AI Score |
Findings: Which competitor dominates AI visibility and why? What content do they have that the client lacks?
| Action | Impact (1–5) | Feasibility (1–5) | Priority | Effort |
|---|---|---|---|---|
| Add FAQPage schema to all service pages | 5 | 5 | 25 | 30 min/page |
| Rewrite service page intros (direct 50-word answer) | 5 | 4 | 20 | 30–60 min/page |
| Add specific data points (costs, timelines) to pages | 5 | 4 | 20 | 30–60 min/page |
| Create dedicated FAQ page (conversational queries) | 4 | 5 | 20 | 2–4 hrs |
| Add question-format H2s to top 5 service pages | 4 | 4 | 16 | 30 min/page |
| Add HowTo schema to process pages | 4 | 5 | 20 | 30 min/page |
| Publish original research / local statistics page | 5 | 3 | 15 | 4–8 hrs |
| Build E-E-A-T author profiles (credentials, bios) | 4 | 4 | 16 | 2–4 hrs |
| Refresh pages > 30 days old (freshness = AIO boost) | 4 | 5 | 20 | 30–60 min/page |
| Add Speakable schema to key answer sections | 3 | 4 | 12 | 30 min |
| Category | Weight |
|---|---|
| AI platform visibility (AIO + ChatGPT + Perplexity + Gemini + Copilot) | 20% |
| AI citability scoring (passage-level extractability) | 20% |
| AI crawler access (Tier 1–3 crawlers + llms.txt) | 15% |
| AEO content structure (answer blocks, FAQs, featured snippets) | 15% |
| Platform-specific optimization readiness | 15% |
| Schema for AI readability (FAQPage, Speakable, knowsAbout) | 15% |
Write complete findings to {AUDIT_DIR}/ai-seo-findings.md with YAML frontmatter:
---
skill: ai-visibility/ai-seo
phase: 14
date: [YYYY-MM-DD]
business: [Business Name]
url: [URL]
score: [X/100]
aio_cited: [yes|no|partial]
chatgpt_cited: [yes|no]
perplexity_cited: [yes|no]
faqpage_schema: [yes|no|partial]
aio_keywords_tested: [X]
aio_keywords_cited: [X]
---
Include:
Output files:
{AUDIT_DIR}/ai-seo-findings.md — AI visibility audit with score and citation gap analysis{REPORTS_DIR}/phase-14-ai-seo.pdf — auto-generated PDF after phase completesKey consumers:
ai-visibility/voice-search — voice and AIO share optimization signalsaudit/onpage-seo — AIO structure requirements inform on-page optimizationcross-cutting/serp-trust-auditor — Trust & AI Readiness (T2) dimensionoutput/report-generation — AI visibility section in master reportAI models cite passages that meet specific structural criteria. GEO-optimized content achieves 30–115% higher visibility in AI-generated responses (Georgia Tech / Princeton / IIT Delhi 2024). Optimal AI-cited passages are 134–167 words, self-contained, fact-rich, and answer-first.
| Category | Weight | What It Measures | Scoring |
|---|---|---|---|
| Answer Block Quality | 30% | Does the passage open with a direct, quotable answer? Uses "X is..." or answer-first patterns? First 40–60 words stand alone? | 90+: every section opens with 1–2 sentence answer; 50–69: answers buried mid-paragraph; <30: no extractable answers |
| Self-Containment | 25% | Can the passage be extracted without context? No pronoun dependencies? Named subject in every block? | 90+: 80%+ blocks fully standalone; 50–69: mixed pronoun/explicit use; <30: continuous narrative, extraction breaks meaning |
| Structural Readability | 20% | H1>H2>H3 hierarchy? Question-based headings? Short paragraphs (2–4 sentences)? Tables for comparisons? Lists for processes? | 90+: clean hierarchy + question headings + tables/lists; 50–69: some structure; <30: wall-of-text |
| Statistical Density | 15% | Specific stats per 500 words? Named sources? Exact numbers (not "many" or "several")? | 90+: 5+ stats/500w, all sourced; 50–69: 1–2 stats/500w; <30: no statistics, vague quantifiers |
| Uniqueness & Original Data | 10% | First-party research? Proprietary data? Case studies with specific outcomes? | 90+: original surveys/research; 50–69: synthesized with some original commentary; <30: entirely derivative |
Page Citability Score = weighted average of all content block scores.
HIGH citability (score ~85):
Content delivery networks (CDNs) are distributed server systems that cache and serve web content from locations geographically close to end users. A CDN reduces latency by 50–70% on average by serving assets from edge servers rather than a single origin server. The three largest CDN providers as of 2025 are Cloudflare (serving approximately 20% of all websites), Amazon CloudFront, and Akamai Technologies. — 58 words. Self-contained: yes. Facts: 3 specific data points. Definition pattern: yes.
LOW citability (score ~15):
If you've ever wondered why some websites load faster than others, the answer might surprise you. There's this amazing technology that has been around for a while now. It's changed the way we think about web performance. Let me explain how it works. — 52 words. Self-contained: no (no topic named). Facts: 0. Definition pattern: no.
| Finding | Source |
|---|---|
| Optimal AI-cited passage length: 134–167 words | Bortolato 2025 analysis of AI Overview passages |
| Definition patterns increase citation rate by 2.1× | Georgia Tech 2024 |
| Adding statistics increases citation by 40% | Princeton GEO study 2024 |
| Adding authority quotations increases citation by 115% in some categories | IIT Delhi 2024 |
| Fluency optimization increases visibility by 30% average | Georgia Tech 2024 |
| Content with source citations cited 20–25% more often by Perplexity/ChatGPT | Industry data 2025 |
| AI System | Citation Preference |
|---|---|
| Google AI Overviews | Concise answer blocks (40–60 words). Content already ranking in top 10. Structured formatting (tables, lists). |
| ChatGPT (Search) | Explicit definitions, named sources, recent dates. Cites 2–4 sources per response. Wikipedia sourced 47.9% of citations. |
| Perplexity | Fact-dense passages with statistics. Cites 4–8 sources per response. Values recency highly. Reddit sourced 46.7% of citations. |
| Gemini | Multi-modal (text + images + video). YouTube content weighted heavily. Uses Knowledge Graph for entity grounding. |
| Copilot (Bing) | Passages from high-authority domains with clear factual claims. IndexNow for fast indexation. |
For each key page:
AI crawlers must be able to access your site — blocked crawlers = invisible to that platform regardless of content quality. Over 35% of top 1,000 websites block at least one major AI crawler (Originality.ai 2025).
| Crawler | Operator | User-Agent | Purpose | Impact if Blocked |
|---|---|---|---|---|
| GPTBot | OpenAI | GPTBot | ChatGPT web browsing + search. Content may be used for model improvement. | Invisible in ChatGPT Search (900M+ weekly users) |
| OAI-SearchBot | OpenAI | OAI-SearchBot | ChatGPT search only — NOT used for training. | Not in ChatGPT search results even if GPTBot allowed |
| ChatGPT-User | OpenAI | ChatGPT-User | User-initiated URL visits ("read this page for me") | Users can't access your content via ChatGPT |
| ClaudeBot | Anthropic | ClaudeBot | Claude web search, citations, analysis | Invisible to Claude's web search |
| PerplexityBot | Perplexity | PerplexityBot | Perplexity AI search — always displays source links (best referral traffic of AI search products) | No Perplexity citations |
| Crawler | Operator | User-Agent | Purpose | Notes |
|---|---|---|---|---|
| Google-Extended | Google-Extended | Gemini training + AI Overviews improvement. Blocking does NOT affect Google Search ranking or AIO appearance (controlled by Googlebot). | Allow unless philosophical objection to training data usage | |
| GoogleOther | GoogleOther | Non-search-ranking purposes — research, experimental features | Low risk, moderate benefit | |
| Applebot-Extended | Apple | Applebot-Extended | Apple Intelligence features, Siri AI (2B+ active Apple devices) | Growing strategic value |
| Amazonbot | Amazon | Amazonbot | Alexa answers, Amazon AI features | Relevant for voice search |
| FacebookBot | Meta | FacebookBot | Meta AI assistant (3B+ combined app users) | Growing importance |
| Crawler | Operator | Recommendation | Reason |
|---|---|---|---|
| CCBot | Common Crawl | Context-dependent | Training data for many AI companies. No live search impact. |
| anthropic-ai | Anthropic | Context-dependent | Claude model training only (separate from ClaudeBot live features) |
| Bytespider | ByteDance | BLOCK for most Western businesses | Aggressive crawling, minimal benefit outside Asian markets |
| cohere-ai | Cohere | Context-dependent | Enterprise AI training, low consumer impact |
<meta name="robots" content="noai">, bot-specific noindexX-Robots-Tag: noai, X-Robots-Tag: GPTBot: noindex/llms.txt, /.well-known/ai-plugin.json, /ai.txt# Tier 1 — AI Search (ALLOW)
User-agent: GPTBot
Allow: /
User-agent: OAI-SearchBot
Allow: /
User-agent: ChatGPT-User
Allow: /
User-agent: ClaudeBot
Allow: /
User-agent: PerplexityBot
Allow: /
# Tier 2 — AI Ecosystem (ALLOW)
User-agent: Google-Extended
Allow: /
User-agent: Applebot-Extended
Allow: /
User-agent: Amazonbot
Allow: /
User-agent: FacebookBot
Allow: /
# Tier 3 — Training Only (BLOCK aggressive)
User-agent: Bytespider
Disallow: /
| Component | Weight |
|---|---|
| Tier 1 crawlers allowed (5 bots) | 50% |
| Tier 2 crawlers allowed (5 bots) | 25% |
No blanket AI blocks (no noai meta, no User-agent: * Disallow: /) | 15% |
| AI-specific files present (llms.txt, sitemap accessible) | 10% |
The llms.txt standard (proposed by Jeremy Howard, Sept 2024) is an emerging file at /llms.txt that helps AI systems understand your site's purpose, structure, and key content. Analogous to robots.txt telling crawlers what NOT to access, llms.txt tells AI systems what IS most useful. As of early 2026, fewer than 5% of websites have llms.txt — early adopter advantage.
# [Business Name]
> [One sentence: what the business does, who it serves. Under 200 chars.]
## Services
- [Service Page Title](https://domain.com/service): Description of service, key facts.
## Resources
- [Guide Title](https://domain.com/guide): What this guide covers and why it matters.
## Key Facts
- Founded in [year] by [name]
- Headquarters: [City, State]
- [X] customers/clients served in [area]
- Services: [Service A], [Service B], [Service C]
## Contact
- Website: https://domain.com
- Phone: [phone]
- Email: [email]
- Address: [full address]
| Element | Check | Severity if Missing |
|---|---|---|
| H1 Title (matches business name) | Present? | Critical |
| Blockquote description (<200 chars) | Present? | High |
| At least one H2 section | Present? | Critical |
| 10–30 page entries with absolute URLs | Present? | High |
| Descriptions after colon (10–30 words each) | Present? | Medium |
| Key Facts section with business data | Present? | Medium |
| Contact section with email + phone | Present? | Low |
| All listed URLs return 200 | Validated? | Medium |
| llms-full.txt (extended version) | Present? | Low (nice-to-have) |
| Dimension | Weight |
|---|---|
| Completeness (covers all major site sections, 10–30 pages) | 40% |
| Accuracy (descriptions match page content, URLs valid) | 35% |
| Usefulness (AI could understand the business from this file alone) | 25% |
llms.txt fileOnly 11% of domains are cited by BOTH ChatGPT and Google AI Overviews for the same query (Terakeet 2025). Each platform has different indexes, ranking logic, and source preferences. Platform-specific optimization is required.
Top actions: Question-based H2/H3 headings, direct answer in first paragraph (40–60 words), comparison tables, ordered/unordered lists, FAQ sections, statistics with sources, visible publication + updated dates, author byline with credentials.
Top actions: Wikipedia/Wikidata entity, Bing Webmaster Tools registration, Reddit brand presence, authoritative .edu/.gov backlinks, consistent entity information across platforms, comprehensive long-form content.
Top actions: Active Reddit presence in relevant subreddits, original research/data, discussion-friendly content, YouTube content with transcripts, freshness signals (visible dates, regular updates), cross-source claim validation.
Top actions: YouTube channel with relevant content + chapters/timestamps, Google Knowledge Panel, GBP completion, comprehensive Schema.org markup, Google ecosystem presence (Scholar, News, Maps), image optimization.
Top actions: Bing Webmaster Tools verification, IndexNow implementation, LinkedIn company page, optimized meta descriptions, social engagement signals, fast page load (<2s), exact-match keywords in titles/headings.
| Priority | Google AIO | ChatGPT | Perplexity | Gemini | Copilot |
|---|---|---|---|---|---|
| #1 | Top-10 ranking | Wikipedia | Reddit presence | YouTube | IndexNow |
| #2 | Q&A structure | Entity graph | Original research | Knowledge Panel | Bing WMT |
| #3 | Tables/lists | Bing SEO | Freshness | Schema.org | |
| #4 | Featured snippets | Community forums | GBP | Meta descriptions |
| Schema Type | AIO Citation Boost | Implementation | Effort |
|---|---|---|---|
| FAQPage | 3.2× citation rate | Add to all service + location pages (5 FAQ min) | 30 min/page |
| HowTo | 2.1× for process queries | Add to any "how to" or process service pages | 45 min/page |
| Organization + sameAs + knowsAbout | Entity confidence boost + topical expertise signal | Homepage only; include 7+ sameAs URLs + 5–7 knowsAbout topics | 30 min |
| AggregateRating | +17–25% branded CTR | All pages with real review data | 30 min |
| LocalBusiness subtype | Local AIO eligibility | Homepage + key service pages | 30 min |
| BreadcrumbList | Navigation context for AI | Sitewide (via CMS plugin often) | 1 hr |
| Speakable | Marks content for voice/AI assistant extraction | Add to Article/WebPage schema with CSS selectors pointing to key answer sections | 15 min/page |
| Platform | Local Coverage | Citation Criteria | Update Frequency |
|---|---|---|---|
| Google AI Overviews | 20–35% of local queries | E-E-A-T + schema + topical authority | Real-time (Gemini-based) |
| ChatGPT (web search) | 15–25% of local queries | Freshness (76.4% cite <30-day pages), Bing index | Weekly crawl |
| Perplexity AI | 10–20% of local queries | Source authority + content depth + Reddit presence | Real-time |
| Google Gemini (assistant) | Growing rapidly | Knowledge Graph entity + YouTube | Real-time |
| Bing Copilot | 5–15% of local queries | Bing indexation + content trust + IndexNow | Daily crawl |
| Metric | Value | Source |
|---|---|---|
| GEO services market (2025) | $850M–$886M | Yahoo Finance / Superlines |
| Projected GEO market (2031) | $7.3B (34% CAGR) | Industry analysts |
| AI-referred sessions growth | +527% (Jan–May 2025) | SparkToro |
| AI traffic conversion vs organic | 4.4× higher | Industry data |
| Google AI Overviews reach | 1.5B users/month, 200+ countries | |
| ChatGPT weekly active users | 900M+ | OpenAI |
| Perplexity monthly queries | 500M+ | Perplexity |
| Gartner: search traffic drop by 2028 | −50% | Gartner |
| Brand mentions vs backlinks for AI | 3× stronger correlation | Ahrefs Dec 2025 |
| Marketers investing in GEO | Only 23% | Industry surveys |
GBP-driven AI Overviews for local queries: "best [service] near me" queries now show AIO with local pack. To appear:
INP for AI search: Googlebot renders pages before indexing for AIO inclusion. Pages with INP >500ms may have content not fully extracted for AI citations. Target INP <200ms on all schema-rich pages.