Voice search and conversational AI optimization. Activates when discussing voice search, Siri, Alexa, Google Assistant, smart speakers, conversational queries, speakable content, "near me" optimization, or AI answer engines. Phase 18. Output: {AUDIT_DIR}/voice-findings.md
From local-seo-auditnpx claudepluginhub anthropics/claude-plugins-community --plugin local-seo-audit-systemThis skill uses the workspace's default tool permissions.
Records polished WebM UI demo videos of web apps using Playwright with cursor overlay, natural pacing, and three-phase scripting. Activates for demo, walkthrough, screen recording, or tutorial requests.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
Estimates prompt tokens and complexity to offer Brief, Standard, Detailed, or Exhaustive response options with projected token budgets before answering.
Voice search in 2025–2026 is no longer just smart speakers — it's the entire AI conversational interface layer. Google's voice-driven AI Mode, iOS 18 Siri with deep web integration, and AI Overview voice results on mobile collectively account for 27% of mobile searches (Google internal data, 2024). "Near me" searches have grown 200%+ over 5 years. Optimizing for voice = optimizing for AI answer extraction. Local businesses missing voice optimization lose 20–30% of mobile intent queries to competitors.
| Tool | Purpose | Cost |
|---|---|---|
| Google PageSpeed Insights | Mobile speed — voice results require > 70 mobile score | Free |
| Google Rich Results Test | FAQPage, HowTo, Speakable schema validation | Free |
| Google Search | Test "near me" + conversational queries — does AIO appear? | Free |
| ChatGPT | Test voice-style queries — is client mentioned? | Free/Paid |
| Perplexity | AI search visibility test for local queries | Free |
| Google Maps / GBP | Confirm GBP completeness for voice data extraction | Free |
| AlsoAsked.com | Question clusters for FAQ + voice query mapping | Freemium |
| AnswerThePublic | Conversational query patterns per service topic | Freemium |
| BrightLocal | Citation consistency check — NAP for voice accuracy | Paid |
Read {AUDIT_DIR}/intake-data.md — business name, URL, location, services.
Read {AUDIT_DIR}/local-findings.md — GBP completeness (critical for local voice queries).
Read {AUDIT_DIR}/speed-findings.md — mobile speed score (voice requirement: > 70/100 PSI).
Where voice queries originate:
| Platform | Query Type | Local Business Impact |
|---|---|---|
| Google Assistant (Android + mobile) | "Near me", "Open now", hours, directions | Critical — highest local volume |
| Siri (iOS 18+) | Business info, hours, services | High — uses Apple Maps + web |
| Google AI Mode | Conversational follow-up queries | Critical — replacing traditional SERP |
| AI Overviews (voice trigger) | "How to", "What is", "Best [service] in" | High — appears for 20–35% local queries |
| Alexa / Google Home | "Find a [service] near me", business hours | Medium — smart home user base |
| ChatGPT / Perplexity voice | "Best [service] in [city]" recommendations | Growing — AI-first searchers |
2025 voice search stats:
Voice assistants extract local business data directly from GBP. Missing or inaccurate GBP = wrong voice answers.
GBP voice readiness checklist:
| Check | Status | Fix | Effort |
|---|---|---|---|
| Business name 100% accurate (no keyword stuffing) | ✅/❌ | Edit GBP | 5 min |
| Full address including suite/unit | ✅/❌ | Edit GBP | 5 min |
| Phone number (primary click-to-call) | ✅/❌ | Edit GBP | 5 min |
| Hours: all 7 days + holidays + special hours | ✅/❌ | Edit GBP | 15 min |
| "Describe your business" (1,000 char description) | ✅/❌ | Write + edit GBP | 30 min |
| All services listed with accurate names | ✅/❌ | Add services | 30 min |
| Booking link configured (for "book a [service]" queries) | ✅/❌ | Add booking URL | 15 min |
| Q&A section with 5+ common questions answered | ✅/❌ | Add Q&As | 30–45 min |
| GBP verified and no pending flags | ✅/❌ | Verify via phone/video | 1–3 days |
Voice query simulation via GBP:
Map target voice queries across 4 intent categories:
Ranking factors for "near me" voice results:
FAQ quality checklist:
| Check | Standard | Status |
|---|---|---|
| FAQ exists (standalone page or per service) | Required | ✅/❌ |
| Questions in natural conversational language | Required | ✅/❌ |
| Questions mirror actual voice query phrasing | Required | ✅/❌ |
| Answers ≤ 40–50 words (ideal for TTS reading) | Target | ✅/❌ |
| FAQPage + Question + Answer schema implemented | Required | ✅/❌ |
| Answers start with direct response (no preamble) | Required | ✅/❌ |
Optimal FAQ answer format for voice:
Q: How much does [service] cost in [City]?
A: [Service] in [City] typically costs $X–$Y. [1-sentence context].
Call [Business] for a free quote: [phone].
(Under 50 words → voice-readable, AIO-citable, featured snippet-optimized)
For each target voice query:
SpeakableSpecification marks sections optimal for TTS reading:
{
"@type": "SpeakableSpecification",
"cssSelector": [".faq-answer", "h2 + p", ".direct-answer"]
}
Apply on: Homepage summary paragraph, all FAQ answers, key service descriptions.
Voice results have strict technical requirements:
| Requirement | Threshold | Check Method |
|---|---|---|
| Mobile PageSpeed | ≥ 70 PSI score | PageSpeed Insights → Mobile |
| LCP | < 2.5s | PageSpeed Insights → LCP |
| TTFB | < 800ms | WebPageTest |
| HTTPS | All pages | curl -I https://[domain] |
| Mobile-responsive | Full responsiveness at 375px | Chrome DevTools → Device toolbar |
| Tap targets | ≥ 44px (voice → tap follow-up) | Lighthouse → Tap Targets |
Schema for voice ranking:
| Schema Type | Voice Use Case | Priority |
|---|---|---|
LocalBusiness (complete) | Hours, phone, address, directions | Critical |
FAQPage | Q&A voice answers | Critical |
HowTo | Process/instructional queries | High |
Speakable | Marks voice-optimal content sections | Medium |
OpeningHoursSpecification | Hours queries | Critical |
ContactPoint | Phone/contact queries | High |
GeoCoordinates | "Near me" precision | High |
Test across all major AI platforms using voice-style queries:
Test protocol:
Query 1: "Who is the best [service] in [city]?"
Query 2: "Find me a [service provider] near [location]"
Query 3: "[Business Name] — what are their services and hours?"
Query 4: "How do I [most common service-related question]?"
| Platform | Query | Business Mentioned? | Accurate Info? | Source Cited? | Fix Needed |
|---|---|---|---|---|---|
| ChatGPT | Q1 | Yes/No | Yes/No/N.A. | [URL or N/A] | |
| Perplexity | Q1 | Yes/No | Yes/No/N.A. | [URL or N/A] | |
| Gemini | Q2 | Yes/No | Yes/No/N.A. | [URL or N/A] | |
| Google AIO | Q2 | Yes/No | Yes/No/N.A. | [URL or N/A] | |
| Siri | Q3 | Yes/No | Yes/No/N.A. | GBP |
If business not mentioned by ChatGPT/Perplexity: Entity signals too weak — prioritize entity audit (Phase 9) + AIO optimization (Phase 14).
| Competitor | GBP Complete? | FAQ Schema? | Mobile Score | Voice Mentions (ChatGPT) | Voice Mentions (Perplexity) |
|---|---|---|---|---|---|
| Client | ✅/❌ | ✅/❌ | [X] | Yes/No | Yes/No |
| Comp 1 | ✅/❌ | ✅/❌ | [X] | Yes/No | Yes/No |
| Comp 2 | ✅/❌ | ✅/❌ | [X] | Yes/No | Yes/No |
For top 20 voice queries:
| Voice Query | Current Position | Featured Snippet? | FAQPage Schema? | Voice Optimized? | Priority |
|---|---|---|---|---|---|
| [near me query] | [pos] | Yes/No | Yes/No | Yes/No | H/M/L |
| [business-specific] | [pos] | Yes/No | Yes/No | Yes/No | H/M/L |
| Action | Impact (1–5) | Feasibility (1–5) | Priority | Effort |
|---|---|---|---|---|
| Complete GBP Q&A section (5+ questions) | 5 | 5 | 25 | 30–45 min |
| Add FAQPage schema to all service pages | 5 | 5 | 25 | 30 min/page |
| Write 10 FAQ answers ≤ 50 words each | 5 | 4 | 20 | 1–2 hrs |
| Add Speakable schema to FAQ sections | 4 | 4 | 16 | 1–2 hrs |
| Fix mobile PSI score to ≥ 70 | 5 | 3 | 15 | 2–16 hrs |
| Add HowTo schema to process pages | 4 | 4 | 16 | 30 min/page |
| Improve GBP description to 1,000 chars | 3 | 5 | 15 | 30 min |
| Add GeoCoordinates schema | 3 | 5 | 15 | 15 min |
| Add Speakable to homepage summary | 3 | 4 | 12 | 15 min |
| Category | Weight | Score |
|---|---|---|
| GBP completeness for voice queries | 30% | /30 |
| FAQ content + FAQPage schema | 25% | /25 |
| Mobile page speed (PSI ≥ 70) | 20% | /20 |
| Featured snippet presence for top queries | 15% | /15 |
| Structured data completeness (full schema set) | 10% | /10 |
Write findings to {AUDIT_DIR}/voice-findings.md with YAML frontmatter:
---
skill: ai-visibility/voice-search
phase: 18
date: [YYYY-MM-DD]
business: [Business Name]
url: [URL]
score: [X/100]
gbp_complete: [yes|no|partial]
faqpage_schema: [yes|no|partial]
mobile_psi: [X]
featured_snippets: [X]
chatgpt_mentioned: [yes|no]
perplexity_mentioned: [yes|no]
---
Include:
Key consumers:
ai-visibility/ai-seo — voice and AIO share many optimization signalslocal/local-seo — GBP completeness feeds both voice + local packaudit/speed-optimization — mobile speed is voice prerequisitespeakable property on FAQ container. Effort: 1–2 hrs.{AUDIT_DIR}/voice-findings.md + save PDF report to {REPORTS_DIR}/phase-18-voice.pdf. Effort: 30 min.Voice platforms extract answers from fast-rendering pages. INP (Interaction to Next Paint, replaced FID March 2024) is a ranking signal that also affects voice eligibility:
High INP on FAQ pages is caused by: heavy JS frameworks blocking interaction, lazy-loaded FAQ accordions, and third-party chat widgets. Fix by deferring non-critical JS and using static HTML FAQ sections.
Voice assistants prefer cited entities with strong E-E-A-T signals:
| Signal | Voice Impact | Implementation |
|---|---|---|
Schema sameAs (GBP + LinkedIn + Wikipedia) | High | Entity schema on homepage |
| Author bio with credentials | Medium | On all FAQ pages |
| Business established date | Medium | foundingDate in LocalBusiness schema |
| Review count ≥50 on GBP | High | Review velocity strategy |
| GBP description 900–1,000 chars | Critical | Edit GBP description |
Write HTML report to {REPORTS_DIR}/phase-18-voice.html and convert to PDF via python3 scripts/generate_pdf.py --html {REPORTS_DIR}/phase-18-voice.html.
output/report-generation — voice score in master report section 18