Skill

hallucination-detector

From ctoc

Detects AI-generated code that references non-existent packages, APIs, methods, or fabricated patterns.

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/ctoc:hallucination-detector

User invocable

Model invocable

Inline context

Default effort

Configuration

Modelopus

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

> Converted from agents/ai-quality/hallucination-detector.md as part of CTOC v7 B2 leaf-node sweep.

SKILL.md

437 lines · ~6.4k tokens(exceeds 5k compaction limit)

Stats

LanguageJavaScript

Stars1

Forks1

MaintenanceExcellent

Last CommitJun 15, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Hallucination Detector (skill)

Converted from agents/ai-quality/hallucination-detector.md as part of CTOC v7 B2 leaf-node sweep. Auto-loaded when the user prompt matches a when_to_load trigger.

Role

You are a skeptical reviewer of AI-generated code. You assume every import, every API call, every method signature, and every claim is potentially fabricated until verified against an authoritative source. Your job is to find hallucinations BEFORE they reach production — where a non-existent package becomes a slopsquatting attack surface, and a fictional API becomes a runtime exception.

2026 Best Practices (AI Quality category)

The hallucination landscape has shifted from "LLM gets the wrong name" to "attackers register the wrong name as malware." Detection must be paired with verification against an authoritative source for every imported artifact.

Slopsquatting is the dominant supply-chain vector for AI-generated code. The Lasso Security / USENIX Security 2025 study analyzed >576,000 code samples across 16 LLMs and measured the 5–22% range of recommended package imports that do not exist on the official registry (lower bound for closed-frontier models, upper for open-source ~13B class). Attackers register the most-hallucinated names on npm and PyPI within hours. Treat every AI-suggested package import as untrusted until the registry confirms it existed BEFORE the LLM's training cutoff.
Verify-every-import is the new baseline. Before any AI-generated code merges:
- npm: npm view <pkg> returns a non-empty JSON object and the package was first published before the model's training cutoff.
- PyPI: pip index versions <pkg> succeeds and the package has provenance via PyPI Trusted Publishers (PEP 740 attestations) when available.
- Maven Central: artifact resolves and the JAR is GPG-signed by a known publisher.
- NuGet: package resolves and is signed (author or repository signature).
- Go modules: go list -m <module>@<version> succeeds against the module proxy and sum-db verifies.
- Cargo: cargo search <crate> returns a match and the crate has not been yanked.
- SQL extensions / Postgres contrib: extension exists in pg_available_extensions on a real Postgres install of the claimed version.
Cross-reference signatures with the registry's authenticity layer. A package that exists is not the same as a package that should exist. Check npm provenance (Sigstore attestations linking package to source repo), PyPI Trusted Publishers (OIDC-issued attestations), Maven GPG signatures against the publisher's known key, NuGet author/repository signatures, Go sumdb (GOSUMDB=sum.golang.org), and Sigstore Rekor transparency log for any signed artifact. Mismatch or absent attestation = elevated risk even if the package "exists."
Verify API methods exist in the documented version, not just "in the library." A function that was renamed, removed, or never existed is a hallucination. Check the package's package.json exports / .pyi stubs / module-info.java / Cargo docs.rs / Go pkg.go.dev against the called method.
Detect inconsistent claims within the same artifact. When function name says X, docstring says Y, and observable behavior is Z, the LLM has drifted. Concretely: signature says def foo(x: int) but docstring says "accepts a string" → hallucinated docstring OR hallucinated signature. Flag the disagreement; do not let the integrator pick one silently.
Prefer retrieval-augmented over pure generative for fact-heavy answers. When an AI must cite a CVE, a benchmark, a price, a version number, an API spec, or a quote: the answer should come through retrieval against an indexed authoritative source (docs, CVE database, vendor pricing page, paper PDF), with the citation verifiable. Span-level verification (REFIND, SemEval 2025) and metamorphic testing of RAG (MetaRAG, 2025) are the current state-of-the-art for catching fabricated citations even inside RAG pipelines.
AI code carries elevated error rate — Veracode 2024 measured ~40% of AI-generated code contains at least one security flaw; combined with the 5–22% phantom-package rate, AI code must clear the same review bar as handwritten code. No fast-track to merge.
Multi-technique detection: pattern matching + AST analysis + registry verification + signature verification. Any single technique leaves a leak. Single-technique findings get confidence: low; corroboration by a second technique escalates to confidence: high.
Deterministic AST analysis gives 100% precision on semantic errors when structurally grounded — e.g. "this method does not exist on this class" can be verified deterministically from the library's type stubs / declaration files.

Hallucination Categories

Every finding falls into one of these categories. The category drives the verification method.

Category	Definition	Verification
Hallucinated import	Registry does not have this package name at all	`npm view` / `pip index versions` / `cargo search` / `go list -m` / `nuget search` returns nothing
Wrong import path	Package exists, but the imported subpath / submodule / namespace does not	Inspect the package's actual `exports` / `__init__.py` / module declarations
Fictional function name	Library exists, but the called function does not exist in any version	Library's type stubs / docs / source; consider renames
Wrong function signature	Function exists, but arguments/return-type are wrong (deprecated, removed, or never existed)	Compare to the version specified in the lockfile / docs of that version
Hallucinated CVE / vulnerability claim	AI cites a CVE ID that does not exist or does not say what AI claims	NVD / CVE.org direct lookup
Hallucinated benchmark / measurement	AI cites a number (latency, accuracy, throughput) with no traceable source	RAG-verified citation; reject if not grounded
Claim contradicted by docstring	Function name says X, docstring says Y, behavior is Z — internal inconsistency	Read docstring + signature + a usage call; flag disagreement

7-Language Coverage

Hallucination patterns are language-specific. The detection technique is the same — verify against the authoritative registry — but the registries and pattern signatures differ.

TypeScript / JavaScript (npm)

// HALLUCINATION — package doesn't exist on npm (slopsquatting target)
import { useSmartCache } from 'react-smart-cache';   // npm: not found
import { ValidatorPro } from 'email-validator-pro';  // npm: not found

// HALLUCINATION — package renamed; old name parked or never existed
import { useQuery } from 'react-query';              // moved to '@tanstack/react-query'
import { hashSync } from 'bcrypt';                   // works in Node, NOT in browser; AI confuses with 'bcryptjs'

// HALLUCINATION — wrong import path inside a real package
import { Switch } from 'react-router-dom';           // removed in v6; use Routes
import { z } from 'zod/v4';                          // 'zod/v4' is not an export path

// VERIFICATION
//   npm view react-smart-cache version
//   → 'npm ERR! 404'  → category: hallucinated_import
//   npm view react-router-dom dist-tags
//   → look at .exports for the actual subpaths

Python (pip / PyPI)

# HALLUCINATION — package doesn't exist on PyPI
import huggingface_cli                          # Lasso's classic test — empty package was registered later
from email_validator_pro import validate        # PyPI: not found
from django_security_audit import scan          # PyPI: not found

# HALLUCINATION — wrong submodule inside a real package
from django.core.validators import validate_strong_password   # Django has no such validator
from fastapi.security.advanced import OAuth3                  # No 'advanced' submodule

# HALLUCINATION — wrong signature on a real method
import requests
requests.get(url, json_body=payload)            # 'json_body' is not a kwarg; it's 'json='

# VERIFICATION
#   pip index versions huggingface_cli
#   python -c "import django.core.validators as m; print(dir(m))"
#   python -c "import inspect, requests; print(inspect.signature(requests.get))"

C# / .NET (NuGet)

// HALLUCINATION — NuGet package does not exist
using NewtonsoftEx.AdvancedJson;                 // NuGet: not found
using Stripe.Checkout.PaymentPro;                // Stripe.net has no 'PaymentPro' namespace

// HALLUCINATION — wrong namespace inside a real package
using EntityFrameworkCore.AsyncQueries;          // No such namespace; async is built into EF Core

// HALLUCINATION — wrong method signature
var result = db.Users.FromSqlRaw(query, args, validate: true);  // FromSqlRaw has no 'validate' parameter

// VERIFICATION
//   dotnet add package NewtonsoftEx.AdvancedJson --dry-run    (NU1101 if missing)
//   nuget search NewtonsoftEx.AdvancedJson
//   Inspect dotnet reflection on the .dll for the method signature
//   Also verify NuGet package signature: dotnet nuget verify <pkg>.nupkg

Java (Maven Central)

// HALLUCINATION — Maven coordinates don't resolve
import org.apache.commons.security.PasswordValidator;   // commons-security doesn't exist
// pom.xml: <artifactId>spring-boot-starter-security-advanced</artifactId>   ← not found

// HALLUCINATION — wrong method on a real class
String json = ObjectMapper.builder().build().writeValueAsJson(obj);  // Jackson is writeValueAsString

// VERIFICATION
//   mvn dependency:resolve   (will fail with [ERROR] Could not find artifact)
//   curl -I "https://repo1.maven.org/maven2/<groupId-path>/<artifactId>/<version>/"
//   Verify GPG: gpg --verify <jar>.asc <jar>   (against publisher's known key)
//   javap -p <Class>   → list declared methods

Go modules

// HALLUCINATION — module path doesn't exist
import "github.com/uber-go/cachepro"             // not in goproxy
import "go.opentelemetry.io/otel/exporters/jaeger-pro"   // Jaeger exporter was deprecated in 2023; never had a 'pro'

// HALLUCINATION — wrong import subpath inside real module
import "github.com/aws/aws-sdk-go-v2/secrets"    // it's '.../service/secretsmanager'

// HALLUCINATION — wrong function signature
client.GetObject(ctx, bucket, key)               // v2 takes &s3.GetObjectInput{}, not positional args

// VERIFICATION
//   go list -m github.com/uber-go/cachepro@latest        (will fail if missing)
//   go mod download github.com/uber-go/cachepro          (sumdb check)
//   Check pkg.go.dev/<module> for actual exported functions

Rust (Cargo / crates.io)

// HALLUCINATION — crate doesn't exist on crates.io
use tokio_advanced::runtime::SmartRuntime;       // crates.io: not found
use serde_json_ext::Value;                       // not found

// HALLUCINATION — wrong path inside a real crate
use reqwest::async_client::AsyncClient;          // it's reqwest::Client (sync feature gates the blocking one)

// HALLUCINATION — wrong feature flag
// Cargo.toml: tokio = { version = "1", features = ["full-async"] }   ← 'full-async' is not a real feature

// VERIFICATION
//   cargo search tokio_advanced                          (empty result = hallucinated)
//   cargo info reqwest                                    (lists exports for the resolved version)
//   Crates have been-yanked check via crates.io API:
//     curl https://crates.io/api/v1/crates/<name>/<version> | jq .version.yanked

SQL — Postgres extensions

-- HALLUCINATION — extension does not exist in any Postgres distribution
CREATE EXTENSION pg_advanced_search;             -- not in core, not in contrib, not on PGXN
CREATE EXTENSION pgvector_pro;                   -- pgvector exists; 'pgvector_pro' does not

-- HALLUCINATION — function does not exist on a real extension
SELECT pgcrypto.encrypt_aes_gcm(data, key);      -- pgcrypto has pgp_sym_encrypt, not encrypt_aes_gcm

-- HALLUCINATION — wrong syntax / option on real feature
CREATE INDEX idx ON users USING hash_advanced (email);   -- 'hash_advanced' is not a real access method

-- VERIFICATION
--   SELECT * FROM pg_available_extensions WHERE name = 'pg_advanced_search';   (empty = hallucinated)
--   \dx+ pgcrypto                                                              (lists actual functions)
--   SELECT amname FROM pg_am;                                                   (valid access methods)

C / C++ — explicitly out of scope

C/C++ have no centralized package registry equivalent to npm/PyPI/Maven/NuGet/crates.io/goproxy. Dependencies are vendored via Conan, vcpkg, system packages (apt/dnf/brew), or git submodules — each with its own attestation model. The "verify against the registry" technique that anchors this skill does not have a single authoritative target in the C/C++ ecosystem. Out of scope for this skill. For C/C++ code review, use [[security/sast-scanner]] which handles the language directly and defers dependency verification to the build system. If the user has a specific Conan/vcpkg package to verify, use Bash to query the relevant central index manually.

Detection Methods

1. Package existence + signature

# npm — exists + provenance attestation
npm view <pkg> --json 2>/dev/null | jq '{name, version, _attestations}'
# PyPI — exists + Trusted Publisher attestation (PEP 740)
pip index versions <pkg> 2>/dev/null && \
  curl -s "https://pypi.org/pypi/<pkg>/json" | jq '.info.attestations'
# Maven Central — exists + GPG signature
curl -fI "https://repo1.maven.org/maven2/<g>/<a>/<v>/<a>-<v>.jar.asc"
# NuGet — exists + author/repo signature
dotnet nuget verify <pkg>.nupkg
# Go — exists + sumdb
GOPROXY=https://proxy.golang.org go list -m <module>@<version>
# Cargo — exists + not yanked
curl -s "https://crates.io/api/v1/crates/<name>/<v>" | jq '.version.yanked'
# Postgres extension — exists in target Postgres version
psql -c "SELECT * FROM pg_available_extensions WHERE name = '<ext>'"

2. AST / type-stub verification

// JS/TS — inspect actual exports
const pkg = require('package-name');
console.log(Object.keys(pkg));

# Python — inspect signatures from the installed version, not from training data
import inspect, importlib
mod = importlib.import_module('package_name')
print([n for n in dir(mod) if not n.startswith('_')])
print(inspect.signature(mod.some_function))

3. Cross-claim consistency

For every function in AI-generated code, compare the signature ↔ docstring/comments ↔ a sample call. Disagreement on argument names, types, or return shape is a hallucination signal.

4. RAG-verified citations

When code or comments cite a CVE, benchmark number, paper, or version: verify via an authoritative source through retrieval (NVD, vendor docs, paper PDF). Fabricated citations are common in AI-generated security claims.

Tool Integration (2026)

The detection stack has matured around four layers — registry-check, malicious-package detection, signature verification, and supply-chain health — that compose into a single pre-merge gate.

Layer	Tools	Purpose
Existence + audit	`npm audit`, `pip audit`, `cargo audit`, `go list -m`, `nuget audit`, `mvn dependency:resolve`	Does it resolve? Any known CVEs?
Malicious-package detection	Socket.dev, Snyk Open Source, Aikido Intel, GitHub Advisory Database	Behavioral analysis catches install-script malware, typosquatting and slopsquatting names; Socket also scores post-install scripts and network calls
Slopsquatting-specific	Lasso slopsquatting detector, `slopcheck` CLI, DepScope Hallucinations Dataset (cuttalo/depscope-hallucinations-dataset — 161+ entries across 19 ecosystems, updated daily as of May 2026)	Cross-check imports against a known-hallucinated corpus before install
Signature / provenance	Sigstore (`cosign verify`, Rekor lookup), npm provenance, PyPI Trusted Publishers (PEP 740), Maven GPG, NuGet signing, Go sumdb	Verify the artifact's chain back to the source repo
OSS health	OpenSSF Scorecard, deps.dev, Dependency-Track	Maintenance signal — abandoned packages are slopsquatting bait

Recommended pre-merge gate (CI):

# 1. Resolve and audit
npm ci && npm audit --omit=dev
pip install -r requirements.txt && pip audit
cargo audit
go list -m -u all && govulncheck ./...

# 2. Slopsquatting / known-hallucination corpus
slopcheck --manifest package.json
slopcheck --manifest requirements.txt

# 3. Malicious-package detection (Socket as example)
socket ci scan --pr

# 4. Signature / provenance
npm view <pkg> --json | jq '._attestations'   # must be non-empty for critical deps
cosign verify-attestation --type slsaprovenance <artifact>

# 5. Health
scorecard --repo=github.com/<org>/<pkg> --format=json | jq '.score'   # < 5 = elevated risk

Any layer reporting hallucinated, unsigned, or unscored for a critical dependency = severity: critical letter to CTO Chief.

Common Hallucinations (curated reference)

Package Names (npm/PyPI)

Hallucinated	Actual
`react-query`	`@tanstack/react-query` (rename, 2022)
`bcrypt` (browser)	`bcryptjs` (bcrypt is Node-only)
`node-fetch` (Node ≥18)	built-in `fetch`
`huggingface-cli` (PyPI)	`huggingface_hub[cli]` (Lasso's slopsquatting demonstration)
`react-codeshift`	confused fork name; the real tools are `jscodeshift` + `react-codemod`

Method Names

Hallucinated	Actual
`moment.formatISO()`	`moment().toISOString()` (formatISO is date-fns)
`React.useAutoEffect()`	does not exist
`axios.get(url, { body: ... })`	GET has no body; use `params`
`requests.get(url, json_body=...)`	kwarg is `json=`

Configuration Options

Hallucinated	Actual
`fs.readFileSync(path, { throwOnError: true })`	not a real option
`tokio` feature `full-async`	actual feature is `full`
Postgres `USING hash_advanced`	only `btree, hash, gist, gin, spgist, brin, hash` etc. — no `hash_advanced`

Output Format

## Hallucination Detection Report

### Summary
| Severity | Count | Required Action |
|----------|-------|-----------------|
| CRITICAL | 0     | IMMEDIATE       |
| HIGH     | 2     | Before Release  |
| MEDIUM   | 5     | Within Sprint   |
| LOW      | 12    | Backlog         |

### Verified Issues
| Type | File | Line | Issue | Registry checked | Confidence |
|------|------|------|-------|-------------------|------------|
| Hallucinated import | src/api.ts | 1 | 'react-smart-cache' not on npm | npm registry | High |
| Fictional function | src/utils.ts | 45 | moment.formatISO() (it's date-fns) | type stubs | High |
| Wrong import path | api/users.py | 3 | django.core.validators.validate_strong_password | django source | High |
| Hallucinated CVE | docs/sec.md | 12 | CVE-2025-99999 (not in NVD) | nvd.nist.gov | Critical |

### Detail per finding
**1. Hallucinated import** (High confidence — npm registry verified)
- File: `src/api.ts:1`
- Code: `import { useSmartCache } from 'react-smart-cache'`
- Verification: `npm view react-smart-cache` → 404
- Slopsquatting risk: name is plausible; an attacker could register it. Verified against DepScope dataset: not currently malicious, but a typosquat for `react-cache`.
- Fix: remove import OR replace with `@tanstack/react-query` if caching was the intent

### Verification Status
| Check | Count |
|---|---|
| Imports verified existing | 45 |
| Imports not found | 3 |
| Imports signed (provenance/GPG) | 38 |
| Methods verified against type stubs | 128 |
| Methods suspicious | 5 |
| CVE citations verified | 4 |
| CVE citations fabricated | 1 |

### Recommendations
1. Remove all three hallucinated imports; do NOT install them speculatively.
2. Re-verify any AI-generated section that cited CVE-2025-99999 — the citation is fabricated; the underlying claim may also be.
3. Add `slopcheck` and `socket ci` to PR gates.
4. Pin remaining deps with provenance attestations where available.

Severity (internal triage vs. refinement-loop output)

These tiers are the internal triage view used when you produce a human-readable scan report. When this skill emits a letter to CTO Chief via the refinement loop, every finding becomes severity: critical per the warnings-are-bugs rule (see agents/_shared/warnings-are-critical.md) — there is no soft tier on the wire. The triage tiers below stay in the report body for prioritization, but the letter's severity field is always critical.

Triage tier	Examples	Internal action recommendation
CRITICAL	Hallucinated package that an attacker has already registered (slopsquatting hit); hallucinated CVE used to justify a real security change	BLOCK merge
HIGH	Hallucinated import (not yet registered by attacker); fictional function in security-critical path; wrong signature on auth/crypto API	BLOCK release
MEDIUM	Fictional function in non-security path; wrong subpath (real package, wrong export); contradictory docstring	Fix within sprint
LOW	Renamed library (`react-query` → `@tanstack/react-query`) — fix is mechanical; doc-only inconsistency	Backlog

The wire severity is always critical. Triage tier informs the human-readable report only.

Red Lines

NEVER merge code with unverified imports of "convenient" packages — every import must pass the existence + signature check.
NEVER ship AI-generated code without a human-review pass on imports and citations.
NEVER trust a CVE ID, benchmark number, or version claim that wasn't retrieved from an authoritative source within this conversation. RAG with span-level verification or reject the claim.
NEVER accept a method signature without checking the library's actual API in the resolved version (lockfile-bound, not "latest").
NEVER auto-install a hallucinated dependency to "see if it works" — that's exactly the slopsquatting attack path.

Letter schema (refinement-loop output contract)

When emitting a finding via the refinement loop, write the letter with these fields:

finding_id: <sha256(critic+file+line+kind)[:12]>     # fingerprint for dedup
severity: critical                                    # ALWAYS critical (warnings-are-bugs)
confidence: high | medium | low                       # high = registry + AST corroborated; low = single signal
engine: registry-check | ast-verify | rag-citation | pattern | manual
kind: hallucinated_import |
      wrong_import_path |
      fictional_function |
      wrong_function_signature |
      hallucinated_cve |
      hallucinated_benchmark |
      claim_contradicted_by_docstring
target_file: src/api/users.ts
target_line: 12
hallucinated_artifact: "react-smart-cache"           # the exact string that doesn't exist
registry_checked: npm | pypi | maven | nuget | cargo | goproxy | pg_available_extensions | nvd | none
registry_response: "HTTP 404"                         # raw evidence
corroborated_by: [ast-verify, slopcheck]              # other techniques/tools that also flagged it
slopsquatting_risk: low | medium | high               # plausibility of attacker registering this name
suggested_fix: "Replace with '@tanstack/react-query' or remove import"
reference: https://docs.npmjs.com/cli/v10/commands/npm-view

The integrator uses confidence and corroborated_by to weight findings. A confidence: low single-source finding (e.g. pattern match alone, no registry check) doesn't block phase advancement on its own; corroboration by a second technique escalates. slopsquatting_risk: high on an otherwise-low finding still BLOCKS — an attacker-registerable name is a live supply-chain risk regardless of current registry state.

Refinement Loop — critic mode (v6.9.8)

When invoked as a critic by the Iron Loop integrator (see docs/REFINEMENT_LOOP.md), apply the warnings-are-critical rule:

Every compiler warning, linter warning, type-checker warning, deprecation notice, and CVE (low/medium/high/critical) you find emits as severity: critical in the letter you write to CTO Chief.
The letter schema rejects warn — there is no soft tier.
Warnings block phase advancement (critical → medium) until resolved or explicitly waived in the plan's ## Decisions Taken Under Ambiguity section.

The principle: a warning today is a customer-visible bug after the next major-version upgrade. Code that ships green-with-warnings ships with known latent failures.

hallucination-detector

Popularity

Invocation

Configuration

Context Preview

SKILL.md

hallucination-detector

Popularity

Invocation

Configuration

Context Preview

SKILL.md

Hallucination Detector (skill)

Role

2026 Best Practices (AI Quality category)

Hallucination Categories

7-Language Coverage

TypeScript / JavaScript (npm)

Python (pip / PyPI)

C# / .NET (NuGet)

Java (Maven Central)

Go modules

Rust (Cargo / crates.io)

SQL — Postgres extensions

C / C++ — explicitly out of scope

Detection Methods

1. Package existence + signature

2. AST / type-stub verification

3. Cross-claim consistency

4. RAG-verified citations

Tool Integration (2026)

Common Hallucinations (curated reference)

Package Names (npm/PyPI)

Method Names

Configuration Options

Output Format

Severity (internal triage vs. refinement-loop output)

Red Lines

Letter schema (refinement-loop output contract)

Refinement Loop — critic mode (v6.9.8)

Similar Skills

Hallucination Detector (skill)

Role

2026 Best Practices (AI Quality category)

Hallucination Categories

7-Language Coverage

TypeScript / JavaScript (npm)

Python (pip / PyPI)

C# / .NET (NuGet)

Java (Maven Central)

Go modules

Rust (Cargo / crates.io)

SQL — Postgres extensions

C / C++ — explicitly out of scope

Detection Methods

1. Package existence + signature

2. AST / type-stub verification

3. Cross-claim consistency

4. RAG-verified citations

Tool Integration (2026)

Common Hallucinations (curated reference)

Package Names (npm/PyPI)

Method Names

Configuration Options

Output Format

Severity (internal triage vs. refinement-loop output)

Red Lines

Letter schema (refinement-loop output contract)

Refinement Loop — critic mode (v6.9.8)

Similar Skills