By yihui504
基于 LLM 的向量数据库自动化缺陷挖掘工具。支持 Milvus/Qdrant/Weaviate/PGVector 四种向量数据库,通过自然语言契约逆向工程从官方文档提取结构化约束,结合多 Agent 辩论机制在 Docker 沙箱中自动发现合规性缺陷。
TestVDB 缺陷挖掘流水线主编排器。协调全部 16 个 Agent 完成从战略情报采集到缺陷报告的全流程。
你是 TestVDB 的知识获取 Agent,负责从官方文档和在线资源中提取目标向量数据库的 API 信息、约束条件和版本数据。
将原始 API 知识文档转换为结构化的机器可读契约 JSON。
历史 Issue 挖掘 Agent — 爬取目标仓库的 Issues 和已合并 PR,构建原始缺陷语料库。
Bug Shape 提取 Agent — 对历史 Issues 三分类并提取根因模式和开发者认知边界。
TestVDB 结构化契约 JSON Schema 参考。当 Contract Formalizer Agent 或相关 Agent 需要了解契约格式时自动加载。
TestVDB 四型缺陷分类法参考。当 Judge 或 Attack Agent 需要判定缺陷类型时自动加载。
TestVDB Docker 容器模板参考。当 Executor Agent 需要启动目标向量数据库容器时自动加载。
TestVDB 缺陷挖掘流水线 SOP。当 Orchestrator 编排缺陷挖掘流水线时自动加载。
Matches all tools
Hooks run on every tool call, not just specific ones
Executes bash commands
Hook triggers when Bash tool is used
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Modifies files
Hook triggers on file write and edit operations
Modifies files
Hook triggers on file write and edit operations
Requires secrets
Needs API keys or credentials to function
Requires secrets
Needs API keys or credentials to function
Uses power tools
Uses Bash, Write, or Edit tools
Uses power tools
Uses Bash, Write, or Edit tools
Has parse errors
Some configuration could not be fully parsed
Has parse errors
Some configuration could not be fully parsed
English | 中文
Automated Defect Mining for Vector Databases
TestVDB is an LLM-powered Claude Code plugin that automatically discovers compliance defects in vector databases. It reverse-engineers structured contracts from official documentation, generates targeted attack scripts through multi-agent debate, executes them in Docker sandboxes, and produces verified defect reports with full evidence chains.
Currently supports Milvus, Qdrant, Weaviate, and pgvector.
The monolithic /testvdb:mine pipeline is now split into three independently-triggerable, intelligently-collaborating commands:
| Command | Stage | Output |
|---|---|---|
/testvdb:contract <db> <version> [--force] | Doc extraction + contract generation | structured_contract.json |
/testvdb:intel <db> [--max-issues N] [--max-commits N] [--force] | Intelligence gathering + threat modeling | threat_model.json |
/testvdb:mine <db> <version> [--intel | --contract] [...] | Attack mining (intelligently consumes intel/contract cache) | defects + reports |
Smart cache reuse (D-judgment) — scripts/check_cache.py decides whether to reuse cached intel/contract via four conditions: exists → TTL-fresh → valid → target/version match. Any miss → regenerate; all hit → pure mining (skip generation, save time).
--intel/--contract parameter control with C-boundary semantics:
| Cache state | --xxx false behavior |
|---|---|
| MISSING (no cache) | Error exit ("missing, run /testvdb:xxx first") |
| STALE / INVALID | Use existing + warning (no refresh) |
| USABLE | Use as-is |
This distinguishes "I have it but want the old one" from "I don't have it at all" — preventing silent mining without prerequisites.
End-to-end verified (CC 2.1.165): 5 agent types dispatched successfully with zero unknown — knowledge-extractor, contract-formalizer, issue-miner, bug-shape-extractor, threat-modeler.
scripts/hooks/pipeline_gate.py) validates three LLM shortcut symptoms at session end — (1) document analysis coverage below threshold, (2) unjustified fallback without documented reason, (3) pipeline phase not reaching DONE. Gate performs exact string matching (not fuzzy) — generic or placeholder URLs result in exit 2 interception.raw_knowledge.md → locate ## Document Sources table → copy URLs character-by-character._resolve_round_dir() correctly resolves timestamp_dir against project_root (pipeline v3 convention) with fallback to session_dir-relative paths.TESTVDB_GATE_ACTIVE_THRESHOLD (default 600s) and TESTVDB_DOC_COVERAGE_THRESHOLD (default 0.6) configurable via environment variables.pipeline_state.json v3 — phase-level checkpoint recovery across context compaction.ScheduleWakeup-driven cross-turn iteration; reconstruct_context.py rebuilds full pipeline context from disk state at each loop turn.safe_request() pattern — zero bare API calls.validate_api_format.py in Stage 1 debate.validate_target_neutrality.py ensures attack scripts don't leak DB-specific signatures (e.g. Qdrant port 6333 when target is Weaviate).reporter.md (defect reports) split from reporter-mre.md (MRE scripts).npx claudepluginhub yihui504/testvdb --plugin testvdbComplete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Unity Development Toolkit - Expert agents for scripting/refactoring/optimization, script templates, and Agent Skills for Unity C# development
Comprehensive .NET development skills for modern C#, ASP.NET, MAUI, Blazor, Aspire, EF Core, Native AOT, testing, security, performance optimization, CI/CD, and cloud-native applications
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.