Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By mistakeknot
Agent trust scoring — reputation tracking, severity-weighted decay, and suppression candidates.
npx claudepluginhub mistakeknot/interagency-marketplace --plugin intertrustAgent trust scoring for Claude Code. Tracks which review agents produce useful findings and which waste tokens.
When interflux dispatches review agents, some consistently produce findings you act on — and some produce noise you dismiss. Intertrust closes the feedback loop: it records each accept/dismiss decision, computes a trust score per agent, and feeds that score back into dispatch priority so the good agents run first and the noisy ones get deprioritized.
The scoring algorithm uses severity-weighted time decay: a P0 finding accepted yesterday counts more than a P3 finding dismissed last month. Scores blend project-specific data with global data, so a new project inherits the agent's cross-project reputation until enough local data accumulates.
Intertrust was extracted from the interspect profiler to maintain single-responsibility: interspect handles evidence collection and routing overrides; intertrust handles reputation and trust.
First, add the interagency marketplace (one-time setup):
/plugin marketplace add mistakeknot/interagency-marketplace
Then install:
/plugin install intertrust
Check which agents are earning trust and which are candidates for suppression:
/trust-status
AGENT PROJECT TRUST ACCEPTED DISCARD REVIEWS
fd-safety my-project 0.92 18 2 20
fd-correctness my-project 0.85 12 3 15
fd-game-design my-project 0.15 1 12 13 <!>
Agents with trust < 0.30 are flagged with <!> as suppression candidates. These agents consistently produce findings that nobody acts on.
For a specific agent:
/trust-status fd-safety
Score range: 0.05 (floor) to 1.0 (ceiling).
Inputs: Every time you resolve a review finding (via /clavain:resolve), the outcome is recorded:
Severity weighting: A P0 finding counts 4x, P1 counts 2x, P2 counts 1x, P3 counts 0.5x. Catching a real security issue (P0, accepted) boosts trust much more than flagging a style nit (P3, accepted).
Time decay: Half-life of ~30 days. Recent outcomes matter more than old ones. An agent that improved its prompts last week shouldn't be penalized for noise it generated two months ago.
Project/global blending: New projects inherit the agent's global reputation until enough local data accumulates (blend weight reaches 1.0 at 20 local reviews).
Integration: interflux multiplies each agent's triage score by its trust score at dispatch time. High-trust agents get dispatched first. Low-trust agents may not get dispatched at all if the token budget is tight.
Trust scoring is progressive enhancement — it never blocks workflows. If intertrust is not installed, all agents get a neutral trust score of 1.0.
Trust data lives in the shared .interspect/interspect.db SQLite database (the trust_feedback table). The library is self-contained with no dependency on the interspect plugin — it creates its own table if needed.
intertrust/
├── .claude-plugin/plugin.json # Plugin manifest
├── hooks/
│ └── lib-trust.sh # Trust scoring library (233 lines)
├── commands/
│ └── trust-status.md # /trust-status command
└── tests/
└── test_trust_scoring.sh # 11 tests
Share bugs, ideas, or general feedback.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
AGT governance hooks and MCP tools for Claude Code sessions
Trace analysis and context remediation for AI agents
Check how well your repo supports AI coding agents.
One 👎 becomes a hard rule the agent cannot bypass. Captures thumbs-down feedback, distills it into PreToolUse Pre-Action Checks, enforced across every future Claude Code session.
Intelligent orchestration platform for AI coding tools — routes tasks to the best model, learns from outcomes, and enforces quality through multi-model consensus. 42 MCP tools for agent management, research, memory, consensus voting, codebase intelligence, and a full dev pipeline.
DashClaw governance, integration, troubleshooting, and platform intelligence for Claude Code.
[DEPRECATED — use intervoice] Analyze your writing style and adapt Claude's output to sound like you. Replaced by intervoice, which reads one global multi-register profile instead of per-project glob-routed files.
Recursive AGENTS.md generator with integrated Oracle critique, CLAUDE.md harmonization, incremental updates, diff previews, and smart monorepo scoping. Cross-AI compatible.
Self-improving agent rig: codifies product and engineering discipline into composable workflows from brainstorm to ship. Compounds knowledge, generates domain agents, monitors its own docs, and surfaces conservative update drift. Orchestrates Claude, Codex, and Oracle through 6 agents, 52 commands, 19 skills, 0 MCP servers. Factory substrate: CXDB turn DAG, scenario bank with satisfaction scoring, evidence pipeline, agent capability policies. Companions: interspect, interphase, interline, interflux, interpath, interwatch, interslack, interform, intercraft, interdev, interpeer, intertest.
Token-efficient code reconnaissance for LLMs. Autonomous skills save 48-85% tokens via diff-context, semantic search, structural patterns, and symbol analysis. Includes MCP server for direct tool integration.
Token efficiency benchmarking, session analytics, and API-equivalent cost analysis for agent workflows
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claim