Unified scoring framework with weighted categories, severity levels, pass/fail thresholds, and auto-fail conditions. Use when computing or interpreting audit scores.
From ai-readinessnpx claudepluginhub bailejl/dev-plugins --plugin ai-readinessThis skill uses the workspace's default tool permissions.
Searches prompts.chat for AI prompt templates by keyword or category, retrieves by ID with variable handling, and improves prompts via AI. Use for discovering or enhancing prompts.
Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
This skill defines the unified scoring methodology used across all ai-readiness audit commands. Reference this when computing scores, interpreting results, or explaining thresholds.
Every audit command uses weighted category scoring:
| Raw Score Range | Condition |
|---|---|
| 100 | No violations found |
| 80–99 | Minor violations only |
| 60–79 | One or more Major violations |
| 40–59 | Critical violations present |
| 0–39 | Multiple Critical violations or systemic failure |
| Level | Symbol | Description | Score Impact |
|---|---|---|---|
| Critical | 🔴 | Fundamentally broken, dangerous, or directly exploitable. Requires immediate action. | Caps category raw score at 59 max |
| Major | 🟠 | Significant quality/security issue affecting correctness or maintainability. | Caps category raw score at 79 max |
| Minor | 🟡 | Style, preference, or defense-in-depth issue. Low immediate risk. | Reduces raw score by 1–5 per finding |
| Rating | Symbol | Meaning |
|---|---|---|
| Critical | 🔴 | Systemic failure requiring immediate remediation |
| High | 🟠 | Significant issues degrading AI performance |
| Medium | 🟡 | Suboptimal patterns with measurable impact |
| Low | 🔵 | Minor issues, address opportunistically |
| Pass | ✅ | Section meets or exceeds expectations |
| Level | Weight Range | Description |
|---|---|---|
| 🔴 CRITICAL | 20–25 | Blocks elite DORA performance; immediate action required |
| 🟠 HIGH | 10–15 | Significant impediment to delivery; address within sprint |
| 🟡 MEDIUM | 5–10 | Suboptimal practice; schedule improvement work |
| 🟢 LOW | 1–5 | Minor issue; address opportunistically |
| Command | Pass Threshold | Rationale |
|---|---|---|
| code-review | ≥ 75 | Standard quality bar |
| architecture | ≥ 75 | Standard quality bar |
| security | ≥ 80 | Security requires higher bar due to outsized risk |
| testing | ≥ 75 | Standard quality bar |
| api-review | ≥ 75 | Standard quality bar |
Certain findings trigger an automatic FAIL regardless of the numeric score:
Category: Error Handling (Weight: 20%)
Finding 1: Empty catch block in auth.ts:45 → Major
Finding 2: Swallowed promise rejection in api.ts:102 → Major
Finding 3: Missing null check in utils.ts:23 → Minor
→ Worst severity: Major → Raw score capped at 79 max
→ Adjusted for volume: Raw score = 65
→ Weighted score = 65 × 0.20 = 13.0
Category: Naming (Weight: 10%)
Finding 1: Inconsistent casing in helpers/ → Minor
→ Raw score = 90
→ Weighted score = 90 × 0.10 = 9.0
Final score = sum of all weighted scores