By pskoett
Orchestrate AI-assisted coding sessions with upfront planning interviews that align on constraints and prevent scope drift, continuous context window monitoring to maintain output quality, and automated simplify/harden passes with security reviews on modified code, while capturing errors and knowledge gaps into a self-improving learning system that promotes insights into project-level agent instructions.
Monitors context window health by re-reading wave anchor artifacts and detecting drift signals. Spawnable by the context-surfing skill or standalone for periodic context health checks during long-running sessions. Read-only — inspects state but does not modify files.
Read-only security auditor that finds security and resilience gaps in modified files. Checks for input validation, error handling, injection vectors, auth/authz, secrets, data exposure, dependency risk, and race conditions. Reports findings with file, line, category, severity, attack vector, and specific fix. Use when auditing code changes for security hardening.
Captures learnings, errors, and corrections to .learnings/ files. Spawnable by other skills or at session end to log quality/security findings, user corrections, command failures, or knowledge gaps. Can write to .learnings/LEARNINGS.md, ERRORS.md, and FEATURE_REQUESTS.md.
Read-only auditor that finds unnecessary complexity in modified files. Checks for dead code, naming issues, control flow, API surface, over-abstraction, and consolidation opportunities. Reports findings with file, line, category, severity, and specific fix. Use when auditing code changes for simplification opportunities.
Read-only spec auditor that finds gaps between implementation and spec/plan. Checks for missing features, incorrect behavior, incomplete implementation, contract violations, test coverage, and acceptance criteria gaps. Reports findings with file, line, category, spec reference, and severity. Use when verifying implementation completeness against a plan or spec.
Implementation + audit loop using parallel agent teams with structured simplify, harden, and document passes. Spawns implementation agents to do the work, then audit agents to find complexity, security gaps, and spec deviations, then loops until code compiles cleanly, all tests pass, and auditors find zero issues or the loop cap is reached. Use when: implementing features from a spec or plan, hardening existing code, fixing a batch of issues, or any multi-file task that benefits from a build-verify-fix cycle.
Monitors context window health throughout a session and rides peak context quality for maximum output fidelity. Activates automatically after plan-interview and intent-framed-agent. Stays active through execution and hands off cleanly to simplify-and-harden and self-improvement when the wave completes naturally or exits via handoff. Use this skill whenever a multi-step agent task is underway and session continuity or context drift is a concern. Especially important for long-running tasks, complex refactors, or any work where degraded context would silently corrupt the output. Trigger even if the user doesn't say "context surfing" — if an agent task is running across multiple steps with intent and a plan already established, this skill is live.
Frames coding-agent work sessions with explicit intent capture and drift monitoring. Use when a session transitions from planning/Q&A to implementation for coding tasks, refactors, feature builds, bug fixes, or other multi-step execution where scope drift is a risk.
Ensures alignment between user and Claude during feature/spec planning through a structured interview process. Use this skill when the user invokes /plan-interview before implementing a new feature, refactoring, or any non-trivial implementation task. The skill runs an upfront interview to gather requirements across technical constraints, scope boundaries, risk tolerance, and success criteria before any codebase exploration. Do NOT use this skill for: pure research/exploration tasks, simple bug fixes, or when the user just wants standard planning without the interview process.
Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks. For CI-only/headless learning capture, use self-improvement-ci.
Executes bash commands
Hook triggers when Bash tool is used
Modifies files
Hook triggers on file write and edit operations
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Uses power tools
Uses Bash, Write, or Edit tools
Uses power tools
Uses Bash, Write, or Edit tools
A collection of skills for AI agents. Follows the Agent Skills specification. This repository is my personal skill testing ground.
Every skill in this collection is built around a philosophy — a principle that addresses a specific failure mode in how agents work today. plan-interview is about collaborative planning: before codebase exploration starts, user and agent run a structured interview to align on constraints, scope, risk, and success criteria — and to surface whether a preparatory refactor should come before the main change. intent-framed-agent makes execution intent explicit so scope drift becomes visible. context-surfing monitors context quality and exits cleanly before degradation corrupts output. simplify-and-harden uses the peak context at end-of-task for a focused quality and security review. self-improvement turns repeated mistakes into durable rules that persist across sessions.
The common thread: agents have peak context at specific moments — after planning, mid-execution, at completion, after learning — and these skills are designed to exploit those peaks. Each skill encodes a philosophy that agents struggle to internalize on their own, turning it into a structured workflow they can follow reliably. skill-pipeline ties these pieces together by classifying the task and routing it through the right depth of process.
npx skills add pskoett/pskoett-ai-skills
skills/
skill-name/
SKILL.md # Required - skill definition with YAML frontmatter
scripts/ # Optional - executable code
references/ # Optional - documentation loaded on demand
assets/ # Optional - templates, images, data files
| Skill | Description |
|---|---|
| agent-teams-simplify-and-harden | Implementation + audit loop using parallel agent teams with structured simplify, harden, and document passes |
| context-surfing | Monitors context window health and rides peak context quality for maximum output fidelity during multi-step execution |
| dx-data-navigator | Query DX Data Cloud for developer productivity metrics, DORA metrics, PR/deployment data, and engineering analytics |
| intent-framed-agent | Captures a lightweight intent contract at execution start and monitors coding-task drift until resolution |
| plan-interview | Runs a structured interview before planning non-trivial implementations |
| self-improvement | Captures learnings and errors with hook-based activation and automatic skill extraction |
| skill-pipeline | Pipeline orchestrator that classifies tasks and routes them through the right skill combination at the right depth |
| simplify-and-harden | Post-completion self-review that runs simplify, harden, and micro-documentation passes before signaling done |
These skills are experimental and currently part of the testing ground setup.
| Skill | Description |
|---|---|
| self-improvement-ci | CI-only self-improvement workflow for recurring failure-pattern capture using gh-aw |
| simplify-and-harden-ci | CI-only simplify/harden workflow for pull requests using gh-aw with headless scan/report gates |
Each skill prevents a distinct failure mode:
| Skill | Failure it prevents |
|---|---|
plan-interview | Building the wrong thing |
intent-framed-agent | Scope creep during execution |
context-surfing | Degraded-context corruption |
simplify-and-harden | Shipping rough/insecure code |
self-improvement | Repeating the same mistakes |
[plan-interview] → [intent-framed-agent] ⟂ [context-surfing] → [simplify-and-harden] → [self-improvement]
↑ concurrent ↑
Stage 1 — Planning (manual gate): plan-interview runs a structured interview and produces a plan file in docs/plans/. This is the only skill that requires explicit invocation (/plan-interview). Downstream skills activate automatically when present, but each works independently if earlier stages are skipped.
Stage 2 — Execution (concurrent monitoring): intent-framed-agent captures the intent frame and monitors scope drift. context-surfing monitors context quality drift. Both run simultaneously. If both fire at once, context-surfing's exit takes precedence — degraded context makes scope checks unreliable.
Stage 3 — Review (post-completion): simplify-and-harden runs three passes (simplify, harden, document) on the completed work.
npx claudepluginhub pskoett/pskoett-ai-skillsProduction-grade engineering skills for AI coding agents — covering the full software development lifecycle from spec to ship.
Harness for Claude Code — skills, /harness:* slash commands, persona subagents, lifecycle hooks, and MCP tools without per-repo `harness setup`. Sibling plugins exist for Cursor, Gemini CLI, and Codex.
No description provided.
Agent Skills for improving SKILL.md files: mine repeated workflows from history, personalize and audit existing skills, or generalize personal skills for publication.
Professional skill creation with TDD workflow. Features dual-mode (fast/full), behavioral validation, and automated quality gates for 9.0/10+ scores.
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use