Skill

using-agentops

Explains AgentOps operating model, RPI lifecycle (Research-Plan-Implement-Validate), skills (/discovery, /crank, /swarm, /validation), hooks, CLI commands (ao search, /research), and context management.

documentation

developer-tools

Install

npx claudepluginhub boshu2/agentops --plugin agentops

Tool Access

This skill uses the workspace's default tool permissions.

Preview

AgentOps is the operational layer for coding agents.

Supporting Assets

scripts/validate.sh

SKILL.md

Similar Skills

using-git-worktrees

Creates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.

superpowers

168.3k

subagent-driven-development

3 files

Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.

superpowers

168.3k

dispatching-parallel-agents

Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.

superpowers

168.3k

Stats

Stars314

Forks32

Last CommitApr 25, 2026

Actions

View Source View Plugin View on GitHub View README

AgentOps Operating Model

AgentOps is the operational layer for coding agents.

Publicly, it gives you four things:

Bookkeeping — captured learnings, findings, and reusable context
Validation — plan and code review before work ships
Primitives — single skills, hooks, and CLI surfaces
Flows — named compositions like /research, /validation, and /rpi

Technically, AgentOps acts as a context compiler: raw session signal becomes reusable knowledge, compiled prevention, and better next work.

Core Flow: RPI

Research → Plan → Implement → Validate
    ↑                            │
    └──── Knowledge Flywheel ────┘

Research Phase

/research <topic>      # Deep codebase exploration
ao search "<query>"    # Search existing knowledge
ao search "<query>" --cite retrieved  # Record adoption when a search result is reused
ao lookup <id>         # Pull full content of specific learning
ao lookup --query "x"  # Search knowledge by relevance

Output: .agents/research/<topic>.md

Plan Phase

/pre-mortem <spec>     # Simulate failures (error/rescue map, scope modes, prediction tracking)
/plan <goal>           # Decompose into trackable issues

Output: Beads issues with dependencies

Implement Phase

/implement <issue>     # Single issue execution
/crank <epic>          # Autonomous epic loop (uses swarm for waves)
/swarm                 # Parallel execution (fresh context per agent)

Output: Code changes, tests, documentation

Validate Phase

/vibe [target]         # Code validation (finding classification + suppression + domain checklists)
/post-mortem           # Validation + streak tracking + prediction accuracy + retro history
/retro                 # Quick-capture a single learning

Output: .agents/learnings/, .agents/patterns/

Phase-to-Skill Mapping

Phase	Primary Skill	Supporting Skills
Discovery	`/discovery`	`/brainstorm`, `/research`, `/plan`, `/pre-mortem`
Implement	`/crank`	`/implement` (single issue), `/swarm` (parallel execution)
Validate	`/validation`	`/vibe`, `/post-mortem`, `/retro`, `/forge`

Choosing the skill:

Use /implement for single issue execution. Now defaults to TDD-first — writes failing tests before implementing. Skip with --no-tdd.
Use /crank for autonomous epic execution (loops waves via swarm until done). Auto-generates file-ownership maps to prevent worker conflicts.
Use /discovery for the discovery phase only (brainstorm → search → research → plan → pre-mortem).
Use /validation for the validation phase only (vibe → post-mortem → retro → forge).
Use /rpi for full lifecycle — delegates to /discovery → /crank → /validation.
Use /ratchet to gate/record progress through RPI.

Start Here (12 starters)

These are the skills every user needs first. Everything else is available when you need it.

Skill	Purpose
`/quickstart`	Guided onboarding — run this first
`/bootstrap`	One-command full AgentOps setup — fills gaps only
`/research`	Deep codebase exploration
`/council`	Multi-model consensus review + finding auto-extraction
`/vibe`	Code validation (classification + suppression + domain checklists)
`/rpi`	Full RPI lifecycle orchestrator (`/discovery` → `/crank` → `/validation`)
`/implement`	Execute single issue
`/retro --quick`	Quick-capture a single learning into the flywheel
`/status`	Single-screen dashboard of current work and suggested next action
`/goals`	Maintain GOALS.yaml fitness specification
`/push`	Atomic test-commit-push workflow
`/flywheel`	Knowledge flywheel health monitoring (σ×ρ > δ/100)

Advanced Skills (when you need them)

Skill	Purpose
`/compile`	Active knowledge intelligence — Mine → Grow → Defrag cycle
`/llm-wiki`	External reading wiki proposal — raw sources to compiled wiki
`/harvest`	Cross-rig knowledge consolidation — sweep, dedup, promote to global hub
`/knowledge-activation`	Operationalize a mature `.agents` corpus into beliefs, playbooks, briefings, and gap surfaces
`/brainstorm`	Structured idea exploration before planning
`/discovery`	Full discovery phase orchestrator (brainstorm → search → research → plan → pre-mortem)
`/plan`	Epic decomposition into issues
`/design`	Product validation gate — goal alignment, persona fit, competitive differentiation
`/pre-mortem`	Failure simulation (error/rescue, scope modes, temporal, predictions)
`/post-mortem`	Validation + streak tracking + prediction accuracy + retro history
`/bug-hunt`	Root cause analysis
`/release`	Pre-flight, changelog, version bumps, tag
`/crank`	Autonomous epic loop (uses swarm for each wave)
`/swarm`	Fresh-context parallel execution (Ralph pattern)
`/evolve`	Goal-driven fitness-scored improvement loop
`/autodev`	PROGRAM.md autonomous development contract setup and validation
`/dream`	Interactive Dream operator surface for setup, bedtime runs, and morning reports
`/doc`	Documentation generation
`/retro`	Quick-capture a learning (full retro → /post-mortem)
`/validation`	Full validation phase orchestrator (vibe → post-mortem → retro → forge)
`/ratchet`	Brownian Ratchet progress gates for RPI workflow
`/forge`	Mine transcripts for knowledge — decisions, learnings, patterns
`/readme`	Generate gold-standard README for any project
`/security`	Continuous repository security scanning and release gating
`/security-suite`	Binary and prompt-surface security suite — static analysis, dynamic tracing, offline redteam, policy gating
`/test`	Test generation, coverage analysis, and TDD workflow
`/red-team`	Persona-based adversarial validation — probe docs and skills from constrained user perspectives
`/review`	Review incoming PRs, agent output, or diffs — SCORED checklist
`/refactor`	Safe, verified refactoring with regression testing at each step
`/deps`	Dependency audit, update, vulnerability scanning, and license compliance
`/perf`	Performance profiling, benchmarking, regression detection, and optimization
`/scaffold`	Project scaffolding, component generation, and boilerplate setup
`/scenario`	Author and manage holdout scenarios for behavioral validation

Expert Skills (specialized workflows)

Skill	Purpose
`/grafana-platform-dashboard`	Build Grafana platform dashboards from templates/contracts
`/codex-team`	Parallel Codex agent execution
`/openai-docs`	Official OpenAI docs lookup with citations
`/oss-docs`	OSS documentation scaffold and audit
`/reverse-engineer-rpi`	Reverse-engineer a product into feature catalog and specs
`/pr-research`	Upstream repository research before contribution
`/pr-plan`	External contribution planning
`/pr-implement`	Fork-based PR implementation
`/pr-validate`	PR-specific validation and isolation checks
`/pr-prep`	PR preparation and structured body generation
`/pr-retro`	Learn from PR outcomes
`/complexity`	Code complexity analysis
`/product`	Interactive PRODUCT.md generation
`/handoff`	Session handoff for continuation
`/recover`	Post-compaction context recovery
`/trace`	Trace design decisions through history
`/provenance`	Trace artifact lineage to sources
`/beads`	Issue tracking operations
`/heal-skill`	Detect and fix skill hygiene issues
`/converter`	Convert skills to Codex/Cursor formats
`/update`	Reinstall all AgentOps skills from latest source

Knowledge Flywheel

Every /post-mortem promotes learnings and patterns into .agents/ so future /research starts with better context instead of zero.

Inspect, lint, and triage the .agents/ write surface contract via ao agents inspect | lint | doctor (doctor rolls up inspect + lint + orphan/stray-dir report; --strict fails on orphans).

Runtime Modes

AgentOps has four runtime modes. Do not assume hook automation exists everywhere.

Mode	When it applies	Start path	Closeout path	Guarantees
`gc`	Gas City (`gc`) binary available and `city.toml` present	gc controller manages sessions; `ao rpi` auto-selects gc executor	gc event bus captures phase/gate/failure/metric events	Default when gc is available. Phase execution via gc sessions, events via gc event bus, agent health via gc health patrol
`hook-capable`	Claude/OpenCode with lifecycle hooks installed (no gc)	Runtime hook or `ao inject` / `ao lookup`	Runtime hook or `ao forge transcript` + `ao flywheel close-loop`	Automatic startup/context injection and session-end maintenance when hooks are installed
`codex-native-hooks`	Codex CLI v0.115.0+ with native hook support (March 2026)	Runtime hooks (same as hook-capable)	Runtime hooks (same as hook-capable)	Native lifecycle hooks — same guarantees as hook-capable mode
`codex-hookless-fallback`	Codex Desktop / Codex CLI pre-v0.115.0 without hook surfaces	`ao codex start`	`ao codex stop`	Explicit startup context, citation tracking, transcript fallback, and close-loop metrics without hooks
`manual`	No hooks and no Codex-native runtime detection	`ao inject` / `ao lookup`	`ao forge transcript` + `ao flywheel close-loop`	Works everywhere, but lifecycle actions are operator-driven

Issue Tracking

This workflow uses beads for git-native issue tracking:

bd ready              # Unblocked issues
bd show <id>          # Issue details
bd close <id>         # Close issue
bd vc status          # Inspect Dolt state if needed (JSONL auto-sync is automatic)

Examples

Startup Context Loading

Hook-capable runtimes

session-start.sh (or equivalent) can run at session start.
In manual mode, MEMORY.md is auto-loaded and the hook points to on-demand retrieval (ao search, ao lookup).
In lean mode, the hook extracts pending knowledge and injects prior learnings with a reduced token budget.
This skill can be injected automatically into session context.

Codex (v0.115.0+: native hooks, older: hookless fallback)

v0.115.0+: hooks fire automatically — same behavior as hook-capable runtimes above.
Pre-v0.115.0: run ao codex start explicitly, use ao lookup for citations, end with ao codex stop.

Result: The agent gets the RPI workflow, prior context, and a citation path in all modes.

Workflow Reference During Planning

User says: "How should I approach this feature?"

What happens:

Agent references this skill's RPI workflow section
Agent recommends Research → Plan → Implement → Validate phases
Agent suggests /research for codebase exploration, /plan for decomposition
Agent explains /pre-mortem for failure simulation before implementation
User follows recommended workflow with agent guidance

Result: Agent provides structured workflow guidance based on this meta-skill, avoiding ad-hoc approaches.

Troubleshooting

Problem	Cause	Solution
Skill not auto-loaded	Hook runtime unavailable or startup path not run	Hook-capable runtimes: verify `hooks/session-start.sh` exists and is enabled. Codex: run `ao codex start` explicitly
Outdated skill catalog	This file not synced with actual skills/ directory	Update skill list in this file after adding/removing skills
Wrong skill suggested	Natural language trigger ambiguous	User explicitly calls skill with `/skill-name` syntax
Workflow unclear	RPI phases not well-documented here	Read full workflow guide in README.md or docs/ARCHITECTURE.md