From agentops
Explains AgentOps operating model, RPI lifecycle (Research-Plan-Implement-Validate), skills (/discovery, /crank, /swarm, /validation), hooks, CLI commands (ao search, /research), and context management.
npx claudepluginhub boshu2/agentops --plugin agentopsThis skill uses the workspace's default tool permissions.
AgentOps is the operational layer for coding agents.
Creates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.
Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.
Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.
AgentOps is the operational layer for coding agents.
Publicly, it gives you four things:
/research, /validation, and /rpiTechnically, AgentOps acts as a context compiler: raw session signal becomes reusable knowledge, compiled prevention, and better next work.
Research → Plan → Implement → Validate
↑ │
└──── Knowledge Flywheel ────┘
/research <topic> # Deep codebase exploration
ao search "<query>" # Search existing knowledge
ao search "<query>" --cite retrieved # Record adoption when a search result is reused
ao lookup <id> # Pull full content of specific learning
ao lookup --query "x" # Search knowledge by relevance
Output: .agents/research/<topic>.md
/pre-mortem <spec> # Simulate failures (error/rescue map, scope modes, prediction tracking)
/plan <goal> # Decompose into trackable issues
Output: Beads issues with dependencies
/implement <issue> # Single issue execution
/crank <epic> # Autonomous epic loop (uses swarm for waves)
/swarm # Parallel execution (fresh context per agent)
Output: Code changes, tests, documentation
/vibe [target] # Code validation (finding classification + suppression + domain checklists)
/post-mortem # Validation + streak tracking + prediction accuracy + retro history
/retro # Quick-capture a single learning
Output: .agents/learnings/, .agents/patterns/
| Phase | Primary Skill | Supporting Skills |
|---|---|---|
| Discovery | /discovery | /brainstorm, /research, /plan, /pre-mortem |
| Implement | /crank | /implement (single issue), /swarm (parallel execution) |
| Validate | /validation | /vibe, /post-mortem, /retro, /forge |
Choosing the skill:
/implement for single issue execution. Now defaults to TDD-first — writes failing tests before implementing. Skip with --no-tdd./crank for autonomous epic execution (loops waves via swarm until done). Auto-generates file-ownership maps to prevent worker conflicts./discovery for the discovery phase only (brainstorm → search → research → plan → pre-mortem)./validation for the validation phase only (vibe → post-mortem → retro → forge)./rpi for full lifecycle — delegates to /discovery → /crank → /validation./ratchet to gate/record progress through RPI.These are the skills every user needs first. Everything else is available when you need it.
| Skill | Purpose |
|---|---|
/quickstart | Guided onboarding — run this first |
/bootstrap | One-command full AgentOps setup — fills gaps only |
/research | Deep codebase exploration |
/council | Multi-model consensus review + finding auto-extraction |
/vibe | Code validation (classification + suppression + domain checklists) |
/rpi | Full RPI lifecycle orchestrator (/discovery → /crank → /validation) |
/implement | Execute single issue |
/retro --quick | Quick-capture a single learning into the flywheel |
/status | Single-screen dashboard of current work and suggested next action |
/goals | Maintain GOALS.yaml fitness specification |
/push | Atomic test-commit-push workflow |
/flywheel | Knowledge flywheel health monitoring (σ×ρ > δ/100) |
| Skill | Purpose |
|---|---|
/compile | Active knowledge intelligence — Mine → Grow → Defrag cycle |
/llm-wiki | External reading wiki proposal — raw sources to compiled wiki |
/harvest | Cross-rig knowledge consolidation — sweep, dedup, promote to global hub |
/knowledge-activation | Operationalize a mature .agents corpus into beliefs, playbooks, briefings, and gap surfaces |
/brainstorm | Structured idea exploration before planning |
/discovery | Full discovery phase orchestrator (brainstorm → search → research → plan → pre-mortem) |
/plan | Epic decomposition into issues |
/design | Product validation gate — goal alignment, persona fit, competitive differentiation |
/pre-mortem | Failure simulation (error/rescue, scope modes, temporal, predictions) |
/post-mortem | Validation + streak tracking + prediction accuracy + retro history |
/bug-hunt | Root cause analysis |
/release | Pre-flight, changelog, version bumps, tag |
/crank | Autonomous epic loop (uses swarm for each wave) |
/swarm | Fresh-context parallel execution (Ralph pattern) |
/evolve | Goal-driven fitness-scored improvement loop |
/autodev | PROGRAM.md autonomous development contract setup and validation |
/dream | Interactive Dream operator surface for setup, bedtime runs, and morning reports |
/doc | Documentation generation |
/retro | Quick-capture a learning (full retro → /post-mortem) |
/validation | Full validation phase orchestrator (vibe → post-mortem → retro → forge) |
/ratchet | Brownian Ratchet progress gates for RPI workflow |
/forge | Mine transcripts for knowledge — decisions, learnings, patterns |
/readme | Generate gold-standard README for any project |
/security | Continuous repository security scanning and release gating |
/security-suite | Binary and prompt-surface security suite — static analysis, dynamic tracing, offline redteam, policy gating |
/test | Test generation, coverage analysis, and TDD workflow |
/red-team | Persona-based adversarial validation — probe docs and skills from constrained user perspectives |
/review | Review incoming PRs, agent output, or diffs — SCORED checklist |
/refactor | Safe, verified refactoring with regression testing at each step |
/deps | Dependency audit, update, vulnerability scanning, and license compliance |
/perf | Performance profiling, benchmarking, regression detection, and optimization |
/scaffold | Project scaffolding, component generation, and boilerplate setup |
/scenario | Author and manage holdout scenarios for behavioral validation |
| Skill | Purpose |
|---|---|
/grafana-platform-dashboard | Build Grafana platform dashboards from templates/contracts |
/codex-team | Parallel Codex agent execution |
/openai-docs | Official OpenAI docs lookup with citations |
/oss-docs | OSS documentation scaffold and audit |
/reverse-engineer-rpi | Reverse-engineer a product into feature catalog and specs |
/pr-research | Upstream repository research before contribution |
/pr-plan | External contribution planning |
/pr-implement | Fork-based PR implementation |
/pr-validate | PR-specific validation and isolation checks |
/pr-prep | PR preparation and structured body generation |
/pr-retro | Learn from PR outcomes |
/complexity | Code complexity analysis |
/product | Interactive PRODUCT.md generation |
/handoff | Session handoff for continuation |
/recover | Post-compaction context recovery |
/trace | Trace design decisions through history |
/provenance | Trace artifact lineage to sources |
/beads | Issue tracking operations |
/heal-skill | Detect and fix skill hygiene issues |
/converter | Convert skills to Codex/Cursor formats |
/update | Reinstall all AgentOps skills from latest source |
Every /post-mortem promotes learnings and patterns into .agents/ so future /research starts with better context instead of zero.
Inspect, lint, and triage the .agents/ write surface contract via ao agents inspect | lint | doctor (doctor rolls up inspect + lint + orphan/stray-dir report; --strict fails on orphans).
AgentOps has four runtime modes. Do not assume hook automation exists everywhere.
| Mode | When it applies | Start path | Closeout path | Guarantees |
|---|---|---|---|---|
gc | Gas City (gc) binary available and city.toml present | gc controller manages sessions; ao rpi auto-selects gc executor | gc event bus captures phase/gate/failure/metric events | Default when gc is available. Phase execution via gc sessions, events via gc event bus, agent health via gc health patrol |
hook-capable | Claude/OpenCode with lifecycle hooks installed (no gc) | Runtime hook or ao inject / ao lookup | Runtime hook or ao forge transcript + ao flywheel close-loop | Automatic startup/context injection and session-end maintenance when hooks are installed |
codex-native-hooks | Codex CLI v0.115.0+ with native hook support (March 2026) | Runtime hooks (same as hook-capable) | Runtime hooks (same as hook-capable) | Native lifecycle hooks — same guarantees as hook-capable mode |
codex-hookless-fallback | Codex Desktop / Codex CLI pre-v0.115.0 without hook surfaces | ao codex start | ao codex stop | Explicit startup context, citation tracking, transcript fallback, and close-loop metrics without hooks |
manual | No hooks and no Codex-native runtime detection | ao inject / ao lookup | ao forge transcript + ao flywheel close-loop | Works everywhere, but lifecycle actions are operator-driven |
This workflow uses beads for git-native issue tracking:
bd ready # Unblocked issues
bd show <id> # Issue details
bd close <id> # Close issue
bd vc status # Inspect Dolt state if needed (JSONL auto-sync is automatic)
Hook-capable runtimes
session-start.sh (or equivalent) can run at session start.manual mode, MEMORY.md is auto-loaded and the hook points to on-demand retrieval (ao search, ao lookup).lean mode, the hook extracts pending knowledge and injects prior learnings with a reduced token budget.Codex (v0.115.0+: native hooks, older: hookless fallback)
ao codex start explicitly, use ao lookup for citations, end with ao codex stop.Result: The agent gets the RPI workflow, prior context, and a citation path in all modes.
User says: "How should I approach this feature?"
What happens:
/research for codebase exploration, /plan for decomposition/pre-mortem for failure simulation before implementationResult: Agent provides structured workflow guidance based on this meta-skill, avoiding ad-hoc approaches.
| Problem | Cause | Solution |
|---|---|---|
| Skill not auto-loaded | Hook runtime unavailable or startup path not run | Hook-capable runtimes: verify hooks/session-start.sh exists and is enabled. Codex: run ao codex start explicitly |
| Outdated skill catalog | This file not synced with actual skills/ directory | Update skill list in this file after adding/removing skills |
| Wrong skill suggested | Natural language trigger ambiguous | User explicitly calls skill with /skill-name syntax |
| Workflow unclear | RPI phases not well-documented here | Read full workflow guide in README.md or docs/ARCHITECTURE.md |