Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By mylukin
Harness AI agents for task-driven development: initialize project harness with task backlog, break requirements into granular tasks via PM/UX/Tech/QA agents, enforce TDD red-green-refactor workflows via CLI for single or batch tasks, track progress, and auto-generate architecture docs.
npx claudepluginhub mylukin/agent-foreman --plugin agent-foremanScan codebase and generate ARCHITECTURE.md documentation
Initialize task harness with ai/tasks/ directory structure
Get next task and implement with TDD workflow
Auto-complete all pending tasks or work on specific task
Multi-Expert Council for transforming requirements into fine-grained task files
Spec breakdown agent for foreman-spec workflow. Reads all spec files (PM, UX, TECH, QA), synthesizes OVERVIEW.md, creates BREAKDOWN task files for all modules, and updates index.json. Returns structured result to orchestrator.
Task management orchestrator for agent-foreman CLI. Analyzes user intent and delegates to skills - feature-next (single task), feature-run (batch processing), init-harness (project setup), project-analyze (codebase analysis). Handles TDD mode detection and verification. Triggers on 'agent-foreman', 'next task', 'run tasks', 'check task', 'TDD workflow', 'task status'.
Task implementation agent for feature-run workflow. Executes the next-implement-check cycle for a single task. Handles TDD workflow (RED-GREEN-REFACTOR) when strict mode is active. Returns structured results for orchestrator to process.
Product Manager agent for spec workflow. Clarifies WHAT and WHY of requirements. Identifies target users, business goals, success metrics, scope boundaries, and assumptions. First analyst in the serial workflow - insights inform all subsequent roles.
QA Manager agent for spec workflow. Designs HOW to verify the system. Defines test strategy, risk assessment, quality gates, and acceptance criteria verification. Fourth (final) analyst in serial workflow - has complete context from PM, UX, and Tech.
Implements a single task following the next → implement → check → done workflow with TDD support. Use when working on one specific task, implementing a single feature from the backlog, or following TDD red-green-refactor cycle. Triggers on 'next task', 'next feature', 'implement feature', 'work on feature', 'single task mode', 'what should I work on'.
Executes unattended batch processing of all pending tasks with autonomous decision-making. Use when running all tasks automatically, batch processing without supervision, completing entire feature backlog, or working on a specific task by ID. Triggers on 'run all tasks', 'complete all features', 'batch processing', 'unattended mode', 'auto-complete tasks', 'run feature'.
Multi-role requirement analysis and task breakdown workflow using 4 specialized AI agents (PM, UX, Tech, QA). Each agent conducts web research before analysis to gather industry best practices, case studies, and current trends. Supports Quick Mode (parallel, ~3 min, one Q&A session) and Deep Mode (serial, ~8 min, Q&A after EACH agent so answers inform subsequent analysis). Triggers on 'foreman-spec', 'spec feature', 'break down requirement', 'define tasks', 'spec this'.
Creates AI agent task management structure with feature backlog (ai/tasks/), TDD enforcement, and progress tracking. Use when setting up agent-foreman, initializing feature-driven development, creating task backlog, or enabling TDD mode. Triggers on 'init harness', 'setup feature tracking', 'create feature backlog', 'enable strict TDD', 'initialize agent-foreman'.
Scans codebases to generate architecture documentation (ARCHITECTURE.md). Use when joining an existing project, understanding codebase structure, exploring project architecture, or preparing for agent-foreman init. Triggers on 'analyze project', 'understand codebase', 'explore architecture', 'scan project structure', 'survey project'.
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Long-running agent harness with 5-layer memory architecture, GitHub integration, autonomous batch processing, Agent Teams with ATDD, 9 hooks (safety, quality gates, team coordination), and 6 Agent Skills
HelloAGENTS — The orchestration kernel that makes any AI CLI smarter. Adds intelligent routing, quality verification (Ralph Loop), safety guards, and notifications.
HarnessFlow — From idea to shipped product: high-quality engineering workflows for AI agents. Spec-anchored SDD, gated TDD, evidence-based routing, independent reviews, and formal closeout.
This skill should be used when the model's ROLE_TYPE is orchestrator and needs to delegate tasks to specialist sub-agents. Provides scientific delegation framework ensuring world-building context (WHERE, WHAT, WHY) while preserving agent autonomy in implementation decisions (HOW). Use when planning task delegation, structuring sub-agent prompts, or coordinating multi-agent workflows.
Plan-first AI development with batched parallelism. Native Claude Code implementation of the Agent Hive workflow.
Complete project development toolkit: 23 agents, 23 slash commands, 29 lifecycle hooks, and 69 reusable skills for Claude Code workflows
Autonomous end-to-end development system - from requirement to production-ready code with zero manual intervention
⚠️ DEPRECATED: This project is no longer maintained. Please migrate to ralph-dev - a simplified and improved version.
This project has been deprecated due to its complexity. A new, simplified version is now available:
👉 https://github.com/mylukin/ralph-dev
Please migrate to ralph-dev for continued support and updates.
Stop AI agents from half-building features. Ship complete code in one session.
AI coding agents face three common failure modes:
agent-foreman provides a structured harness that enables AI agents to:
/plugin install agent-foreman # 1. Install
/agent-foreman:init Build auth API # 2. Initialize
/agent-foreman:run # 3. Let AI work
# Quick install (binary)
curl -fsSL https://raw.githubusercontent.com/mylukin/agent-foreman/main/scripts/install.sh | bash
# Via npm
npm install -g agent-foreman
# Or use npx directly
npx agent-foreman --help
Manual download: GitHub Releases
/plugin marketplace add mylukin/agent-foreman
/plugin install agent-foreman
| Command | Description |
|---|---|
/agent-foreman:status | View project status and progress |
/agent-foreman:init | Initialize harness with project goal |
/agent-foreman:analyze | Analyze existing project structure |
/agent-foreman:spec | Transform requirements into tasks |
/agent-foreman:next | Get next priority task |
/agent-foreman:run | Auto-complete all pending tasks |
Transform requirements into tasks:
/agent-foreman:spec Build a user authentication system
Requirement → [PM→UX→Tech→QA] → Spec Files → BREAKDOWN Tasks → /run → Implementation
For standalone CLI usage without Claude Code:
| Command | Description |
|---|---|
init [goal] | Initialize or upgrade the harness |
next [feature_id] | Show next feature to work on |
status | Show current project status |
check [feature_id] | Verify code changes or task completion |
done <feature_id> | Verify, mark complete, and auto-commit |
fail <feature_id> | Mark a task as failed |
impact <feature_id> | Analyze impact of changes |
tdd [mode] | View or set TDD mode |
agents | Show available AI agents |
install | Install Claude Code plugin |
uninstall | Uninstall Claude Code plugin |
next → implement → check → done → repeat
| Step | Command | What Happens |
|---|---|---|
| 1 | next | Get task with acceptance criteria |
| 2 | implement | Write code to satisfy criteria |
| 3 | check | Verify implementation |
| 4 | done | Mark complete, auto-commit |
| File | Purpose |
|---|---|
ai/tasks/index.json | Task index with status summary |
ai/tasks/{module}/{id}.md | Individual task definitions |
ai/progress.log | Session handoff audit log |
ai/init.sh | Environment bootstrap script |
CLAUDE.md | AI agent instructions |
| Status | Meaning |
|---|---|
failing | Not yet implemented |
passing | Acceptance criteria met |
blocked | External dependency blocking |
needs_review | May be affected by changes |
failed | Verification failed |
deprecated | No longer needed |
AI agents need the same tooling that makes human teams effective: