Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By selcukyucel
Agentic AI development workflow for Claude Code — AI-specific skills, agents, hooks, and project context for teams building AI automations
npx claudepluginhub selcukyucel/north-starr-genai --plugin north-starr-genaiDesign UI/UX patterns for AI-powered interfaces. Produces interaction specs for conversational UI, dashboards, approval workflows, confidence display, streaming UX, and error states. Spawned during BUILD when the plan includes a user-facing AI interface. Runs on a separate thread.
Technical design agent for AI stories. Produces architecture decisions, model selection, cost envelopes, and routes to invert and cost-estimator. Reads prior decisions from DECISIONS.md. Runs on a separate thread.
AI-specific inversion analysis agent. Given a requirement or feature description, produces `.plans/INVERT-<name>.md` covering prompt fragility, hallucination, cost, drift, data pipeline, guardrails, and observability. Runs on a separate thread. Invoked via `/ai-invert` skill or orchestrator dispatch on Q1/Q2 gate hits.
Configure monitoring, alerting, and observability for AI automations. Designs dashboards, cost tracking, accuracy drift detection, and alerting rules. Runs on a separate thread.
Autonomously improve any skill or agent prompt using a measure-change-test hill-climbing loop. Runs the target repeatedly, scores output against a yes/no checklist, makes one small change per round, keeps improvements, reverts regressions. Runs on a separate thread. Invoked via `/autoimprove` skill.
Run AI-specific inversion analysis on a requirement before implementation. Dispatches the `ai-invert-analyst` agent on a separate thread. Use before complex or high-stakes AI tasks that touch prompts, models, RAG, or AI-powered outputs.
Generate executable pytest test files for AI outputs. Produces assertion-based tests for deterministic AI components (classification, extraction, routing, structured output) that run in CI/CD. Complements /eval-suite which produces statistical evaluation datasets for non-deterministic outputs.
Analyze code modules and files for refactoring opportunities, code smells, and architectural pattern violations in any language or framework. Use this skill when the user asks to "analyze code smells", "find refactoring opportunities", "check for code quality issues", or "review architecture" for a specific module or file.
Classify project type, recommend approach, identify needed agents, estimate complexity, and flag risks. Runs BEFORE /decompose to help North Starr adapt its pipeline to what is being built.
Autonomously improve any skill or agent prompt via measure-change-test hill-climbing. Dispatches the `auto-improver` agent on a separate thread. Use when a skill gives inconsistent results, when asked to "improve/optimize/autoresearch" a skill, or when output quality needs iterative tightening.
Modifies files
Hook triggers on file write and edit operations
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
AI 오케스트라의 그랜드마스터 — PM이 코드를 쓰지 않고 AI 에이전트를 지휘하여 개발
Enterprise AI agent orchestration plugin with 150+ commands, 74+ specialized agents, SPARC methodology, swarm coordination, GitHub integration, and neural training capabilities
Mission Planner, Agent Creator, Skill Creator, and Librarian — the complete Forge suite for science-backed AI team assembly
Use this agent when implementing AI/ML features, integrating language models, building recommendation systems, or adding intelligent automation to applications. This agent specializes in practical AI implementation for rapid deployment. Examples:\n\n<example>\nContext: Adding AI features to an app\nuser: "We need AI-powered content recommendations"\nassistant: "I'll implement a smart recommendation engine. Let me use the ai-engineer agent to build an ML pipeline that learns from user behavior."\n<commentary>\nRecommendation systems require careful ML implementation and continuous learning capabilities.\n</commentary>\n</example>\n\n<example>\nContext: Integrating language models\nuser: "Add an AI chatbot to help users navigate our app"\nassistant: "I'll integrate a conversational AI assistant. Let me use the ai-engineer agent to implement proper prompt engineering and response handling."\n<commentary>\nLLM integration requires expertise in prompt design, token management, and response streaming.\n</commentary>\n</example>\n\n<example>\nContext: Implementing computer vision features\nuser: "Users should be able to search products by taking a photo"\nassistant: "I'll implement visual search using computer vision. Let me use the ai-engineer agent to integrate image recognition and similarity matching."\n<commentary>\nComputer vision features require efficient processing and accurate model selection.\n</commentary>\n</example>
Editorial "Agent Architect" bundle for Claude Code from Antigravity Awesome Skills.
Multi-agent team orchestration for Claude Code. Set up parallel AI agent teams with file-based planning, progress tracking, and role-based collaboration.
Agentic development workflow for Claude Code — skills, agents, and project context from your codebase
Your North Starr for AI Development | v0.15.0
An agentic AI development agency framework — North Starr plans, designs, validates, and orchestrates while Claude Code writes code in YOUR codebase. Works with any project: RAG pipelines, agent harnesses, multi-agent systems, prompt chains, or AI platform components.
┌──────────┐
│ /assess │ ← classifies project type
└────┬─────┘
│
┌────┴─────┐
│/discover │ ← elicits requirements (if needed)
└────┬─────┘
│
┌────┴──────┐
│/decompose │ ← PRD → stories
└────┬──────┘
│
┌──────┴──────┐
│ ORCHESTRATOR │
└──────┬──────┘
│
┌──────────┴──────────┐
│ chief-ai-po │ TRIAGE
└──────────┬──────────┘
│
┌──────────┴──────────┐
│ ai-architect │ DESIGN
└────┬──────────┬─────┘
│ │
┌─────────┘ └──────────┐
▼ ▼
┌──────────┐ ┌───────────────┐
│ ai-invert│ │cost-estimator │
└────┬─────┘ └───────┬───────┘
└───────────────┬───────────────┘
▼
┌──────────────────┐
│ genai-layoutplan │ PLAN (tags tasks with specialists)
└────────┬─────────┘
│
┌────────────────┼────────────────┬──────────────┐
▼ ▼ ▼ ▼
┌────────────┐ ┌────────────┐ ┌───────────────┐ ┌──────────┐
│ prompt- │ │ rag- │ │ integration- │ │ agentic- │
│ engineer │ │ advisor │ │ planner │ │ designer │
└─────┬──────┘ └─────┬──────┘ └───────┬───────┘ └────┬─────┘
└───────────────┼─────────────────┼───────────────┘
▼ BUILD
┌─────────────┼──────────────┐
▼ ▼ ▼
┌────────────┐ ┌────────────┐ ┌───────────┐
│ eval- │ │ guardrails-│ │ ai-ops │
│ designer │ │ designer │ │ (monitor) │
└─────┬──────┘ └──────┬─────┘ └─────┬─────┘
│ ┌────┴─────┐ │
│ │ prompt- │ │
│ │adversary │ │
│ └──────────┘ │
└───────────────┼────────────┘
▼ HARDEN
┌────────────┐
│demo-builder│ DELIVER
└────────────┘
Feedback loops:
eval fails ──→ prompt-engineer (fix prompt)
guardrails fail ──→ ai-architect (fix design)
cost overrun ──→ ai-architect (cheaper model)
same gate fails twice ──→ HUMAN escalation
North Starr GenAI is the brain of an AI development agency. It doesn't generate code — it generates the specs, designs, evaluations, and guardrails that make AI code production-grade.
CLIENT (you) → gives requirement
NORTH STARR (brain) → plans, designs, validates, orchestrates, quality-gates
CLAUDE CODE (hands) → reads North Starr's specs + writes code in YOUR codebase
/genai-bootstrap → makes Claude Code aware of your specific codebase patterns
Requirement → /assess (classify project type)
→ /discover (elicit requirements if needed)
→ /decompose (PRD → stories with AI safety criteria)
→ /orchestrate (start the pipeline)
→ TRIAGE: chief-ai-po refines story
→ DESIGN: ai-architect → ADR + cost envelope
→ PLAN: genai-layoutplan → tasks with specialist tags
→ BUILD: specialists produce specs → Claude Code implements
→ HARDEN: eval + guardrails + ops validate (ALL must pass)
→ DELIVER: demo-builder packages for client
Before ANY code change, the gate catches AI-specific risks:
| # | Question | Why |
|---|---|---|
| Q0 | Is current behavior covered by evals? | Eval-first discipline |
| Q1 | Does this touch a production prompt or model config? | Prompt changes are high-risk |
| Q2 | Does this change what data the model sees? | Data changes alter model behavior |
| Q3 | Does this affect a client-facing output? | Client-visible changes need baselines |
| Q4 | Could this change cost at scale? | Cost is a first-class concern |
Based on answers, it routes through: ASSESS → BUILD (specialists auto-spawn) → HARDEN (validators auto-run) → COMPLETE → LEARN.
The plugin ships two hooks that fire automatically: