Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Reusable team-of-agents plugin for the full SDLC — feature design, development, bugfix, environment analysis — with built-in RALF iteration loop, layered memory, and systematic eval. Project-agnostic: operations live in plugin, project context lives in target repo (CLAUDE.md / AGENTS.md).
npx claudepluginhub alex-voloshin-dev/ai-skills --plugin ai-skillsCloud Architecture — Azure, Google Cloud Platform, multi-cloud strategy, cloud landing zones, Azure Landing Zones, GCP organization hierarchy, Well-Architected Framework, cloud migration, FinOps, cost optimization, cloud networking, VPC, VNet, peering, VPN, CDN, WAF, cloud security, Entra ID, Cloud IAM, Workload Identity, compliance (GDPR, HIPAA, SOC 2, ISO 27001), managed services, AKS, GKE, serverless, Cloud Run, Azure Functions, data services, Cosmos DB, Spanner, BigQuery, Terraform, Bicep, disaster recovery, high availability
Content Design — page content strategy, conversion copywriting, visual content direction, landing page optimization, hero sections, value propositions, CTAs, social proof, testimonials, brand voice, content hierarchy, F-pattern, Z-pattern scanning, emotional design, storytelling, AI content generation, image selection, infographics, product promotion, attention-grabbing content, content personalization
Technical Content Writing — documentation strategy, Diátaxis framework, API documentation, UI microcopy, technical guides, docs-as-code, information architecture, release notes, troubleshooting guides, style guides, AI product documentation, SEO for docs
Data Engineering — ETL/ELT pipelines, Apache Spark, Apache Airflow, dbt, data warehousing, data lakes, BigQuery, Snowflake, Kafka, data modeling, data quality, schema evolution, batch and stream processing, data governance, data observability
Database Engineering — PostgreSQL, MySQL, SQL Server, MongoDB, Redis, Cassandra, DynamoDB, database design, schema modeling, SQL optimization, indexing strategies, partitioning, replication, backup and recovery, migrations, stored procedures, database security, connection pooling, NoSQL data modeling, time-series databases, graph databases, database monitoring, capacity planning
Use this skill when bootstrapping a target repository to be ai-skills-aware — on the first run of any ai-skills workflow in a fresh repo, when adopting the ai-skills plugin in an existing repo, or after upgrading to a plugin version that adds new memory paths or templates, including when the user does not say "init" but asks to "set up" or "onboard" the repo — to detect codebase type, create CLAUDE.md + AGENTS.md scaffolding, initialize the .ai-skills-memory/ directory tree from L1 templates, and configure .gitignore. Idempotent — safe to re-run. Accepts `--codebase-type <type>` and `--overwrite`. Not for re-initializing only memory — use `/memory-init` instead.
Use this skill when a local container won't start, a service is unreachable from the host, a local docker-compose stack is misbehaving, or as the Docker-layer diagnosis step of a local bugfix flow — including when the user describes the symptom without naming Docker — to run a Docker-specific local diagnostic that collects container status, logs, networking, and resource usage and diagnoses issues, applying the SRE or DevOps role for investigation. For multi-scope environment analysis (Docker + Kubernetes + CI runner + drift snapshot + optional auto-fix) use `/env-analyze` instead.
Use this skill when investigating a production incident, when an alert fires (latency spike, error rate, pod crashloop), when a customer-reported issue needs prod telemetry, or as the diagnosis step of an incident-response or production-bugfix flow — including when the user describes a prod symptom without asking to "analyze" — to analyze the production environment by collecting Kubernetes pod status, managed database health, logs, metrics, and networking and diagnosing issues, supporting GCP, Azure, and AWS via the `cloud-platforms` skill and applying the SRE or DevOps role.
Use this skill when the user asks to analyze, investigate, assess, evaluate, compare, or research a codebase, architecture, system, product, market, or research topic in a structured way — including requests that need explicit scope, framework-driven reasoning, and separated facts/inference/recommendation but do not use the word "analyze" — to run a deep analysis workflow producing traceable scope, evidence, and conclusions. Not for quick one-off lookups or implementation work — this is an investigation-and-conclusions workflow.
Use this skill when a service, module, or system needs documentation, audit, or tech-debt assessment — onboarding a new team to an undocumented service, establishing a pre-redesign "what do we actually have?" baseline, or a tech-debt audit before roadmap planning, including when the user does not say "architecture" — to analyze the existing architecture and produce ARCHITECTURE.md, current-state C4 diagrams, a component inventory, gap analysis (current vs target), and a technical-debt register. Routes to system-architect (default — system/module/service analysis), cloud-architect (`--cloud` for cloud-resource inventory + cost + compliance review), or devops-architect (`--cicd` for CI/CD platform audit + DORA baseline + governance review).
Terse, change-focused output style for /develop PR descriptions. Forces summary-first, bullet-listed changes, minimal narrative. Use when generating PRs from /develop or /create-pr workflows where reviewers need to scan quickly.
Structured Markdown output style with consistent heading hierarchy for /feature-design artefacts (PRD, ARCHITECTURE, UX-FLOW, DATA-MODEL, RISKS, IMPLEMENTATION-PLAN, REVIEW-LOG). Use when emitting design pack files to <repo>/docs/features/<feature-id>/.
Matches all tools
Hooks run on every tool call, not just specific ones
Executes bash commands
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
This plugin requires configuration values that are prompted when the plugin is enabled. Sensitive values are stored in your system keychain.
env_watch_enabledOpt-in: starts a background monitor watching docker-compose services if a compose file is detected at session start. Emits one JSON line per service-state change to stdout. Honors SIGTERM cleanly. Exits silently when no compose file or docker CLI is present.
${user_config.env_watch_enabled}env_watch_intervalHow often the env-watch monitor polls `docker compose ps` for state changes. Minimum 5s. Only used when env_watch_enabled is true.
${user_config.env_watch_interval}subagent_max_depthPhase 4 #4 (v0.1.7): max recursion depth allowed for subagent spawn chains. Enforced by subagent-depth-guard.py hook on SubagentStart. Default 3 per subagent-isolation.md (main thread -> orchestrator -> domain agent -> nested = 1 -> 2 -> 3). Anthropic runtime normally enforces depth=1 max; this is a defensive backstop.
${user_config.subagent_max_depth}ralph_default_max_iterDefault --max-iterations cap for /ralph (per D12).
${user_config.ralph_default_max_iter}ralph_session_max_iterTotal RALF iterations across ALL workflows in one session (Round 6 HIGH-3). Prevents runaway when chaining /feature-design then /develop in one session.
${user_config.ralph_session_max_iter}session_token_hard_capCumulative session token count at which the plugin pauses and asks for confirmation.
${user_config.session_token_hard_cap}session_token_soft_capCumulative session token count at which the plugin warns the user.
${user_config.session_token_soft_cap}ralph_default_token_budgetDefault --token-budget cap for /ralph (per D12).
${user_config.ralph_default_token_budget}ralph_session_token_budgetTotal RALF token spend across all workflows in one session (Round 6 HIGH-3).
${user_config.ralph_session_token_budget}subagent_learnings_enabledOpt-in: enables subagent-stop-learnings hook to capture non-trivial outputs for memory-curator review.
${user_config.subagent_learnings_enabled}user_global_memory_enabledOpt-in: allows /learnings-write to persist patterns to ~/.claude/ai-skills/learnings.md across all projects.
${user_config.user_global_memory_enabled}ralph_default_time_cap_minutesDefault --time-cap for /ralph (per D12).
${user_config.ralph_default_time_cap_minutes}ralph_session_time_cap_minutesTotal RALF wall-time across all workflows in one session (Round 6 HIGH-3).
${user_config.ralph_session_time_cap_minutes}Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Harness-native ECC plugin for engineering teams - 64 agents, 262 skills, 84 legacy command shims, reusable hooks, rules, MCP conventions, and operator workflows for Claude Code plus adjacent agent harnesses
Access thousands of AI prompts and skills directly in your AI coding assistant. Search prompts, discover skills, save your own, and improve prompts with AI.
Develop, test, build, and deploy Godot 4.x games with Claude Code. Includes GdUnit4 testing, web/desktop exports, CI/CD pipelines, and deployment to Vercel/GitHub Pages/itch.io.
Comprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Hook triggers when Bash tool is used
Hook triggers when Bash tool is used
Modifies files
Hook triggers on file write and edit operations
Modifies files
Hook triggers on file write and edit operations
Uses power tools
Uses Bash, Write, or Edit tools
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Vendor-agnostic agentic-dev tooling: 26 agents, 73 skills, and 45 eval rubrics that work across Claude Code, Codex, and Windsurf.
Engineering teams adopting AI coding agents hit the same wall: ad-hoc patterns that work for one developer don't scale to the team, evaluation is hand-wavy, and switching runtimes means rewriting everything. This repo is the working playbook of patterns that survived production use across all three major agentic-dev runtimes — with a tracked parity matrix so you know exactly what's available where, and 45 eval rubrics with 270 calibrated samples so "is it working?" has an answer.
| Asset | Count | What it is |
|---|---|---|
| Agents | 26 | Specialized sub-agents (review, planning, testing, refactor, security, etc.) — invoked from a parent agent or a slash command |
| Skills | 53 | Reusable, named instruction packs that adapt the agent's behavior to a domain (Python, Go, AWS, frontend, etc.) |
| Eval rubrics | 45 | Scorable rubrics for AI coding-agent output — code quality, test discipline, prompt-following, etc. |
| Calibration samples | 270 | Labeled examples (6 per rubric) that show what each rubric catches in practice — not theory |
| Hooks | 18 | Pre/post-action interceptors for guardrails (no git commit, file size limits, etc.) |
| Rules | 12 | Cross-cutting policies enforced at the agent or repo level |
| Workflows | 32 | User-invocable slash commands that compose agents + skills into a multi-step recipe |
Codex + Windsurf parity packages: 39 shared skills, 22 roles, 8
rules. Cross-runtime parity is tracked in
review/parity-matrix.md — check there for
the per-runtime availability of any capability.
Almost every public agentic-dev expert is locked to one runtime. This repo tracks parity across all three with a public matrix. When a feature lands in Claude Code first, the matrix tracks the gap and the workaround for Codex / Windsurf until parity is reached.
Most public discourse on AI coding-agent evaluation is a vague "AI eval is hard". The 45 rubrics ship with 270 calibrated samples showing what each rubric actually catches in production code. If you're operationalizing AI coding agents on a team, "how do I know it's working?" has an answer.
LTL, model checking, and FRR techniques apply to agent specifications more directly than people think. The intersection between formal methods and agentic dev is one of the least-explored angles in public discourse, and a few patterns in this repo lean on that background.
Prerequisites — clone the repo:
git clone https://github.com/alex-voloshin-dev/ai-skills.git
cd ai-skills
The plugin is not yet on a public marketplace. Install from the local clone:
claude --plugin-dir ./plugin
All 32 user-invocable workflows appear under the ai-skills: namespace
in /help. To reload after editing plugin files in the same session:
/reload-plugins
See plugin/README.md for full plugin install and
usage, or plugin/docs/getting-started.md
for a guided tour.
The installer syncs the Codex package (.codex/) plus the shared
skills (.agents/) into your home directory:
bash install.sh
Idempotent — files removed from the repo are also removed from
~/.codex/ / ~/.agents/ on re-run. Codex root instructions live in
AGENTS.md.
Same installer — it also syncs the Windsurf package (.windsurf/)
into ~/.windsurf/:
bash install.sh
On Windows use install.ps1 instead. Windsurf-native hooks are wired
through .windsurf/hooks.json — see
TESTING.md for hook validation.
A minimal first run — once installed, sample the eval harness in dry-run mode (no API key needed) to see the rubric system at work:
$ python3 plugin/eval/runner.py --tier 2 --dry-run --sample-rubrics 3 --samples-per-rubric 2
=== Tier 2 — Judge-Calibration Drift Smoke ===
Sample seed: 42
Rubrics sampled: 3 (analyze, code-review, spike)
API available: False
Tokens used: 0 (soft 50000, hard 150000)
[ERR ] analyze good tech-stack-comparison.score-4.0.md -- dry-run
[ERR ] analyze bad opinion-without-evidence.score-1.0.md -- dry-run
[ERR ] code-review good clean-pass.score-3.8.md -- dry-run
[ERR ] code-review bad mixed-with-security-scan.score-1.4.md -- dry-run
[ERR ] spike good auth-openidconnect-vs-oauth.score-4.4.md -- dry-run
[ERR ] spike bad unsubstantiated-claim.score-1.3.md -- dry-run