By shmayro
Autonomously evolve Claude Code skills through recursive cycles: detect capability gaps in tasks, create new skills via structured workflows, score executions on a 5D rubric (correctness/completeness/edges/efficiency/reusability), diagnose/repair underperformers, crystallize high scorers with git tags, generate health reports, and monitor via dashboard.
npx claudepluginhub shmayro/singularity-claude --plugin singularity-claudeAnalyzes a failed or unsatisfied task to determine if a new skill should be created. Checks existing registry for coverage and recommends skill creation when capability gaps are found.
Evaluates the quality of a skill execution output against the Singularity scoring rubric. Returns a structured score breakdown with per-dimension ratings and rationale. Use this agent for automated skill scoring.
Use when a new reusable skill needs to be built, either detected by gap analysis, requested by the user, or triggered by /singularity-create
Use when a skill has proven itself with consistently high scores across multiple executions and should be locked as a stable, immutable version via /singularity-crystallize
Use when wanting an overview of all singularity-managed skills showing their scores, maturity levels, trends, and alerts via /singularity-dashboard
Use when a singularity-managed skill is failing, scoring below threshold, or producing incorrect outputs that need automated correction via /singularity-repair
Use when wanting to assess the health of a singularity-managed skill, check its maturity, decide whether it needs repair or is ready for crystallization via /singularity-review
Use after any singularity-managed skill execution to rate performance 0-100, track quality over time, and trigger repair when scores drop below threshold
Use when starting any conversation where skills may need to be created, evaluated, repaired, or when a capability gap is detected during task execution
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Evidence-based agent skills compiler with progressive capability tiers (Quick/Forge/Forge+/Deep).
Create new skills, improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, update or optimize an existing skill, run evals to test a skill, or benchmark skill performance with variance analysis.
Professional skill creation with TDD workflow. Features dual-mode (fast/full), behavioral validation, and automated quality gates for 9.0/10+ scores.
Create, test, measure, and iteratively improve Claude Code skills with category-aware design, gotchas-driven development, progressive disclosure coaching, and automated description optimization.
Tools for creating, auditing, and maintaining Claude Code skills. Includes /create-skill for scaffolding, /review-skill for quality checks, and /audit commands for bulk verification. Use when: building new skills, maintaining skill quality, or forking claude-skills repo.
Create and manage Claude Code skills, plugins, subagents, and hooks. Use when building new skills, validating existing skills, testing skills empirically, creating plugins, converting projects to plugins, creating hooks, or managing plugin automation. Includes /skills-toolkit:skill-composer, /skills-toolkit:skill-refiner, /skills-toolkit:skill-tester, /skills-toolkit:plugin-creator, /skills-toolkit:subagent-creator, /skills-toolkit:hook-creator, and /skills-toolkit:ask-user-question skills.
Skills that evolve themselves.
A self-evolving skill engine for Claude Code โ create, score, repair, and crystallize skills through autonomous recursive improvement loops.
Claude Code skills are static. You write them once, and they stay exactly as they are โ even when they fail, produce subpar output, or encounter new edge cases. There's no feedback loop, no way to know which skills are working well, and no mechanism to improve them over time.
You end up with a growing pile of skills where some work great, some are mediocre, and some are silently broken. The only way to fix them is manual review.
singularity-claude adds a recursive evolution loop to your skills:
graph TD
Goal["๐ฏ Goal:<br/>Set up API proxy endpoints"]
Goal --> HasSkill{Skill exists?}
HasSkill -->|No| Gap["๐ Gap detected:<br/>recurring pattern"]
Gap --> Create["โ๏ธ /singularity-create\nGather requirements\nGenerate SKILL.md\nRegister + init scoring"]
Create --> Execute
HasSkill -->|Yes| Execute["โก Execute skill"]
Execute --> Score["๐ /singularity-score\nDispatch assessor agent\nRate 5 dimensions ยท 0-100"]
Score --> Maturity["๐ Auto-update maturity\ndraft โ tested โ hardened"]
Maturity --> Low{Avg < 50\n2+ runs?}
Low -->|Yes| Repair["๐ง /singularity-repair\nAnalyze telemetry\nRewrite weak dimensions\nBump version ยท re-test"]
Repair --> Execute
Low -->|No| High{Avg โฅ 90 ยท 5+ runs\nhardened ยท edge cases?}
High -->|Yes| Crystal["๐ /singularity-crystallize\nGit tag + lock version"]
Crystal --> Done["โ
Production-grade skill\nImmutable ยท rollback-safe"]
High -->|No| Loop["๐ Keep using + scoring"]
Loop --> Execute
style Goal fill:#1d3557,color:#fff,stroke:#0d1b2a
style Gap fill:#e9ecef,color:#000,stroke:#adb5bd
style Create fill:#457b9d,color:#fff,stroke:#1d3557
style Execute fill:#2a9d8f,color:#fff,stroke:#1a7a6e
style Score fill:#e76f51,color:#fff,stroke:#c1440e
style Maturity fill:#264653,color:#fff,stroke:#1d3540
style Low fill:#fff3cd,color:#000,stroke:#ffc107
style High fill:#d4edda,color:#000,stroke:#28a745
style Repair fill:#f4a261,color:#000,stroke:#e76f51
style Crystal fill:#2d6a4f,color:#fff,stroke:#1b4332
style Done fill:#1b4332,color:#fff,stroke:#0d2818
style Loop fill:#a8dadc,color:#000,stroke:#457b9d
Tip: Click the expand button (โ) on the diagram to see the full interactive flow.
Skills are scored after every execution. Low scores trigger automatic repair. High scores lead to crystallization โ a locked, battle-tested version you can trust. Every step is logged for full auditability.
No external dependencies. No SmythOS, no OpenTelemetry collector, no Docker. Pure Claude Code.
| Skill | Command | Purpose |
|---|---|---|
| using-singularity | (auto-loaded) | Bootstrap context + capability gap detection |
| creating-skills | /singularity-create | Build new skills through a structured workflow |
| scoring | /singularity-score | Rate execution on a 5-dimension rubric (0-100) |
| repairing | /singularity-repair | Auto-fix failing skills by analyzing score history |
| crystallizing | /singularity-crystallize | Lock validated versions via git tags |
| reviewing | /singularity-review | Health check with trend analysis |
| dashboard | /singularity-dashboard | Overview of all managed skills |
| Agent | Model | Purpose |
|---|---|---|
| skill-assessor | Haiku | Fast, cheap automated scoring against the rubric |
| gap-detector | Haiku | Analyzes failed tasks to find missing skill capabilities |
| Script | Purpose |
|---|---|
score-manager.sh | CLI for reading/writing score JSON files (atomic writes, jq/node fallback) |
telemetry-writer.sh | CLI for structured execution logging with replay support |
Requires Claude Code CLI
Inside a Claude Code session, run: