Plugin

agent-artifex

Name: agent-artifex
Author: flexion

Agent Artifex helps you design and test MCP servers, agents, chatbots, and tool-calling systems using evidence-based patterns from Anthropic, OpenAI, RAGAS, and academic research.

npx claudepluginhub flexion/claude-domestique --plugin agent-artifex

Component Overview

Skills

Component Details

Skills (6)

agent-artifex:assess

/assess

Use when the user asks "what testing do we need?", "what are our testing gaps?", "we have some tests but are they enough?", "is our MCP server well-tested?", "what should we test next?", "audit our test coverage for AI", "we keep getting bad responses and don't know why", "our agent picks the wrong tool sometimes", "how is my tool design?", "are my descriptions good enough?", "review my error messages", "is my system prompt well-designed?", "audit my MCP server design", "what design gaps do I have?", or needs to diagnose AI design or testing gaps in an existing project. Also use when someone says "assess my testing", "review our test strategy for AI", or "assess my design".

agent-artifex:design

/design

Use when the user wants to design an MCP server, agent, chatbot, or tool-calling system for quality. This includes: designing tool descriptions, structuring parameters and schemas, writing error messages for LLM consumers, designing system prompts, planning multi-turn conversations, architecting tool sets, or designing response formats. Also use when someone says "how should I design", "what makes a good tool description", "how should I structure my errors", "design my MCP server", "how do I organize my tools", or any task where they want to follow evidence-based design principles before or while building.

agent-artifex:foundations

/foundations

Use when the user asks "what is the AI testing framework?", "what are the design principles?", "explain the 7 design areas", "explain MCP testing", "what are the testing areas?", "what's the testing pyramid for AI?", "how do the testing layers relate?", "what should I test in my MCP server?", "overview of AI agent design and testing", or needs a comprehensive reference overview of the AI design and testing guidelines. Also use when someone is new to designing or testing AI systems and wants the big picture before diving into implementation.

agent-artifex:guide

/guide

Use when the user is unsure which AI services skill to invoke, asks for "AI services guidance" generally, says "help me design my MCP server", "how should I structure my tools?", "what makes a good tool description?", "how do I design my agent?", "help me test my MCP server", "how should I test my agent?", "what testing do I need?", "my tests are flaky", "the agent picks the wrong tool", "how do I set up CI for AI tests?", or wants a structured guided experience that routes across multiple AI services skills. Also use when the user mentions designing or testing chatbots, tool descriptions, evals, or agent behavior without specifying a particular skill.

agent-artifex:implement

/implement

Use when the user wants to improve an existing MCP server, agent, chatbot, or tool-calling system. This includes: improving tool descriptions, fixing error messages, adding output schemas, writing tests, implementing quality checks, adding evals, setting up test harnesses, or any task where they say "help me improve", "fix my descriptions", "add tests", "write evals", "implement quality checks", "make my server better", "apply the design principles", or are ready to make code changes to improve quality. This skill covers both design application (making the code better) and test implementation (verifying the code is good). For scaffolding new projects, use claude-api:mcp-builder. For design principles without code changes, use agent-artifex:design.

agent-artifex:learn

/learn

Use when the user says "I'm new to AI testing", "teach me about designing MCP servers", "how do I design good tool descriptions?", "walk me through design principles", "explain the design areas", "explain the testing pyramid for agents", "how do I test tool descriptions?", "walk me through an example", "I read the docs but it's not clicking", "how do evals work?", "what's faithfulness in AI testing?", "explain the causal chain", or wants to build fluency in AI services design and testing through Socratic dialogue rather than reading reference material.

README

Claude Domestique

Your strategic coding partner.

Like a cycling domestique, it carries the water, stays focused on your goals, and handles the unglamorous work you don't want to do.

The Plugins

memento — Session Persistence

"Remember Sammy Jankis."

Like Leonard in Memento, Claude can't form long-term memories. Context window fills up, conversation resets, everything vanishes. memento gives Claude its tattoos—session files that persist decisions, progress, and context across resets.

Helps developers embody Flexion fundamentals across conversation boundaries:

Lead by example — Persists decisions and progress so nothing is lost to "fixed bug" amnesia
Empower customers to adapt — Enables team handoffs with full context of what was done and why
Design as you go — Captures evolving understanding as key details emerge during work

mantra — Behavioral Rules

"I told you. You agreed. You forgot. Repeat."

You've written the perfect CLAUDE.md. Claude reads it. Claude agrees. By turn 47, Claude ignores half of it. mantra provides curated behavioral rules that are automatically loaded via Claude Code's native .claude/rules/ mechanism—ensuring consistent behavior from turn 1 to turn 100.

Helps developers embody Flexion fundamentals throughout long sessions:

Be skeptical and curious — Keeps Claude questioning assumptions and seeking evidence, not pattern-matching
Never compromise on quality — Maintains consistent standards from turn 1 to turn 100
Listen with humility — Enforces peer-not-subordinate stance, deferring to evidence over agreement

onus — Work-Item Automation

"The burden is mine now."

JIRA tickets, Azure DevOps work items, commit messages, PR descriptions. The awful-but-important work that kills your flow. onus handles the project management bureaucracy so you can code.

Supports GitHub Issues (default, zero-config), JIRA, and Azure DevOps work items.

Helps developers embody Flexion fundamentals while staying in flow:

Never compromise on quality — Ensures proper commit messages, ticket updates, and PR descriptions
Lead by example — Handles PM accountability work so it actually gets done
Empower customers to adapt — Keeps stakeholders informed via trackers without breaking developer focus

How They Work Together

External (GitHub/JIRA/Azure DevOps)
        │
        ▼ fetch issue details
    [onus]
        │
        ▼ populate session file
    [memento] ←── "What's next?" lookup
        │
        ▼ read session context
    [mantra] ──► native rules auto-loaded

Each plugin works standalone but gains enhanced behavior when used together.

All three plugins working together

Session resumption showing mantra (context refresh counter), onus (issue tracking), and memento (session file) working together. Claude reads the session file and picks up exactly where the previous conversation left off.

Requirements

Required

Tool	Version	Used By	Purpose
Claude Code	2.0.12+	All	Plugin host (plugin system introduced in v2.0.12)
Node.js	18+	All	Runtime for hooks and scripts
git	2.x	All	Branch detection, commits, session tracking

Platform-Specific (onus)

When using /onus:fetch, Claude will use these tools to retrieve work items:

Tool	Platform	Purpose
GitHub CLI (gh)	GitHub	Fetch issues, create PRs (recommended)
Claude's WebFetch	JIRA, Azure DevOps	Built-in HTTP tool for API calls

Note: For JIRA/Azure DevOps, Claude uses its built-in WebFetch capability. No additional tools required.

Environment Variables (for onus)

Variable	Platform	How to Get
`GITHUB_TOKEN`	GitHub	Create PAT with `repo` scope
`JIRA_TOKEN`	JIRA	`echo -n "email:api_token" | base64` (Get API token)
`AZURE_DEVOPS_TOKEN`	Azure DevOps	`echo -n ":pat" | base64` (Create PAT with Work Items Read)
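
As a rough illustration of the encoding commands in the table above, the sketch below builds both base64 values and exports them. The email, API token, and PAT are hypothetical placeholders, and the lowercase variable names are made up for this example; only `JIRA_TOKEN` and `AZURE_DEVOPS_TOKEN` come from the table. It swaps `echo -n` for `printf '%s'`, which behaves the same across shells, and strips any line wrapping that `base64` might add.

```shell
# Hypothetical placeholder credentials: substitute your own values.
jira_email="dev@example.com"
jira_api_token="example-api-token"
azure_pat="example-pat"

# JIRA expects base64("email:api_token"); Azure DevOps expects base64(":pat").
# printf '%s' writes the string without a trailing newline.
# tr -d '\n' removes line breaks that some base64 builds insert into long output.
JIRA_TOKEN="$(printf '%s' "${jira_email}:${jira_api_token}" | base64 | tr -d '\n')"
AZURE_DEVOPS_TOKEN="$(printf '%s' ":${azure_pat}" | base64 | tr -d '\n')"

export JIRA_TOKEN AZURE_DEVOPS_TOKEN
```

Setting the variables in the shell that launches Claude Code is one option; keeping the real values out of shell history and version control is left to your usual secrets handling.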

Verification

# Check required tools
git --version          # git version 2.x
node --version         # v18.x or higher
claude --version       # Claude Code 2.x

# Check GitHub CLI (optional, for onus with GitHub)
gh --version           # gh version 2.x
gh auth status         # Verify authentication
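
If you would rather script the Node.js check than read the version by eye, something like the following could work. This is a minimal sketch, not part of the plugins: `major_at_least` is a hypothetical helper written for this example, and it assumes `node --version` prints a string of the form `v18.19.0`.

```shell
# Hypothetical helper: succeeds when the major version in $1 is >= $2.
major_at_least() {
  version="${1#v}"        # drop a leading "v", as in "v18.19.0"
  major="${version%%.*}"  # keep only the text before the first dot
  [ "$major" -ge "$2" ]
}

# Report whether node is installed and meets the 18+ requirement.
if command -v node >/dev/null 2>&1 && major_at_least "$(node --version)" 18; then
  echo "node: OK"
else
  echo "node: missing or older than 18"
fi
```

The same helper could be pointed at other tools, provided their version output is first reduced to a bare `major.minor.patch` string.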

Installation

Add the Marketplace

/plugin marketplace add flexion/claude-domestique

Install Plugins

View full README on GitHub


Stats

Version: 1.0.0
Parent Repo Stars: 0
Maintenance: Good
License: MIT
Added: Mar 9, 2026


Available In

claude-domestique

