Plugin

orq

Name: orq
Author: orq-ai

By orq-ai

Agent skills for building, deploying, evaluating, and monitoring LLM pipelines on the orq.ai platform.

Component Overview

/analytics, /models +3

Commands

orq-assistant

Agents

analyze-trace-failures, build-agent +7

Skills

Hooks

orq-workspace

MCP Servers

LSP Servers

Output Styles

Install

npx claudepluginhub orq-ai/assistant-plugins

Component Details

Commands (5)

analytics

/analytics

Show workspace analytics — requests, cost, tokens, errors, top models, and drill-down trends

models

/models

List available AI models and their capabilities

quickstart

/quickstart

Interactive onboarding guide — set up credentials, connect to orq.ai, and learn every command and skill

traces

/traces

Query and summarize traces with filters — debugging entry point before analyze-trace-failures

workspace

/workspace

Show a workspace overview — agents, deployments, prompts, datasets, experiments, projects, and knowledge bases

Agents (1)

orq-assistant

/AGENTS

orq.ai workspace assistant — routes to skills and commands for building, evaluating, and monitoring LLM pipelines

Skills (9)

analyze-trace-failures

/analyze-trace-failures

Read production traces, identify what's failing, and build failure taxonomies using open coding and axial coding methodology. Use when debugging agent or pipeline quality, investigating "why are my outputs bad?", or before building any evaluator — error analysis must come first. Do NOT use when you already have identified failure modes and need evaluators (use build-evaluator) or datasets (use generate-synthetic-dataset).

build-agent

/build-agent

Design, create, and configure orq.ai Agents with tools, instructions, knowledge bases, and memory stores. Use when building new agents, attaching KBs or memory, writing system instructions, selecting models, or setting up RAG pipelines. Do NOT use for debugging existing agents (use analyze-trace-failures) or comparing agents across frameworks (use compare-agents).

build-evaluator

/build-evaluator

Create validated LLM-as-a-Judge evaluators following best practices — binary Pass/Fail judges with TPR/TNR validation for measuring specific failure modes. Use when you need to automate quality checks, build guardrails, or measure a specific failure mode identified during trace analysis. Do NOT use when failures are fixable with prompt changes (use optimize-prompt) or when failure modes are unknown (use analyze-trace-failures first).

compare-agents

/compare-agents

Run cross-framework agent comparisons using evaluatorq from orqkit — compares any combination of agents (orq.ai, LangGraph, CrewAI, OpenAI Agents SDK, Vercel AI SDK) head-to-head on the same dataset with LLM-as-a-judge scoring. Use when comparing agents, benchmarking, or wanting side-by-side evaluation. Do NOT use when comparing only orq.ai configurations with no external agents (use run-experiment instead).

generate-synthetic-dataset

/generate-synthetic-dataset

Generate and curate evaluation datasets — structured generation via dimensions-tuples-NL, quick from description, expansion from existing data, plus dataset maintenance through deduplication, rebalancing, and gap-filling. Use when creating eval data, expanding test coverage, or cleaning datasets. Do NOT use when sufficient real production data exists (use analyze-trace-failures instead). Do NOT use for evaluator creation (use build-evaluator).

invoke-deployment

/invoke-deployment

Invoke orq.ai deployments, agents, and models via the Python SDK or HTTP API. Use when a user wants to call a deployment with prompt variables, invoke an agent in a conversation, or call a model directly through the AI Router. Do NOT use for creating or editing deployments/agents (use optimize-prompt or build-agent). Do NOT use for running evaluations (use run-experiment).

optimize-prompt

/optimize-prompt

Analyze and optimize system prompts using a structured prompting guidelines framework — AI-powered analysis and rewriting. Use when a prompt needs improvement, experiment results show quality gaps, or you want a structured review of an existing system prompt. Do NOT use when production traces show failures (use analyze-trace-failures first to identify patterns). Do NOT use to build evaluators (use build-evaluator).

run-experiment

/run-experiment

Create and run orq.ai experiments — compare configurations against datasets using evaluators, analyze results, and generate prioritized action plans. Use when evaluating LLM agents, deployments, conversations, or RAG pipelines end-to-end. Do NOT use without a dataset and evaluators. Do NOT use for cross-framework comparisons with external agents (use compare-agents).

setup-observability

/setup-observability

Set up orq.ai observability for LLM applications. Use when setting up tracing, adding the AI Router proxy, integrating OpenTelemetry, auditing existing instrumentation, or enriching traces with metadata.

MCP Servers (1)

Connects to external services

orq-workspace

External

README

Orq.ai Agent Skills

Agent Skills for the full Build → Evaluate → Optimize lifecycle of LLM pipelines on orq.ai.

Skills are multi-step workflows that require reasoning (e.g. build an agent, run an experiment);

Commands are quick actions for immediate results (list traces, show analytics).

Each skill encodes best practices from prompt engineering, agent design, evaluation methodology, and experimentation into repeatable workflows. From creating agents and writing prompts, through trace analysis and dataset generation, to running validated experiments and iterating on results.

Built on the Agent Skills standard format, so it works with any compatible agent (Claude Code, Cursor, Gemini CLI, and others).

Setup

Prerequisites

An orq.ai account
An API key from Settings → API Keys
```
export ORQ_API_KEY=your-key-here
```

Claude Code plugin

Use this if you want easy access to all components — skills, MCP tools, and trace hooks — in one install. Installed via the orq-ai/claude-plugins marketplace.

# In Claude Code:
/plugin marketplace add orq-ai/claude-plugins

# Install all 3 plugins
/plugin install orq-skills@orq-claude-plugin
/plugin install orq-mcp@orq-claude-plugin
/plugin install orq-trace@orq-claude-plugin

Plugin	What it gives you
`orq-skills`	Skills, commands, and agents for the Build → Evaluate → Optimize lifecycle
`orq-mcp`	MCP server registration — Claude can call orq.ai APIs directly
`orq-trace`	OTLP tracing hooks that capture Claude Code sessions into orq.ai

Verify with the interactive onboarding — checks ORQ_API_KEY, MCP reachability, and credentials:

/orq:quickstart

Skills-only install

Use this when you're on a non-Claude agent (Cursor, Gemini CLI, Cline, Copilot CLI, Codex, Windsurf, and many others), or when you only want the skills without MCP/trace hooks.

npx skills add orq-ai/orq-skills

Auto-detects your agent and writes skills to the correct location (e.g. .claude/skills/, .cursor/rules/). Run inside your project directory.

Agent-specific install guides:

MCP-only install

Use this when you want orq.ai MCP tools in a tool that isn't the Claude Code plugin (Claude Desktop, other MCP-capable clients, or manual Claude Code setup).

# Manual registration in Claude Code
claude mcp add --transport http orq-workspace https://my.orq.ai/v2/mcp \
  --header "Authorization: Bearer ${ORQ_API_KEY}"

For other clients, most accept a JSON block with url + headers:

{
  "mcpServers": {
    "orq-workspace": {
      "type": "http",
      "url": "https://my.orq.ai/v2/mcp",
      "headers": { "Authorization": "Bearer ${ORQ_API_KEY}" }
    }
  }
}

Manifest validation

tests/scripts/validate-plugin-manifests.sh

Commands

Quick-action slash commands. Use /orq:<command> in Claude Code.

Command	What It Does	Usage
quickstart	Interactive onboarding — credentials, MCP setup, skills tour	`/orq:quickstart`
workspace	Workspace overview — agents, deployments, prompts, datasets, experiments	`/orq:workspace [section]`
traces	Query and summarize traces with filters	`/orq:traces [--deployment name] [--status error] [--last 24h]`
models	List available AI models by provider	`/orq:models [search-term]`
analytics	Usage analytics — requests, cost, tokens, errors	`/orq:analytics [--last 24h] [--group-by model]`

Examples

/orq:workspace agents          # Show only agents
/orq:traces --status error --last 1h   # Recent errors
/orq:models gpt-4              # Search for GPT-4 variants
/orq:analytics --group-by deployment    # Cost per deployment

Skills

Skills are triggered by describing what you need. Claude picks the right skill automatically.

View full README on GitHub

Similar Plugins

team-skills-platform

163.7k

1.4K

Team-oriented workflow plugin with role agents, 27 specialist agents, ECC-inspired commands, layered rules, and hooks skeleton.

Stats

Version0.0.2

Stars0

MaintenanceExcellent

LicenseMIT

AddedApr 8, 2026

Actions

View on GitHub View README Plugin Marketplace JSON Homepage

Safety Signals

Caution

External network access

Connects to servers outside your machine

Uses power tools

Uses Bash, Write, or Edit tools

Orq.ai Agent Skills

Agent Skills for the full Build → Evaluate → Optimize lifecycle of LLM pipelines on orq.ai.

Skills are multi-step workflows that require reasoning (e.g. build an agent, run an experiment);

Commands are quick actions for immediate results (list traces, show analytics).

Built on the Agent Skills standard format, so it works with any compatible agent (Claude Code, Cursor, Gemini CLI, and others).

Setup

Prerequisites

An orq.ai account
An API key from Settings → API Keys
```
export ORQ_API_KEY=your-key-here
```

Claude Code plugin

Use this if you want easy access to all components — skills, MCP tools, and trace hooks — in one install. Installed via the orq-ai/claude-plugins marketplace.

# In Claude Code:
/plugin marketplace add orq-ai/claude-plugins

# Install all 3 plugins
/plugin install orq-skills@orq-claude-plugin
/plugin install orq-mcp@orq-claude-plugin
/plugin install orq-trace@orq-claude-plugin

Plugin	What it gives you
`orq-skills`	Skills, commands, and agents for the Build → Evaluate → Optimize lifecycle
`orq-mcp`	MCP server registration — Claude can call orq.ai APIs directly
`orq-trace`	OTLP tracing hooks that capture Claude Code sessions into orq.ai

Verify with the interactive onboarding — checks ORQ_API_KEY, MCP reachability, and credentials:

/orq:quickstart

Skills-only install

Use this when you're on a non-Claude agent (Cursor, Gemini CLI, Cline, Copilot CLI, Codex, Windsurf, and many others), or when you only want the skills without MCP/trace hooks.

npx skills add orq-ai/orq-skills

Auto-detects your agent and writes skills to the correct location (e.g. .claude/skills/, .cursor/rules/). Run inside your project directory.

Agent-specific install guides:

MCP-only install

Use this when you want orq.ai MCP tools in a tool that isn't the Claude Code plugin (Claude Desktop, other MCP-capable clients, or manual Claude Code setup).

# Manual registration in Claude Code
claude mcp add --transport http orq-workspace https://my.orq.ai/v2/mcp \
  --header "Authorization: Bearer ${ORQ_API_KEY}"

For other clients, most accept a JSON block with url + headers:

{
  "mcpServers": {
    "orq-workspace": {
      "type": "http",
      "url": "https://my.orq.ai/v2/mcp",
      "headers": { "Authorization": "Bearer ${ORQ_API_KEY}" }
    }
  }
}

Manifest validation

tests/scripts/validate-plugin-manifests.sh

Commands

Quick-action slash commands. Use /orq:<command> in Claude Code.

Command	What It Does	Usage
quickstart	Interactive onboarding — credentials, MCP setup, skills tour	`/orq:quickstart`
workspace	Workspace overview — agents, deployments, prompts, datasets, experiments	`/orq:workspace [section]`
traces	Query and summarize traces with filters	`/orq:traces [--deployment name] [--status error] [--last 24h]`
models	List available AI models by provider	`/orq:models [search-term]`
analytics	Usage analytics — requests, cost, tokens, errors	`/orq:analytics [--last 24h] [--group-by model]`

Examples

/orq:workspace agents          # Show only agents
/orq:traces --status error --last 1h   # Recent errors
/orq:models gpt-4              # Search for GPT-4 variants
/orq:analytics --group-by deployment    # Cost per deployment

Skills

Skills are triggered by describing what you need. Claude picks the right skill automatically.

orq

Component Overview

Install

Component Details

Commands (5)

Agents (1)

Skills (9)

MCP Servers (1)

README

Orq.ai Agent Skills

Setup

Prerequisites

Claude Code plugin

Skills-only install

MCP-only install

Manifest validation

Commands

Examples

Skills

Similar Plugins

team-skills-platform

orq

Component Overview

Install

Component Details

Commands (5)

Agents (1)

Skills (9)

MCP Servers (1)

README

Orq.ai Agent Skills

Setup

Prerequisites

Claude Code plugin

Skills-only install

MCP-only install

Manifest validation

Commands

Examples

Skills

Similar Plugins

team-skills-platform

context7-plugin

episodic-memory

startup-business-analyst

dotnet-skills

fullstack-dev-skills