Plugin

llm-evals

Author: vanman2024

LLM testing and evaluation framework with promptfoo, DeepEval, golden datasets, and Supabase-backed eval tracking

Component Overview

Commands: /add, /build

Agents: dataset-curator, deepeval-specialist, eval-orchestrator, promptfoo-specialist

Skills: deepeval-testing, eval-tracking, promptfoo-config

Hooks, MCP Servers, LSP Servers, Output Styles, Themes, Monitors: none listed

Install

npx claudepluginhub vanman2024/ai-dev-marketplace --plugin llm-evals

Component Details

Commands (2)

Add LLM Eval Feature

/add

Adds a specific eval feature to an existing project. Available features: promptfoo, deepeval, golden-dataset, and supabase-tracking.

Build LLM Evals System

/build

**Project Name:** `$0`

Agents (4)

dataset-curator

/dataset-curator

Manages golden datasets for LLM evaluation - test case creation, categorization, and version control

deepeval-specialist

/deepeval-specialist

Specializes in DeepEval pytest-style LLM testing with built-in metrics and custom evaluations

eval-orchestrator

/eval-orchestrator

Orchestrates LLM evaluation workflows - coordinates promptfoo, DeepEval, datasets, and tracking

promptfoo-specialist

/promptfoo-specialist

Specializes in promptfoo configuration, prompt regression testing, and multi-provider comparison

Skills (3)

deepeval-testing

/skills/deepeval-testing

DeepEval pytest-style LLM testing patterns with built-in metrics, custom evaluators, and CI integration. Use when creating LLM tests, evaluating RAG quality, or measuring faithfulness/relevance.
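
The pytest-style pattern described here (build a test case, score it with a metric, fail the test below a threshold) can be sketched with the standard library alone. This is not DeepEval's API: `EvalCase`, `faithfulness_score`, and `assert_metric` are hypothetical names, and the token-overlap score is a toy stand-in for the LLM-judged metrics DeepEval provides.

```python
from dataclasses import dataclass, field


@dataclass
class EvalCase:
    """One test case: the prompt, the model's answer, and the retrieved context."""
    input: str
    actual_output: str
    retrieval_context: list[str] = field(default_factory=list)


def _tokens(text: str) -> set[str]:
    # Lowercased word set with surrounding punctuation stripped.
    words = (w.strip(".,!?;:").lower() for w in text.split())
    return {w for w in words if w}


def faithfulness_score(case: EvalCase) -> float:
    """Toy metric: fraction of answer tokens that also appear in the context."""
    answer = _tokens(case.actual_output)
    context = _tokens(" ".join(case.retrieval_context))
    return len(answer & context) / len(answer) if answer else 0.0


def assert_metric(case: EvalCase, threshold: float) -> None:
    """Fail the test when the score falls below the threshold."""
    score = faithfulness_score(case)
    assert score >= threshold, f"faithfulness {score:.2f} < {threshold}"


def test_refund_answer_is_grounded() -> None:
    # pytest collects this function by its test_ prefix.
    case = EvalCase(
        input="How long do refunds take?",
        actual_output="Refunds take 5 business days.",
        retrieval_context=["Refunds take 5 business days to process."],
    )
    assert_metric(case, threshold=0.7)
```

Because the test is an ordinary function with a plain `assert`, it runs under pytest in CI without extra wiring. Swapping the toy score for a real metric would change only the body of `faithfulness_score`.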

eval-tracking

/skills/eval-tracking

Supabase-backed evaluation tracking with runs, cases, and scores tables. Use when storing eval results, building dashboards, or tracking regression over time.
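
One way to picture the runs, cases, and scores tables is the sketch below. It uses an in-memory SQLite database as a local stand-in for Supabase's Postgres. The column names, the `record_run` helper, and the `regressions` query are assumptions made for illustration, not the plugin's actual schema.

```python
import sqlite3

# Hypothetical schema: one row per eval run, per test case, and per (run, case, metric) score.
SCHEMA = """
CREATE TABLE runs (
    id INTEGER PRIMARY KEY,
    label TEXT NOT NULL,
    created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE cases (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE scores (
    run_id INTEGER NOT NULL REFERENCES runs(id),
    case_id INTEGER NOT NULL REFERENCES cases(id),
    metric TEXT NOT NULL,
    value REAL NOT NULL,
    PRIMARY KEY (run_id, case_id, metric)
);
"""


def open_tracker() -> sqlite3.Connection:
    """Create an empty in-memory tracking database."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def record_run(conn: sqlite3.Connection, label: str,
               results: dict[str, dict[str, float]]) -> int:
    """Store one run; results maps case name -> {metric name: score}."""
    run_id = conn.execute("INSERT INTO runs (label) VALUES (?)", (label,)).lastrowid
    for case_name, metrics in results.items():
        conn.execute("INSERT OR IGNORE INTO cases (name) VALUES (?)", (case_name,))
        case_id = conn.execute(
            "SELECT id FROM cases WHERE name = ?", (case_name,)
        ).fetchone()[0]
        conn.executemany(
            "INSERT INTO scores (run_id, case_id, metric, value) VALUES (?, ?, ?, ?)",
            [(run_id, case_id, metric, value) for metric, value in metrics.items()],
        )
    conn.commit()
    return run_id


def regressions(conn: sqlite3.Connection, baseline_run: int, candidate_run: int,
                tolerance: float = 0.0) -> list[tuple]:
    """Return (case, metric, baseline, candidate) rows where the candidate scored lower."""
    return conn.execute(
        """
        SELECT c.name, base.metric, base.value, cand.value
        FROM scores AS base
        JOIN scores AS cand
          ON cand.case_id = base.case_id AND cand.metric = base.metric
        JOIN cases AS c ON c.id = base.case_id
        WHERE base.run_id = ? AND cand.run_id = ?
          AND cand.value < base.value - ?
        ORDER BY c.name, base.metric
        """,
        (baseline_run, candidate_run, tolerance),
    ).fetchall()
```

Keeping scores in a separate table keyed by run, case, and metric lets a dashboard or CI check compare any two runs with a single join.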

promptfoo-config

/skills/promptfoo-config

promptfoo configuration patterns for prompt regression testing, multi-provider comparison, and assertion-based validation. Use when setting up prompt testing, comparing LLM providers, or creating eval pipelines.
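
As a rough illustration of multi-provider comparison with assertion-based validation, the sketch below builds a promptfoo config as a Python dict and writes it as JSON, a format promptfoo accepts alongside YAML. The prompt text, test case, provider IDs, and the `build_config` and `write_config` helpers are illustrative assumptions; check the promptfoo documentation for the provider IDs available to you.

```python
import json
from pathlib import Path


def build_config(providers: list[str]) -> dict:
    """Build a minimal promptfoo config: one prompt, several providers, one assertion."""
    return {
        "description": "Refund-policy prompt regression",
        # {{question}} is filled in from each test's vars.
        "prompts": ["Answer the customer question concisely: {{question}}"],
        # Every provider listed here is run against every test.
        "providers": providers,
        "tests": [
            {
                "vars": {"question": "How long do refunds take?"},
                # Case-insensitive substring check on the model output.
                "assert": [{"type": "icontains", "value": "5 business days"}],
            }
        ],
    }


def write_config(path: Path, providers: list[str]) -> Path:
    """Serialize the config to a JSON file that promptfoo can read."""
    path.write_text(json.dumps(build_config(providers), indent=2) + "\n", encoding="utf-8")
    return path
```

Writing the file as `promptfooconfig.json` and running `npx promptfoo eval -c promptfooconfig.json` would then evaluate each provider against the same tests, assuming the relevant API keys are set in the environment.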

README

AI Development Marketplace

Central repository of 21 Claude Code plugins for AI-powered development - agents, SDKs, frontends, backends, and infrastructure.

Note: The domain-plugin-builder has been moved to its own standalone repository: https://github.com/vanman2024/domain-plugin-builder

What This Is

The ai-dev-marketplace is a collection of Claude Code plugins that provide slash commands, specialized agents, and skills for building AI applications. Each plugin targets a specific technology and can be used independently or combined into full-stack solutions.

Plugins (21 Total)

AI Agent Frameworks

Plugin	Description
`claude-agent-sdk`	Build AI agents with Claude's Agent SDK (TypeScript/Python)
`google-adk`	Google Agent Development Kit - Python, TypeScript, Go, Java
`a2a-protocol`	Agent-to-Agent Protocol for multi-agent interoperability

AI SDKs & Model Access

Plugin	Description
`vercel-ai-sdk`	Modular Vercel AI SDK with streaming, tool-calling, and multi-provider support
`openrouter`	Unified interface for 500+ LLM models with intelligent routing and cost optimization
`elevenlabs`	AI audio - TTS, STT, voice cloning, and Vercel AI SDK integration

AI Memory & RAG

Plugin	Description
`mem0`	AI memory management - Platform (hosted), Open Source (Supabase), MCP (OpenMemory)
`rag-pipeline`	RAG toolkit with LlamaIndex, LangChain, pgvector, Pinecone, Chroma

Machine Learning

Plugin	Description
`ml-training`	ML training/inference on cloud GPUs (Modal, Lambda Labs, RunPod) with HuggingFace

Frontend

Plugin	Description
`nextjs-frontend`	Next.js 15 App Router with AI SDK, Supabase, shadcn/ui, SEO, marketing tools
`sveltekit-frontend`	SvelteKit with Tailwind CSS v4, shadcn-svelte, Bun, HTML-to-Svelte migration
`mobile`	React Native/Expo, PWA, responsive design, EAS Build, app store deployment
`website-builder`	AI-powered sites with Astro, MDX, content-image-generation MCP, Supabase CMS

Backend

Plugin	Description
`fastapi-backend`	Production FastAPI with async/await, Mem0, SQLAlchemy, PostgreSQL
`celery`	Distributed task queue - workers, beat scheduling, Flower monitoring

Data & Infrastructure

Plugin	Description
`supabase`	Database, auth, storage, realtime, pgvector for AI apps
`redis`	Caching, sessions, rate limiting, pub/sub, AI embedding cache

Auth & Payments

Plugin	Description
`clerk`	Authentication with OAuth, organizations, and billing
`payments`	Stripe integration - checkout, subscriptions, webhooks with FastAPI/Next.js/Supabase

Communication

Plugin	Description
`resend`	Email API - transactional, contacts, broadcasts, templates, webhooks

Utilities

Plugin	Description
`plugin-docs-loader`	Universal documentation loading with link extraction and parallel WebFetch

Installation

Clone the Repository

git clone https://github.com/vanman2024/ai-dev-marketplace.git
cd ai-dev-marketplace

Install a Plugin

# From local clone
claude plugin install vercel-ai-sdk --project

# From GitHub directly
claude plugin install vercel-ai-sdk \
  --source github:vanman2024/ai-dev-marketplace/plugins/vercel-ai-sdk

Register as Marketplace

claude marketplace add ai-dev-marketplace \
  --source github:vanman2024/ai-dev-marketplace

claude marketplace list ai-dev-marketplace

Plugin Structure

Each plugin follows a consistent structure:

plugins/{name}/
├── .claude-plugin/
│   └── plugin.json          # Manifest (name, version, description)
├── commands/                # Slash commands (/plugin:command)
├── agents/                  # Specialized AI agents
├── skills/                  # Reusable knowledge/templates
├── docs/                    # Static documentation
└── README.md
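
The layout above can be scaffolded with a few lines of Python. This is a minimal sketch that assumes the manifest holds only the three fields named in the tree (name, version, description). The `scaffold_plugin` helper and the default version string are hypothetical and not part of the marketplace tooling.

```python
import json
from pathlib import Path

# Component directories shown in the tree above.
COMPONENT_DIRS = ("commands", "agents", "skills", "docs")


def scaffold_plugin(root: Path, name: str, description: str,
                    version: str = "0.1.0") -> Path:
    """Create plugins/{name}/ with the manifest, component directories, and a README."""
    plugin_dir = root / "plugins" / name
    for sub in COMPONENT_DIRS:
        (plugin_dir / sub).mkdir(parents=True, exist_ok=True)

    # Manifest lives in .claude-plugin/plugin.json.
    manifest_dir = plugin_dir / ".claude-plugin"
    manifest_dir.mkdir(exist_ok=True)
    manifest = {"name": name, "version": version, "description": description}
    (manifest_dir / "plugin.json").write_text(
        json.dumps(manifest, indent=2) + "\n", encoding="utf-8"
    )

    # Do not overwrite a README that already exists.
    readme = plugin_dir / "README.md"
    if not readme.exists():
        readme.write_text(f"# {name}\n\n{description}\n", encoding="utf-8")
    return plugin_dir
```

Calling `scaffold_plugin(Path("."), "my-plugin", "Example plugin")` from the repository root would produce an empty skeleton in this shape, ready for commands, agents, and skills to be added.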

Example Stacks

Combine plugins for complete solutions:

AI Chatbot:

vercel-ai-sdk + mem0 + supabase + nextjs-frontend + clerk

SaaS Platform:

nextjs-frontend + supabase + clerk + payments + redis + resend

Multi-Agent System:

claude-agent-sdk + a2a-protocol + celery + redis + supabase

Mobile App:

mobile + supabase + clerk + fastapi-backend

ML Pipeline:

ml-training + rag-pipeline + redis + fastapi-backend + supabase

Building New Plugins

Use the domain-plugin-builder:

/domain-plugin-builder:build-plugin my-plugin

This creates the full plugin structure with commands, agents, and skills.

Related Repositories

Similar Plugins

fullstack-dev-skills

8.6k

200

Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.

Stats

Version: 1.0.0

Parent Repo Stars: 2

Parent Repo Forks: 1

Maintenance: Good

Added: Jan 29, 2026

Available In

ai-dev-marketplace (2)

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools
