Plugin

agentops-accelerator

Name: agentops-accelerator
Author: azure

Run standardized AI agent evaluation workflows with AgentOps Toolkit and Microsoft Foundry, including generating evaluation datasets, running release-readiness gates, scaffolding safety and red-team runners, interpreting reports, managing CI/CD workflows, and surfacing regressions from eval history and Azure Monitor traces.

What's Inside

Skills: 7

agentops-agent

/agentops-agent

AgentOps Doctor - surface release-readiness findings, regressions, latency spikes, error rates, and safety hits across AgentOps eval history, Azure Monitor traces, and Foundry control plane.

agentops-config

/agentops-config

Generate or update agentops.yaml (flat 1.0 schema) for AgentOps release-readiness gates. Trigger on "configure agentops", "agentops.yaml", "set up evaluation", "what should I evaluate". Infer the agent target and dataset from the codebase; ask only when nothing can be found.

agentops-dataset

/agentops-dataset

Create or extend a small JSONL dataset for AgentOps release-readiness gates. Trigger on "create dataset", "generate test data", "JSONL", "more eval rows". Infer the agent's domain from the codebase and produce realistic rows; never fabricate data when the domain is unclear.

agentops-eval

/agentops-eval

Run AgentOps release-readiness evaluations against Foundry prompt agents, Foundry hosted endpoints, HTTP/JSON agents, or raw model deployments. Trigger on phrases like "run eval", "evaluate my agent", "benchmark", "agentops eval", "compare runs", "can we ship". Uses the flat agentops.yaml schema.

agentops-governance

/agentops-governance

Scaffold ASSERT and Red Team runners for the release gate, and draft reviewable governance evidence for ASSERT, Agent Control Specification (ACS), Guided Guardrail readiness, and red-team planning. Trigger on "ASSERT", "ACS", "agent control", "guardrail", "red team", "governance", "release evidence", "scaffold assert", "set up red team", "add safety gate".
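The agentops-dataset skill above produces JSONL, which means one JSON value per line. As a minimal sketch, a format-level sanity check for such a file might look like the following. The helper name check_jsonl is hypothetical, and the rule that every row must be a JSON object is an assumption; the listing does not show the actual row schema, so no field names are validated here.

```python
import json
from typing import Iterable, List, Tuple


def check_jsonl(lines: Iterable[str]) -> List[Tuple[int, str]]:
    """Return (line_number, problem) pairs for rows that fail basic checks.

    Hypothetical helper, not part of the AgentOps CLI. It only checks the
    JSONL format itself; the real dataset schema is not shown in the source.
    """
    problems: List[Tuple[int, str]] = []
    for number, line in enumerate(lines, start=1):
        if not line.strip():
            continue  # tolerate blank lines, e.g. a trailing newline
        try:
            row = json.loads(line)
        except json.JSONDecodeError as exc:
            problems.append((number, f"invalid JSON: {exc.msg}"))
            continue
        # Assumption: each dataset row is a JSON object, not a list or scalar.
        if not isinstance(row, dict):
            problems.append((number, "row is not a JSON object"))
    return problems
```

Passing an open file handle works because iterating over a file yields its lines; an empty result means every non-blank line parsed as a JSON object.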

Stats

Version: 0.8.1

Released: Jul 16, 2026

Language: Python

Stars: 10

Forks: 8

Maintenance: Excellent

License: MIT

Last Commit: Jul 16, 2026

Added: May 22, 2026

Available In

agentops10

AgentOps Accelerator

Evaluate. Ship. Observe. Operate.
Continuous evaluation, safety testing, observability, and release readiness for Microsoft Foundry agents.

AgentOps Accelerator helps Microsoft Foundry agent teams evaluate quality, prepare releases, monitor behavior, and operate reliably after launch. It gives you a practical starting point for agent operations, with Foundry integration as the default path and deeper setup guidance in the full docs.

Get started

python -m pip install agentops-accelerator
agentops init

agentops init starts a guided setup that creates your agentops.yaml and .agentops/ workspace.

Next, follow the tutorial on the documentation site that matches your agent type.

What it helps you do

Use AgentOps Accelerator when you need to:

Evaluate an agent before release
Compare changes across versions
Capture release evidence
Monitor agent quality and regressions
Give teams a repeatable way to operate agents responsibly in production

The accelerator keeps the local workflow simple, then points you to the full docs when you are ready to configure pipelines, dashboards, and release practices.

Learn more

For setup guides, tutorials, architecture, CI/CD guidance, Doctor checks, and evaluator reference, start with the documentation site:

https://aka.ms/agentops-accelerator

Run a first evaluation

az login
$env:AZURE_AI_FOUNDRY_PROJECT_ENDPOINT = "https://<resource>.services.ai.azure.com/api/projects/<project>"
$env:AZURE_OPENAI_ENDPOINT = "https://<openai-resource>.openai.azure.com"
$env:AZURE_OPENAI_DEPLOYMENT = "gpt-4o-mini"
agentops eval analyze
agentops eval run
agentops doctor --evidence-pack

For Foundry targets, set the project endpoint either as project_endpoint: in agentops.yaml or through the AZURE_AI_FOUNDRY_PROJECT_ENDPOINT environment variable. When both are set, the value in agentops.yaml takes precedence.
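The precedence rule above can be sketched as a small lookup. This is an illustration only, not the accelerator's own code: resolve_project_endpoint is a hypothetical helper, the config is assumed to be an already-parsed agentops.yaml mapping, and treating an empty string as "not set" is an assumption.

```python
import os
from typing import Mapping, Optional

ENV_VAR = "AZURE_AI_FOUNDRY_PROJECT_ENDPOINT"


def resolve_project_endpoint(
    config: Mapping[str, object],
    env: Optional[Mapping[str, str]] = None,
) -> Optional[str]:
    """Pick the Foundry project endpoint: config first, environment second.

    Hypothetical helper. `config` stands in for the parsed agentops.yaml.
    """
    env = os.environ if env is None else env
    from_config = config.get("project_endpoint")
    # Assumption: a blank value in the config counts as unset.
    if isinstance(from_config, str) and from_config.strip():
        return from_config.strip()
    from_env = env.get(ENV_VAR, "").strip()
    return from_env or None
```

Passing env explicitly keeps the function easy to test without touching the real process environment.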

Outputs land in .agentops/results/latest/:

results.json - machine-readable (versioned, stable schema)
report.md - human-readable, PR-friendly

Release evidence lands in .agentops/release/latest/:

evidence.json - machine-readable production-readiness projection
evidence.md - PR/release summary
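In a CI step it can be handy to confirm that the four files listed above exist before publishing them. The sketch below checks only for file presence, using the paths named in this README; missing_artifacts is a hypothetical helper, and nothing is assumed about what the files contain.

```python
from pathlib import Path
from typing import Dict, List

# Paths and file names as listed in the README above.
EXPECTED: Dict[str, List[str]] = {
    ".agentops/results/latest": ["results.json", "report.md"],
    ".agentops/release/latest": ["evidence.json", "evidence.md"],
}


def missing_artifacts(root: Path) -> List[str]:
    """Return the relative paths of expected output files that are absent.

    Hypothetical helper, not part of the AgentOps CLI.
    """
    missing: List[str] = []
    for folder, names in EXPECTED.items():
        for name in names:
            if not (root / folder / name).is_file():
                missing.append(f"{folder}/{name}")
    return missing


if __name__ == "__main__":
    # Run from the repository root after `agentops eval run` and
    # `agentops doctor --evidence-pack`.
    for item in missing_artifacts(Path(".")):
        print("missing:", item)
```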

Capture the first successful run as a baseline:

New-Item -ItemType Directory -Force .agentops\baseline | Out-Null
Copy-Item .agentops\results\latest\results.json .agentops\baseline\results.json
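The two PowerShell commands above are Windows-flavored. On other platforms, or inside a Python-based pipeline, the same copy could be sketched as follows. capture_baseline is a hypothetical helper; it mirrors the paths used above and does nothing beyond creating the folder and copying the file.

```python
import shutil
from pathlib import Path


def capture_baseline(root: Path = Path(".")) -> Path:
    """Copy the latest results.json into .agentops/baseline/.

    Hypothetical helper that mirrors the PowerShell commands above.
    """
    source = root / ".agentops" / "results" / "latest" / "results.json"
    target = root / ".agentops" / "baseline" / "results.json"
    if not source.is_file():
        raise FileNotFoundError(f"no results to capture: {source}")
    # Same effect as New-Item -ItemType Directory -Force.
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(source, target)
    return target
```

Raising when the source file is missing avoids silently recording an empty baseline.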

To see a visible difference, publish a new agent version whose prompt paraphrases instead of copying exact-answer requests, point agentops.yaml at the new name:version, and rerun the evaluation against the baseline:

agentops eval run --baseline .agentops/baseline/results.json

The report grows a Comparison vs Baseline section with per-metric deltas.
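To make "per-metric deltas" concrete, here is a minimal sketch of the arithmetic: current value minus baseline value for each metric present in both runs. The metric names and the flat name-to-number mapping are hypothetical; the actual results.json schema is not shown here, so this is not how the report is necessarily computed.

```python
from typing import Dict


def metric_deltas(
    baseline: Dict[str, float],
    current: Dict[str, float],
) -> Dict[str, float]:
    """Return current - baseline for every metric found in both runs.

    Hypothetical helper. Metrics that appear in only one run are skipped.
    """
    shared = sorted(set(baseline) & set(current))
    # Rounding hides floating-point noise such as 0.04999999999999993.
    return {name: round(current[name] - baseline[name], 6) for name in shared}


if __name__ == "__main__":
    # Made-up numbers purely for illustration.
    before = {"accuracy": 0.80, "latency_s": 1.5}
    after = {"accuracy": 0.85, "latency_s": 1.2, "groundedness": 0.9}
    print(metric_deltas(before, after))  # {'accuracy': 0.05, 'latency_s': -0.3}
```

Whether a positive delta is good depends on the metric: higher accuracy is an improvement, while higher latency is a regression.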

Commands

Install optional extras as needed: [agent] for Doctor/Cockpit and [mcp] for MCP.

Similar Plugins

eval-guide

muratcankoylan-evaluation

evalview

agent-eval-harness

evaluate-agent

agentic-usability

More by Azure

azure-connectorgateway

documentdb

azure-functions-skills

amg-toolkit

ai-gateway
