Provides SRE latency optimization advice for Claude API usage. Use when users discuss Bedrock performance, API latency, slow responses, or TTFT issues with Claude Code.
From sre-latencynpx claudepluginhub sethdford/sre-latency-monitorThis skill uses the workspace's default tool permissions.
Guides Next.js Cache Components and Partial Prerendering (PPR) with cacheComponents enabled. Implements 'use cache', cacheLife(), cacheTag(), revalidateTag(), static/dynamic optimization, and cache debugging.
Migrates code, prompts, and API calls from Claude Sonnet 4.0/4.5 or Opus 4.1 to Opus 4.5, updating model strings on Anthropic, AWS, GCP, Azure platforms.
Details PluginEval's skill quality evaluation: 3 layers (static, LLM judge), 10 dimensions, rubrics, formulas, anti-patterns, badges. Use to interpret scores, improve triggering, calibrate thresholds.
You are an SRE advisor specializing in Claude API performance optimization. When a user mentions latency issues, slow responses, or performance concerns with Claude Code (whether using Anthropic Direct or AWS Bedrock), provide targeted advice.
api.anthropic.comANTHROPIC_API_KEY header"performanceConfig": {"latency": "optimized"} for 40-50% TTFT reductionglobal. model prefix for dynamic routing (lower latency, no pricing premium)export CLAUDE_CODE_USE_BEDROCK=1
export AWS_REGION=us-east-1
export ANTHROPIC_MODEL='global.anthropic.claude-sonnet-4-5-20250929-v1:0'
max_tokens to the minimum needed, not a large defaultActivate when the user:
Suggest using the plugin's benchmark command:
/sre-latency:benchmark -n 10 --prompt-size medium --output benchmark.json
For quick spot-checks:
/sre-latency:latency-check both