Skill

monitoring-alerting

Firebase Crashlytics, Performance Monitoring, Cloud Logging, Cloud Monitoring alerts, and uptime checks

Install

Run in your terminal

npx claudepluginhub javimontano/jm-adk-alfa

Tool Access

This skill uses the workspace's default tool permissions.

Supporting Assets

agents/guardian.md

agents/lead.md

agents/specialist.md

agents/support.md

evals/evals.json

knowledge/body-of-knowledge.md

knowledge/knowledge-graph.md

prompts/meta.md

prompts/primary.md

prompts/variations/deep.md

prompts/variations/quick.md

templates/output.docx.md

templates/output.html

Skill Content

092 — Monitoring & Alerting {DevOps}

Purpose

Establish comprehensive observability across the application stack. Detect crashes, performance degradation, and downtime proactively through Firebase and Google Cloud monitoring tools. [EXPLICIT]

Physics — 3 Immutable Laws

Law of Observability: If it's not monitored, it's not in production. Every service (Hosting, Functions, Firestore, Auth) has active monitoring. [EXPLICIT]
Law of Alert Fatigue Prevention: Alerts fire only on actionable conditions. No informational alerts — only conditions requiring human intervention. [EXPLICIT]
Law of Mean Time to Detect: MTTD < 5 minutes for critical issues. Uptime checks run every 60 seconds. Crash reports arrive in real time. [EXPLICIT]

Protocol

Phase 1 — Crash & Error Monitoring

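Enable crash reporting: Firebase Crashlytics covers Android, Apple, Flutter, and Unity apps and, at the time of writing, has no web SDK, so web errors are reported through the error boundary below. The statement import { getPerformance } from 'firebase/performance' belongs to Performance Monitoring (Phase 2), not Crashlytics. [EXPLICIT]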
Configure global error boundary in React to report uncaught errors. [EXPLICIT]
Cloud Functions: structured logging with functions.logger — errors auto-surface in Cloud Logging. [EXPLICIT]
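Route error-level logs (severity>=ERROR) to an alert channel: use a log-based alerting policy in Cloud Monitoring, or a Cloud Logging sink to Pub/Sub when a custom handler is needed. [EXPLICIT]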
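The logging steps above depend on every error entry carrying a severity that Cloud Logging can filter on. What follows is a minimal sketch, assuming logs are written as one JSON object per line to stdout, where Cloud Functions and Cloud Run read the severity and message fields as structured log fields. The helper name toLogEntry and the orderId context field are hypothetical and used only for illustration; inside a function, functions.logger plays this role.

```typescript
// Subset of the severity names defined by Cloud Logging's LogSeverity enum.
type Severity = "DEBUG" | "INFO" | "WARNING" | "ERROR" | "CRITICAL";

interface LogEntry {
  severity: Severity;
  message: string;
  [key: string]: unknown; // extra context fields (hypothetical)
}

// Hypothetical helper: turns a caught error into one structured log line.
function toLogEntry(
  severity: Severity,
  err: unknown,
  context: Record<string, unknown> = {},
): string {
  // Prefer the stack trace when available so the entry is self-explanatory.
  const message =
    err instanceof Error ? (err.stack ?? err.message) : String(err);
  const entry: LogEntry = { ...context, severity, message };
  return JSON.stringify(entry);
}

// Example: a caught error in a request handler.
const exampleLine = toLogEntry("ERROR", new Error("payment failed"), {
  orderId: "demo-123", // hypothetical context field
});
console.log(exampleLine);
```

A Cloud Logging filter such as severity>=ERROR then selects these entries for an alert.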

Phase 2 — Performance Monitoring

Initialize Firebase Performance Monitoring SDK in app entry point. [EXPLICIT]
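Add custom traces for critical operations: trace(perf, 'checkout-flow') in the modular SDK (perf.trace('checkout-flow') in the namespaced API), calling start() and stop() around the operation. [EXPLICIT]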
Monitor Cloud Functions execution time via Cloud Monitoring metrics. [EXPLICIT]
Set performance budgets: page load < 3s, function execution < 10s. [EXPLICIT]
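The budgets above can be checked mechanically once durations are collected. This is a rough sketch of that check, not the Performance Monitoring API: it assumes durations have already been exported as plain numbers in milliseconds, and the names Measurement, findBudgetViolations, and sendReceipt are hypothetical.

```typescript
// Budgets from the protocol, in milliseconds (page load < 3s, function < 10s).
const BUDGETS_MS = {
  pageLoad: 3_000,
  functionExecution: 10_000,
} as const;

type BudgetKey = keyof typeof BUDGETS_MS;

interface Measurement {
  kind: BudgetKey;
  name: string; // hypothetical label, e.g. a trace or function name
  durationMs: number;
}

interface Violation extends Measurement {
  budgetMs: number;
  overByMs: number;
}

// Returns only the measurements that fail their budget.
// The budget is a strict "<", so a duration equal to the budget also fails.
function findBudgetViolations(samples: Measurement[]): Violation[] {
  return samples
    .filter((s) => s.durationMs >= BUDGETS_MS[s.kind])
    .map((s) => ({
      ...s,
      budgetMs: BUDGETS_MS[s.kind],
      overByMs: s.durationMs - BUDGETS_MS[s.kind],
    }));
}

const exampleViolations = findBudgetViolations([
  { kind: "pageLoad", name: "checkout-flow", durationMs: 3_400 },
  { kind: "functionExecution", name: "sendReceipt", durationMs: 1_200 },
]);
console.log(exampleViolations);
```

Keeping the budgets in one constant makes the thresholds easy to review alongside the alert policies.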

Phase 3 — Alerts & Uptime

Cloud Monitoring uptime check: HTTPS GET on production URL every 60s. [EXPLICIT]
Alert policies: error rate > 1% → email + Slack. Uptime check fails 2x → page. [EXPLICIT]
Firestore usage alert: reads > 50K/day or writes > 10K/day → email. [EXPLICIT]
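Billing alert: projected spend > budget threshold → email. Budget alerts only notify; pausing non-critical functions needs separate automation, for example a Pub/Sub-triggered function subscribed to budget notifications. [EXPLICIT]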
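The thresholds in this phase are configured as alert policies in Cloud Monitoring; the sketch below only models the decision logic so the rules can be read and reviewed in one place. The Snapshot shape and the evaluateAlerts function are assumptions made for illustration, not part of any Google Cloud API.

```typescript
type Channel = "email" | "slack" | "page";

// Hypothetical point-in-time view of the signals the policies watch.
interface Snapshot {
  requests: number;
  errors: number;
  consecutiveUptimeFailures: number;
  firestoreReadsToday: number;
  firestoreWritesToday: number;
}

interface Alert {
  reason: string;
  channels: Channel[];
}

function evaluateAlerts(s: Snapshot): Alert[] {
  const alerts: Alert[] = [];

  // Error rate > 1% routes to email and Slack.
  const errorRate = s.requests === 0 ? 0 : s.errors / s.requests;
  if (errorRate > 0.01) {
    alerts.push({
      reason: `error rate ${(errorRate * 100).toFixed(2)}%`,
      channels: ["email", "slack"],
    });
  }

  // Two consecutive failed uptime checks page the on-call person.
  if (s.consecutiveUptimeFailures >= 2) {
    alerts.push({ reason: "uptime check failed twice", channels: ["page"] });
  }

  // Firestore usage: reads > 50K/day or writes > 10K/day routes to email.
  if (s.firestoreReadsToday > 50_000 || s.firestoreWritesToday > 10_000) {
    alerts.push({
      reason: "Firestore usage above daily threshold",
      channels: ["email"],
    });
  }

  return alerts;
}

console.log(
  evaluateAlerts({
    requests: 1_000,
    errors: 20,
    consecutiveUptimeFailures: 2,
    firestoreReadsToday: 12_000,
    firestoreWritesToday: 800,
  }),
);
```

With a 60-second check interval, two consecutive failures are seen roughly two minutes after an outage begins, which leaves room inside the 5-minute MTTD target provided notification delivery is fast.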

I/O

Input	Output
Application errors/crashes	Crashlytics dashboard + alert
Page loads and custom traces	Performance Monitoring dashboard
Cloud Functions logs	Cloud Logging queries + alerts
Production URL	Uptime check status (up/down)

Quality Gates — 5 Checks

Crashlytics enabled — crash-free rate visible in Firebase Console. [EXPLICIT]
Uptime check active — 60-second interval on production URL. [EXPLICIT]
Alert channels configured — email + Slack/PagerDuty for critical alerts. [EXPLICIT]
Billing alerts set — threshold at 80% of monthly budget. [EXPLICIT]
Error rate dashboard exists — real-time error rate visible to team. [EXPLICIT]

Edge Cases

Crash in error boundary: Implement fallback logging (beacon API) for catastrophic failures.
Cold start noise: Exclude first invocation from function performance baselines.
Alert storms: Set alert cooldown period (5 min) to prevent notification flooding.
Third-party outages: Monitor external API dependencies separately from app health.
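Two of the edge cases above, alert storms and cold start noise, reduce to small pieces of logic. The following is an illustrative sketch under stated assumptions: AlertCooldown and warmBaselineMs are hypothetical names, timestamps are passed in explicitly as milliseconds, and each invocation record is assumed to already carry a coldStart flag.

```typescript
const COOLDOWN_MS = 5 * 60 * 1000; // 5-minute cooldown from the edge case above

// Suppresses repeat notifications for the same alert key within the cooldown.
class AlertCooldown {
  private lastSent = new Map<string, number>();
  private cooldownMs: number;

  constructor(cooldownMs: number = COOLDOWN_MS) {
    this.cooldownMs = cooldownMs;
  }

  shouldNotify(alertKey: string, nowMs: number): boolean {
    const last = this.lastSent.get(alertKey);
    if (last !== undefined && nowMs - last < this.cooldownMs) {
      return false; // still cooling down, drop the notification
    }
    this.lastSent.set(alertKey, nowMs);
    return true;
  }
}

// Hypothetical invocation record; coldStart is assumed to be known.
interface Invocation {
  durationMs: number;
  coldStart: boolean;
}

// Mean duration of warm invocations only, or undefined when there are none.
function warmBaselineMs(invocations: Invocation[]): number | undefined {
  const warm = invocations.filter((i) => !i.coldStart).map((i) => i.durationMs);
  if (warm.length === 0) return undefined;
  return warm.reduce((sum, d) => sum + d, 0) / warm.length;
}

const exampleCooldown = new AlertCooldown();
console.log(exampleCooldown.shouldNotify("error-rate", 0));
console.log(exampleCooldown.shouldNotify("error-rate", 60_000));
```

Keying the cooldown by alert name lets unrelated alerts fire independently while repeats of the same alert stay quiet.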

Self-Correction Triggers

Crash-free rate drops below 99.5% → immediate investigation and hotfix.
Uptime check fails → verify hosting, DNS, SSL certificate status.
Performance regression detected → run Lighthouse audit (skill 096).
Billing spike → audit Firestore queries (skill 100), check for infinite loops in functions.
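The first trigger above is a simple threshold on the crash-free rate. As a sketch, assuming the rate is computed per session as the share of sessions without a crash (the console may report it per user instead), the check could look like this; crashFreeRatePct and needsInvestigation are hypothetical names.

```typescript
const CRASH_FREE_TARGET_PCT = 99.5; // threshold from the trigger above

// Percentage of sessions that ended without a crash.
// Assumption: a session-based definition; with no sessions the rate is 100.
function crashFreeRatePct(totalSessions: number, crashedSessions: number): number {
  if (totalSessions <= 0) return 100;
  return ((totalSessions - crashedSessions) / totalSessions) * 100;
}

// True when the rate has dropped below the target and a hotfix review is due.
function needsInvestigation(totalSessions: number, crashedSessions: number): boolean {
  return crashFreeRatePct(totalSessions, crashedSessions) < CRASH_FREE_TARGET_PCT;
}

console.log(needsInvestigation(1_000, 4));
console.log(needsInvestigation(1_000, 6));
```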

Usage

Example invocations:

"/monitoring-alerting" — Run the full monitoring alerting workflow
"monitoring alerting on this project" — Apply to current context

Assumptions & Limits

Assumes access to project artifacts (code, docs, configs) [EXPLICIT]
Requires English-language output unless otherwise specified [EXPLICIT]
Does not replace domain expert judgment for final decisions [EXPLICIT]

Stats

Stars: 1

Forks: 0

Last Commit: Mar 28, 2026
