Skill

responsible-ai

Assess an AI feature or product for ethical risks, bias, safety issues, fairness gaps, and regulatory compliance. Use when reviewing an AI feature before launch, conducting a responsible AI audit, or responding to a bias or safety concern.

From pm-ai-product-management

Install

Run in your terminal

npx claudepluginhub tarunccet/pm-skills --plugin pm-ai-product-management

Tool Access

This skill uses the workspace's default tool permissions.

Skill Content

Similar Skills

ui-ux-pro-max

Provides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.

ui-ux-pro-max

57.6k

context7-mcp

Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.

context7-plugin

51.8k

payload

11 files

Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.

payload

41.6k

Stats

Parent Repo Stars1

Parent Repo Forks0

Last CommitMar 27, 2026

Actions

View Source View Plugin View on GitHub View README

Responsible AI Review

Systematically assess AI features and products for ethical risks, bias, safety issues, and regulatory compliance.

Context

You are conducting a responsible AI review for $ARGUMENTS.

Instructions

Gather context:
- What does the AI system do? What are its inputs and outputs?
- Who are the affected user populations (including vulnerable or marginalised groups)?
- What decisions or actions does the AI influence?
- What data was the model trained on?
Bias and fairness assessment:
- Demographic parity: Do outcomes differ significantly across demographic groups (gender, race, age, disability)?
- Equal opportunity: Does the model achieve similar true positive rates across groups?
- Disparate impact: Does a protected group experience a disproportionately negative outcome?
- Identify proxies for protected attributes in input features
- Recommend fairness evaluation datasets and disaggregated metric reporting
Transparency and explainability:
- Can the system explain its outputs in terms users understand?
- Are explanations required by law (e.g., EU AI Act, GDPR Article 22, FCRA)?
- Evaluate: feature importance, counterfactual explanations, confidence scores, natural language rationale
- Define what the product must communicate to users about AI involvement
Safety and harm prevention:
- Identify potential harms: physical, psychological, financial, reputational, societal
- Content moderation: Does the system generate or surface harmful content (hate speech, CSAM, dangerous instructions, misinformation)?
- Guardrails: Input filtering, output filtering, topic restrictions, role restrictions
- Red-teaming: List adversarial prompt categories to test (jailbreaks, prompt injection, persona attacks)
- Define severity × likelihood risk matrix for each identified harm
- Specify human-in-the-loop checkpoints for high-risk decisions
Privacy and data governance:
- Does the model memorise or reproduce personal data from training?
- Are user inputs used for model training? Is consent obtained?
- Define data minimisation and purpose limitation requirements
- Assess differential privacy or federated learning applicability
Environmental impact:
- Estimate training and inference carbon footprint (use tools like ML CO2 Impact)
- Compare against baseline (fine-tune vs. full train vs. API call)
- Report sustainability metrics in AI product documentation
Regulatory compliance:
- EU AI Act risk tiers: Unacceptable (banned) → High-risk → Limited-risk → Minimal-risk
  - High-risk categories: employment, credit, education, law enforcement, critical infrastructure
- NIST AI RMF: Map to Govern, Map, Measure, Manage functions
- Sector-specific: HIPAA (health), FCRA (credit), COPPA (children), FINRA (financial)
- Document compliance status and gaps
Stakeholder impact analysis:
- Identify all stakeholders: users, operators, third parties, society
- Map positive and negative impacts for each group
- Identify power imbalances (e.g., AI used by employer on employee)
- Define accountability chain: who is responsible for model decisions?
Incident response planning:
- Define AI-specific incident types: bias detection, safety bypass, harmful output, data breach
- Establish escalation path and responsible team members
- Define rollback and mitigation procedures
- Plan post-incident review process (see ai-incident-response skill)
Produce Responsible AI Review document:
- Executive summary: overall risk rating (Low / Medium / High / Critical)
- Findings table: issue, severity, affected group, recommendation, owner
- Required mitigations before launch
- Ongoing monitoring requirements
- Sign-off checklist

Risk Rating Table Template

Issue	Severity	Likelihood	Affected Group	Mitigation	Owner	Status
(e.g., Gender bias in hiring score)	High	Medium	Female applicants	Fairness re-calibration	ML Eng	Open

Think step by step. Save as markdown.