From ai-privacy-governance-skills
Assesses GDPR lawful basis for AI training data processing per EDPB 2025 LLM guidelines. Covers legitimate interest balancing, consent challenges, web scraping lawfulness, and public datasets.
`npx claudepluginhub mukul975/privacy-data-protection-skills --plugin ai-privacy-governance-skills`

This skill uses the workspace's default tool permissions.
The processing of personal data for AI model training constitutes a distinct processing operation requiring its own lawful basis under GDPR Art. 6(1). The EDPB Guidelines 04/2025 and the coordinated ChatGPT Taskforce findings establish that AI training creates unique lawful basis challenges: the scale of data collection, the difficulty of obtaining meaningful consent for open-ended AI training purposes, the tension between legitimate interest and data subject expectations, and the complexity of determining lawfulness for web-scraped and third-party datasets. This skill provides the comprehensive lawful basis assessment framework for AI training data processing, addressing each Art. 6(1) basis as applied to ML training contexts.
The EDPB has confirmed that AI model training constitutes processing of personal data under Art. 4(2) GDPR whenever the training dataset contains information relating to identified or identifiable natural persons, whether collected directly, obtained from third parties, or scraped from the web.
The controller cannot avoid GDPR obligations by claiming the model has "learned" rather than "stored" personal data. The processing occurs at the point of training, regardless of whether the model can later reproduce specific records.
Art. 5(1)(b) requires that personal data be collected for specified, explicit, and legitimate purposes. For AI training, this means the training purpose must be defined before collection and described with enough specificity that data subjects can understand what kind of model is being built and why; a blanket "AI training" purpose does not satisfy the principle. Where consent (Art. 6(1)(a)) is the chosen lawful basis, it must meet every requirement below:
| Requirement | AI Training Application |
|---|---|
| Freely given | Data subjects must have genuine choice; consent cannot be bundled with service access unless AI training is necessary for the service |
| Specific | "AI training" alone is insufficient — must specify what type of model, for what purpose, what data elements are used |
| Informed | Must explain how personal data will be used in training, retention period for training data, risk of model memorization, inability to fully delete data from trained models |
| Unambiguous | Clear affirmative action; pre-ticked boxes or implied consent from terms of service are insufficient |
| Withdrawable | Controller must provide a mechanism to withdraw consent; withdrawal after the model has already been trained on the data presents a technical challenge |
AI training can rely on contractual necessity (Art. 6(1)(b)) only when training on the data subject's personal data is objectively necessary to perform the contract with that data subject, a threshold that is rarely met.
Per the EDPB, necessity is interpreted narrowly: general service improvement and AI development are not "necessary for the performance of a contract", and the controller cannot manufacture this basis simply by writing AI training into its terms of service.
Legitimate interest (Art. 6(1)(f)) is the most commonly relied-upon basis for AI training. The EDPB requires a rigorous three-part assessment: the purpose test (identify a legitimate interest), the necessity test, and the balancing test.
The controller must identify a specific, real, and lawful interest:
| Interest Type | Example | EDPB Assessment |
|---|---|---|
| Commercial product improvement | Training a fraud detection model to protect customers | Generally legitimate — concrete benefit to data subjects |
| Research and development | Training models for medical imaging analysis | Legitimate if research purpose is genuine and specific |
| General AI capability | Training a foundation model for general-purpose use | Scrutinised — interest must be articulated with specificity |
| Competitive advantage | Training to match competitor AI capabilities | Legitimate commercial interest but weak in balancing |
The necessity test asks whether the identified interest could be achieved with less personal data, or with none:

| Question | Assessment Criteria |
|---|---|
| Is AI training necessary for the identified interest? | Could the interest be pursued without training on personal data? |
| Could anonymised data achieve the same result? | Has the controller tested model performance with anonymised data? |
| Could synthetic data supplement or replace personal data? | Has synthetic data generation been evaluated? |
| Is the volume of personal data proportionate? | Has the minimum effective dataset been determined? |
| Could federated learning avoid centralising personal data? | Has distributed training been assessed? |
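The necessity questions above can be tracked as a checklist so that unaddressed items surface before training proceeds. A sketch under the assumption that each question is recorded with a yes/no answer:

```python
# The questions from the necessity table above, recorded with documented answers.
NECESSITY_QUESTIONS = [
    "Could the interest be pursued without training on personal data?",
    "Has the controller tested model performance with anonymised data?",
    "Has synthetic data generation been evaluated?",
    "Has the minimum effective dataset been determined?",
    "Has distributed (federated) training been assessed?",
]

def necessity_gaps(answers: dict[str, bool]) -> list[str]:
    """Return every question not yet answered affirmatively,
    i.e. the open items blocking a documented necessity assessment."""
    return [q for q in NECESSITY_QUESTIONS if not answers.get(q, False)]
```

An empty result means every question has a documented affirmative answer; it does not by itself establish necessity, which remains a legal judgment.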
The balancing test weighs the controller's interest against the data subjects' interests, rights, and reasonable expectations. Factors weighing in favour of the controller include documented safeguards such as pseudonymisation, PII filtering before training, transparency measures, and an effective opt-out. Factors weighing in favour of data subjects include the sensitivity and volume of the data, the risk of model memorisation, and training uses that fall outside the context in which the data was originally shared.
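A legitimate interest assessment (LIA) is a documented legal judgment, not a calculation, but its inputs can be captured in a structured record for the file. A hypothetical sketch; every class, field, and value here is an assumption:

```python
from dataclasses import dataclass, asdict

@dataclass(frozen=True)
class BalancingFactor:
    description: str  # e.g. "PII filtering applied before training"
    favours: str      # "controller" or "data_subject"
    weight: str       # qualitative only: "low" | "medium" | "high" -- balancing is not arithmetic

def record_outcome(factors: list[BalancingFactor], conclusion: str, rationale: str) -> dict:
    """Assemble an LIA record. The conclusion is a human legal judgment,
    deliberately NOT computed from the listed factors."""
    return {
        "factors": [asdict(f) for f in factors],
        "conclusion": conclusion,
        "rationale": rationale,
    }
```

Keeping the conclusion separate from the factor list reflects that the EDPB expects a reasoned balancing, not a mechanical score.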
The EDPB Guidelines 04/2025 establish that this balancing must be carried out and documented before training begins, and that legitimate interest is available only where it resolves in the controller's favour.
The public task basis (Art. 6(1)(e)) is available only to public bodies and to organisations performing tasks in the public interest laid down by law; private AI developers generally cannot rely on it.
The EDPB has given specific guidance on web scraping for AI training: the mere public accessibility of personal data does not make scraping lawful, and the controller still needs a valid lawful basis, in practice usually legitimate interest. The following framework assesses the lawfulness of web-scraped training data:
| Factor | High Lawfulness Indicator | Low Lawfulness Indicator |
|---|---|---|
| Data source | Explicitly open-licence data (CC0, public domain) | Personal profiles, social media, private websites |
| Data type | Factual, non-personal content | Identifiable personal information, photos, opinions |
| Data subject expectations | Data published with intent for wide reuse | Data shared in specific context (social media, forums) |
| Safeguards | Differential privacy, PII filtering pre-training | No preprocessing to remove personal data |
| Opt-out | Effective and accessible opt-out mechanism | No opt-out or technically impractical opt-out |
| Transparency | Privacy notice covers AI training use | No notice to data subjects about AI training |
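The "PII filtering pre-training" safeguard in the table can be sketched as a simple regex-based scrubber. Production pipelines would add NER-based detection and far broader patterns; the three patterns below are illustrative only:

```python
import re

# Minimal illustrative PII patterns. Ordering matters: IPv4 addresses are
# substituted before the looser PHONE pattern, which would otherwise consume them.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "IPV4": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
    "PHONE": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
}

def scrub(text: str) -> str:
    """Replace matched PII spans with bracketed placeholders before training."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text
```

Regex filtering alone would sit at the "low lawfulness" end of the safeguards row; the table's "high" indicator pairs preprocessing like this with techniques such as differential privacy.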
When using datasets obtained from third parties, the controller must verify that the data was lawfully collected at source, assess whether AI training is compatible with the original collection purpose under Art. 6(4), and cannot discharge this duty by relying on the supplier's assurances alone.
Academic and government datasets require the same assessment: a research or open-data licence governs copyright and reuse, not data protection, and does not by itself establish a lawful basis for commercial AI training.
Art. 5(1)(e) storage limitation applies to AI training data: raw training datasets must not be retained indefinitely after training, and the retention period must be defined and justified in advance.
| Right | AI Training Application | Technical Challenge |
|---|---|---|
| Access (Art. 15) | Data subject can request confirmation that their data was used in training and receive a copy | Identifying specific records in large training datasets |
| Rectification (Art. 16) | Inaccurate personal data in training sets must be corrected | Correction may require model retraining |
| Erasure (Art. 17) | Data subjects can request deletion of their data from training sets | Requires machine unlearning or model retraining |
| Objection (Art. 21) | Data subjects can object to processing based on legitimate interest | Controller must cease processing unless compelling grounds override |
| Restriction (Art. 18) | Processing must be restricted while accuracy or objection is contested | May require quarantining data from training pipeline |
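Several of the rights above turn on knowing which training runs a data subject's records entered. A hypothetical provenance index sketch (class and method names are assumptions) illustrating how erasure or rectification requests could be mapped to affected model versions:

```python
from collections import defaultdict

class TrainingProvenance:
    """Record-to-training-run index so that Art. 16/17 requests can identify
    which model versions must be retrained, unlearned, or restricted."""

    def __init__(self) -> None:
        self._runs_by_subject: dict[str, set[str]] = defaultdict(set)

    def log_run(self, run_id: str, subject_ids: set[str]) -> None:
        """Register that a training run consumed records of these data subjects."""
        for sid in subject_ids:
            self._runs_by_subject[sid].add(run_id)

    def affected_runs(self, subject_id: str) -> set[str]:
        """Training runs (hence models) touched by this data subject's records."""
        return set(self._runs_by_subject.get(subject_id, set()))
```

Note that such an index is itself personal data and must carry its own lawful basis, minimisation, and retention controls.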