Identify statistical anomalies, outliers, and unusual patterns in datasets. Use when users ask to find anomalies, detect outliers, identify unusual patterns, spot irregularities, or analyze data for unexpected behavior. Supports time-series analysis, distribution-based detection, and pattern recognition for numerical and categorical data.
npx claudepluginhub ki-kyvos/kyvos-plugins --plugin kyvos

This skill uses the workspace's default tool permissions.
This skill identifies anomalies in data using multiple statistical methods. It can detect unusual values in numerical data, unexpected shifts in time-series data, and rare occurrences in categorical data.
For numeric columns, anomalies are typically values that fall far from the central tendency of the data.
This method is best for data that is approximately normally distributed. It measures how many standard deviations a data point is from the mean.
# Assumes data is in a pandas DataFrame 'df' and we're checking 'value' column
z_scores = (df['value'] - df['value'].mean()) / df['value'].std()
anomalies = df[abs(z_scores) > 3]
This method is robust to outliers and does not assume a normal distribution, making it suitable for skewed data. An anomaly is a value that falls outside the range defined by Q1 - 1.5 * IQR and Q3 + 1.5 * IQR.
# Assumes data is in a pandas DataFrame 'df' and we're checking 'value' column
Q1 = df['value'].quantile(0.25)
Q3 = df['value'].quantile(0.75)
IQR = Q3 - Q1
lower_bound = Q1 - 1.5 * IQR
upper_bound = Q3 + 1.5 * IQR
anomalies = df[(df['value'] < lower_bound) | (df['value'] > upper_bound)]
A simple method to identify extreme values by defining anomalies as values that fall in the top or bottom X% of the data.
# Identify values in the bottom 1% or top 1%
anomalies = df[(df['value'] < df['value'].quantile(0.01)) |
(df['value'] > df['value'].quantile(0.99))]
For time-series data, anomalies can be sudden spikes/dips or deviations from a recurring pattern (seasonality).
This method identifies values that deviate significantly from a rolling average, which helps smooth out short-term noise.
# Assumes 'df' has a datetime index and a 'value' column
# Calculate 7-period moving average
df['moving_average'] = df['value'].rolling(window=7).mean()
# Calculate deviation from moving average
df['deviation'] = df['value'] - df['moving_average']
# Identify points with a large deviation (e.g., > 3 standard deviations of the deviation)
anomaly_threshold = df['deviation'].std() * 3
anomalies = df[abs(df['deviation']) > anomaly_threshold]
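The seasonality case mentioned above can also be checked by comparing each point to the typical value for its position in the cycle. This is a minimal sketch, assuming daily data with a weekly cycle and a 'value' column (the sample data and cycle length are illustrative assumptions, not part of the skill):

```python
import numpy as np
import pandas as pd

# Hypothetical daily series with a weekly pattern (assumption for illustration)
rng = pd.date_range("2024-01-01", periods=56, freq="D")
values = np.tile([10, 12, 11, 13, 12, 30, 31], 8).astype(float)
values[20] = 45.0  # inject a value that breaks the recurring pattern
df = pd.DataFrame({"value": values}, index=rng)

# Compare each point to the median for its weekday (its position in the cycle)
weekday_median = df.groupby(df.index.dayofweek)["value"].transform("median")
residual = df["value"] - weekday_median
threshold = residual.std() * 3
anomalies = df[residual.abs() > threshold]
```

Unlike a moving average, this flags values that are unremarkable in absolute terms but wrong for their position in the cycle (e.g., weekend-level traffic on a weekday).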
For categorical data, anomalies are often categories that appear with unusually low frequency.
Identify categories that are rare compared to others.
# Assumes 'df' has a 'category' column
frequency = df['category'].value_counts(normalize=True)
# Identify categories that make up less than 1% of the data
rare_categories = frequency[frequency < 0.01].index.tolist()
anomalies = df[df['category'].isin(rare_categories)]
Do NOT automatically remove anomalies. Instead, flag them and surface them for review. Always report how anomalies were identified and handled.
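As a minimal sketch of such a report, using the IQR method from above (the sample data, column name, and report fields are illustrative assumptions):

```python
import pandas as pd

df = pd.DataFrame({"value": [10, 11, 9, 10, 12, 200]})  # hypothetical data

# IQR-based detection, then a summary of what was found and how
Q1, Q3 = df["value"].quantile(0.25), df["value"].quantile(0.75)
IQR = Q3 - Q1
lower, upper = Q1 - 1.5 * IQR, Q3 + 1.5 * IQR
anomalies = df[(df["value"] < lower) | (df["value"] > upper)]

report = {
    "method": "IQR (1.5 * IQR fences)",
    "bounds": (float(lower), float(upper)),
    "n_anomalies": int(len(anomalies)),
    "action": "flagged for review, not removed",
}
```

The point is that the detection method, the thresholds used, and the disposition of each anomaly all travel with the result, rather than silently altering the data.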