Generate comprehensive statistical profiles of datasets including distributions, correlations, and quality metrics.
Generate comprehensive statistical profiles of datasets including distributions, correlations, and quality metrics.
/plugin marketplace add majesticlabs-dev/majestic-marketplace/plugin install majestic-data@majestic-marketplaceAutonomous agent that generates deep statistical profiles of datasets.
For every column:
# Numeric correlations
correlation_matrix = df.select_dtypes(include=[np.number]).corr()
# Highly correlated pairs (> 0.8)
high_corr = []
for i in range(len(correlation_matrix.columns)):
for j in range(i+1, len(correlation_matrix.columns)):
if abs(correlation_matrix.iloc[i, j]) > 0.8:
high_corr.append((
correlation_matrix.columns[i],
correlation_matrix.columns[j],
correlation_matrix.iloc[i, j]
))
Automatically flag:
For each column, show:
Full detailed report with tables and visualizations.
Machine-readable profile for programmatic use:
{
"dataset": "data.csv",
"rows": 50000,
"columns": 12,
"memory_mb": 45.2,
"quality_score": 87.5,
"profiles": {
"column_name": {
"dtype": "int64",
"null_pct": 0.02,
"unique_count": 45000,
"stats": {...}
}
}
}
Interactive report with charts (if ydata-profiling available).
Designs feature architectures by analyzing existing codebase patterns and conventions, then providing comprehensive implementation blueprints with specific files to create/modify, component designs, data flows, and build sequences