From aws-data-analytics
Inventories and audits AWS Glue Data Catalog assets across S3 tables, Redshift-federated, and remote Iceberg catalogs. For listing catalogs, databases, tables and data landscape overviews.
npx claudepluginhub aws/agent-toolkit-for-aws --plugin aws-data-analytics[search-term|catalog-name|database-name|s3://bucket-path|table-name]This skill uses the workspace's default tool permissions.
Structured inventory and cataloging across your AWS data landscape: Glue Data Catalog with S3 Tables, Redshift-federated, and remote Iceberg catalogs.
Resolves data lake and lakehouse asset references across Glue Data Catalog, S3, S3 Tables, and Redshift using table names, keywords, columns, or S3 paths.
Automates Snowflake data warehouse operations: lists databases/schemas/tables with filters, executes SQL (SELECT/DDL/DML), manages workflows via Composio MCP.
Queries Databricks Unity Catalog system tables for audit, lineage, billing, compute, jobs; manages volumes with upload, download, list, and directory operations.
Share bugs, ideas, or general feedback.
Structured inventory and cataloging across your AWS data landscape: Glue Data Catalog with S3 Tables, Redshift-federated, and remote Iceberg catalogs.
Maps data in an AWS account. Starts with catalog landscape (Glue, S3 Tables, federated), then drills into databases and tables. Read-only — no query execution.
Constraints for parameter acquisition:
Pagination: All list and search calls in this workflow may return paginated results. You MUST pass --next-token from the previous response until no more tokens are returned. You MUST NOT assume a single page contains all results.
Check for required tools and AWS access before discovery.
Constraints:
aws___call_aws, aws___search_documentation) and fall back to AWS CLI if notaws sts get-caller-identityList catalogs in account:
aws glue get-catalogs --recursive --include-root
Classify each catalog by type:
| Field Present | Catalog Type | What It Contains |
|---|---|---|
Neither TargetRedshiftCatalog nor FederatedCatalog | Default (Glue) | Standard Glue databases and tables |
FederatedCatalog.ConnectionName = aws:s3tables | S3 Tables | Managed Iceberg table buckets |
TargetRedshiftCatalog | Redshift-federated | Redshift databases exposed as Glue catalogs |
FederatedCatalog with ConnectionName ≠ aws:s3tables | Remote Iceberg | External catalogs (Snowflake, Databricks, Iceberg REST) |
Constraints:
--include-root to capture default account catalogFor each catalog (or the user-specified one):
aws glue get-databases --catalog-id <catalog-id>
aws glue get-tables --database-name <db> --catalog-id <catalog-id>
For S3 Tables catalogs, also enumerate via the S3 Tables API:
aws s3tables list-table-buckets
aws s3tables list-namespaces --table-bucket-arn <arn>
aws s3tables list-tables --table-bucket-arn <arn> --namespace <ns>
Constraints:
--catalog-id accepts the catalog name (not the ARN)--catalog-id or pass the account IDFor each database, capture table count, formats, partitioning, and S3 locations. For each table of interest, capture column schemas, types, partition keys, SerDe format, and last access time.
You MUST report data formats in human-readable terms (Parquet, CSV, JSON), not raw SerDe class names.
See discovery-checklist.md for analysis framework.
Resolve the argument in this order; stop at the first match:
s3:// — S3 path (explore unregistered data, detect formats)get-catalogs) — deep dive into that catalogget-databases) — deep dive into that databaseget-tables) — detailed table analysis with schema and partitionssearch-tables)start-query-execution) during discovery; query execution belongs to querying-data-lake| Error | Cause | Fix |
|---|---|---|
| Only sub-catalogs returned, default missing | --include-root omitted | Re-run get-catalogs with --include-root |
| Federated catalog query slow or failing | Network call to remote source; connection misconfigured | Report connection errors clearly rather than silently skipping |
| S3 Tables not queryable via Athena | Tables exist in S3 Tables API but not registered in Glue | Flag as "not queryable"; suggest registration |
get-databases/get-tables fails with catalog-id | Default catalog requires omit or account ID | Omit --catalog-id or pass account ID for the default catalog |