From data-agent-kit-starter-pack
Discovers and inspects GCP data assets such as BigQuery datasets, tables, and views, Spanner databases and tables, and BigLake assets through Dataplex semantic or keyword search and CLI tools. Retrieves schemas, metadata, and governance information without requiring exact IDs.
npx claudepluginhub gemini-cli-extensions/data-agent-kit-starter-pack --plugin data-agent-kit-starter-pack
This skill uses the workspace's default tool permissions.
Dataplex Entries Lookup provides the richest metadata for data assets. You MUST prioritize using it for all Google Cloud assets, even if you already know their IDs.
If the asset is in the bigquery-public-data project, Dataplex Entries Lookup will fail. You
MUST skip Steps 2 and 3 and inspect the table directly using the bq CLI or
BigQuery MCP tools instead.

You MUST use the Dataplex search command to discover assets and retrieve their
full projects/... entry names. This step is required even if you already know
the asset's short ID (e.g., my_dataset.my_table), because Step 3 strictly
requires the full entry name.
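
The routing rule above (public datasets go to the bq CLI, everything else goes through Dataplex search) can be sketched as a small shell helper. The function name `inspection_route` is hypothetical and used only for illustration; the `bq show` call in the usage comment is one possible way to inspect a public table directly.

```shell
# Hypothetical helper: pick an inspection path from the asset's project ID.
# Prints "bq" for bigquery-public-data assets and "dataplex" for all others.
inspection_route() {
  local project_id="$1"
  if [ "$project_id" = "bigquery-public-data" ]; then
    echo "bq"
  else
    echo "dataplex"
  fi
}

# Example usage (not executed here):
#   if [ "$(inspection_route "<PROJECT_ID>")" = "bq" ]; then
#     bq show --schema --format=prettyjson "bigquery-public-data:<DATASET>.<TABLE>"
#   fi
```

Keeping the decision in one place makes it harder to accidentally run Steps 2 and 3 against a public dataset.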
[!IMPORTANT] The --project parameter MUST ALWAYS be provided. This project_id is used only to attribute the search and does NOT restrict the search scope. The project must have the Dataplex API enabled, and the user must have the dataplex.entries.get permission.
Use this when the user describes the meaning or intent of the data (e.g., "Find Q4 product sales data").
Use the search_entries MCP tool
OR
gcloud dataplex entries search "<NATURAL_LANGUAGE_QUERY>" \
--project="<PROJECT_ID>" \
--semantic-search \
--limit=50
Use this for exact keyword matches or technical strings (e.g., name:order_v2).
Syntax rules:
- Logical operators (AND, OR) MUST be UPPERCASE.
- Semantic mode: use plural labels. for label filters (e.g., labels.env=prod).
- Keyword mode: use singular label. for label filters (e.g., label.env=prod).
- Use | for OR and , for AND within parentheses to shorten queries (e.g., projectid:(prod|staging) or column:(id,name)).
- Use : for token/substring matches (e.g., name:sales).
- Use = for exact matches. REQUIRED for system, type, and location.
- Add a parent filter if the project or dataset is known (e.g., parent:projects/<PROJECT_ID>).

Supported qualifiers:
- name:x: Substring/token match on resource ID.
- displayname:x: Substring/token match on display name.
- projectid:x: Substring/token match on GCP project ID.
- parent:x: Substring match on hierarchical path (e.g., projects/my-proj).
- location=x: Exact match on location (e.g., us-central1, us).
- column:x: Substring/token match on column names in the schema.
- system=x: Exact match on source system. Common values: bigquery, storage, biglake, cloud_sql, cloud_spanner, cloud_bigtable, pubsub.
- type=x: Exact match on entry type (e.g., bigquery-table, storage-bucket, storage-folder).
- labels.key=value: (Semantic Mode ONLY) Exact match on a label.
- label.key=value: (Keyword Mode ONLY) Exact match on a label.
- createtime[>|<|=]x: Match assets created after/before date YYYY-MM-DD.
- fully_qualified_name=x: Exact match on the FQN (e.g., bigquery:project.dataset.table).

[!TIP] Dataplex search results rely on metadata being ingested into the Universal Catalog (often via Discovery Scans). If an asset is missing from search, it may not be indexed.
- Fallback 1: Try searching by the fully_qualified_name qualifier.
- Fallback 2: Use native tools (e.g., bq show, gcloud storage) or specific skills for that asset type if you already know the ID.
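
As an illustration of how the keyword qualifiers combine, the following sketch joins individual terms with an uppercase AND. The helper name `build_keyword_query` is hypothetical; it only assembles the query string and does not call gcloud.

```shell
# Hypothetical helper: join qualifier terms with uppercase AND,
# as the keyword syntax requires for logical operators.
build_keyword_query() {
  local query=""
  local term
  for term in "$@"; do
    if [ -z "$query" ]; then
      query="$term"
    else
      query="$query AND $term"
    fi
  done
  printf '%s\n' "$query"
}

# Example usage (exact matches use =, substring matches use :):
#   build_keyword_query "system=bigquery" "type=bigquery-table" "name:sales"
#   prints: system=bigquery AND type=bigquery-table AND name:sales
```

The resulting string can be passed as the `<KEYWORD_SEARCH_QUERY>` argument of the keyword search command.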
gcloud dataplex entries search "<KEYWORD_SEARCH_QUERY>" \
--project="<PROJECT_ID>" \
--limit=50
Criteria: Once candidate assets are returned, proceed to Step 3 using the full entry names from the search results.
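
To carry the full entry names forward, one option is to ask gcloud to print only the name field of each result. This is a sketch under an assumption: it presumes each search result exposes the entry name at `dataplexEntry.name`, which should be confirmed against the actual output (for example with `--format=json`). The function name `search_entry_names` is hypothetical.

```shell
# Sketch: run a keyword search and print only the full entry names.
# ASSUMPTION: each result carries the name at dataplexEntry.name;
# verify against real output before relying on this projection.
search_entry_names() {
  local project_id="$1"
  local query="$2"
  gcloud dataplex entries search "$query" \
    --project="$project_id" \
    --limit=50 \
    --format="value(dataplexEntry.name)"
}

# Example usage (not executed here):
#   search_entry_names "<PROJECT_ID>" "system=bigquery AND name:sales"
```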
You MUST use the Entries Lookup command to fetch schema and deep metadata for the relevant results obtained from Step 2.
[!IMPORTANT] The argument MUST be the name (starting with projects/) returned by the search result. Passing short table IDs, GCS URIs, or fully qualified bigquery: prefixes is PROHIBITED and will fail.
Use the lookup_entry MCP tool
OR
gcloud dataplex entries lookup "<FULL_ENTRY_NAME>"
Completion Criteria: The command returns the detailed schema and business context.
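
Because lookup only accepts full entry names, a small guard can reject other identifier shapes before gcloud is called. This is a minimal sketch; `is_full_entry_name` and `lookup_entry_checked` are hypothetical names, and the lookup invocation mirrors the command shown above.

```shell
# Hypothetical guard: accept only names that start with "projects/".
is_full_entry_name() {
  case "$1" in
    projects/*) return 0 ;;
    *) return 1 ;;
  esac
}

# Look up an entry only if the argument has the full-name shape.
# Short table IDs, GCS URIs, and bigquery: FQNs are rejected with status 2.
lookup_entry_checked() {
  local entry_name="$1"
  if ! is_full_entry_name "$entry_name"; then
    echo "error: expected a full entry name starting with projects/, got: $entry_name" >&2
    return 2
  fi
  gcloud dataplex entries lookup "$entry_name"
}

# Example usage (not executed here):
#   lookup_entry_checked "<FULL_ENTRY_NAME>"
```

The prefix check is deliberately loose: it catches the prohibited input shapes but does not validate the rest of the path.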
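Common causes of failure:
- The lookup argument is not the full entry name returned by search (starting with projects/).
- The asset belongs to a public project (e.g., bigquery-public-data) that has not fully synchronized its metadata with the Dataplex Universal Catalog. While the entry appears in search, entries lookup is unavailable. Inspect the asset with native tools instead (e.g., the bq CLI).
- The --project <PROJECT_ID> argument was not provided.
- The project passed in the --project <PROJECT_ID> argument does not have the Dataplex API enabled, or the user is missing necessary IAM permissions.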
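
When a failure looks like a disabled API, a quick preflight check on the attribution project can narrow it down. The sketch below assumes the Dataplex service name is `dataplex.googleapis.com` and that the caller is allowed to list enabled services on the project; `dataplex_api_enabled` is a hypothetical helper name. It does not check IAM permissions.

```shell
# Sketch: succeed (status 0) if the Dataplex API appears in the project's
# enabled services, fail (status 1) otherwise.
# ASSUMPTION: the service name is dataplex.googleapis.com.
dataplex_api_enabled() {
  local project_id="$1"
  local enabled
  enabled="$(gcloud services list --enabled \
    --project="$project_id" \
    --filter="config.name=dataplex.googleapis.com" \
    --format="value(config.name)")"
  [ -n "$enabled" ]
}

# Example usage (not executed here):
#   dataplex_api_enabled "<PROJECT_ID>" || echo "Dataplex API is not enabled"
```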