From honeydew-ai
Queries and analyzes data through the Honeydew semantic layer, supporting structured field queries and natural language deep analyses.
How this skill is triggered — by the user, by Claude, or both
Slash command
/honeydew-ai:queryThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Queries run against the workspace and branch set for the current session. Use `get_session_workspace_and_branch` to check the current context. If no workspace/branch is set, use `list_workspaces`, `list_workspace_branches`, and `set_session_workspace_and_branch` to select one. See the `workspace-branch` skill for the full workspace/branch tool reference.
Queries run against the workspace and branch set for the current session. Use get_session_workspace_and_branch to check the current context. If no workspace/branch is set, use list_workspaces, list_workspace_branches, and set_session_workspace_and_branch to select one. See the workspace-branch skill for the full workspace/branch tool reference.
Honeydew provides three ways to query data through the semantic layer. Each method suits a different situation — pick the right one based on how well you understand the model and how complex the question is.
| Method | Tool | Best For |
|---|---|---|
| Structured query | get_data_from_fields / get_sql_from_fields | You know the exact fields. Deterministic, full control. |
| Deep analysis | initiate_analysis + monitor_analysis | Any natural language question — simple or complex, "why", multi-step, agentic. |
| Explain a prior step | get_analysis_step_details | User asks how a specific step in a prior analysis was calculated. |
| Browse past analyses | list_analysis_chats | User wants to see past conversations or find a prior analysis. |
| Read a past conversation | get_stored_conversation | User wants to read or review the full content of a specific past conversation. |
get_data_from_fields / get_sql_from_fields)Use when:
list_entities / get_entity)detailed_listings.price by detailed_listings.room_type"Do NOT use when:
How it works:
get_data_from_fields — executes the query and returns data rowsget_sql_from_fields — returns the generated SQL without executing (useful for review, debugging, or handing off to other tools)Both take the same field parameters.
initiate_analysis + monitor_analysis)Use when:
conversation_id)User asks a data question
│
├─► User asks to explain / drill into a step from a prior analysis?
│ └─► get_analysis_step_details (step_id from monitor_analysis)
│ Returns semantic query, data results, and SQL for that step
│
├─► User wants to browse past conversations / find a prior analysis?
│ └─► list_analysis_chats (paginated list, newest first)
│
├─► User wants to read the content of a specific past conversation?
│ └─► get_stored_conversation (conversation_id from list_analysis_chats)
│
├─► Do you know the exact field names?
│ │
│ ├─► YES → get_data_from_fields (structured, deterministic)
│ │ (or get_sql_from_fields to preview SQL without executing)
│ │
│ └─► NO → initiate_analysis + monitor_analysis (plain English, any complexity)
│
└─► Plain English question / investigation / "why" / trends?
└─► initiate_analysis + monitor_analysis
A structured query uses flat field parameters to define what data to retrieve:
attributes — dimensions to group by (columns in the output), e.g. ["entity.attribute_name"]metrics — aggregated measures (SUM, COUNT, AVG, etc.), e.g. ["entity.metric_name"]filters — row-level filters applied before aggregation, e.g. ["entity.field = 'value'"]order_by — sort order for results. Each entry MUST be a quoted string, as if it were a SQL identifier — e.g. ["\"entity.field\" ASC"]. Always wrap the field reference in double quotes inside the string.domain — optional domain name for query contextlimit — max rows to return (default: 100)offset — rows to skip (for pagination)All fields use entity.field_name syntax. Cross-entity fields are supported when relations exist.
Before building a query, discover the available fields:
list_entities — see all entitiesget_entity with entity name — see its attributes, metrics, and relationsget_field with entity and field name — get detailed info about a specific fieldlist_domains — see all available domains (useful before passing domain parameter)search_model with a keyword and search_mode (OR for broad discovery, EXACT for known names) — find fields across the modelSimple metric query — total count:
Call get_data_from_fields with:
metrics: ["detailed_listings.count"]Dimension breakdown — listings by room type:
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type"]metrics: ["detailed_listings.count"]order_by: ["\"detailed_listings.count\" DESC"]Filtered query — only entire homes:
Call get_data_from_fields with:
attributes: ["detailed_listings.neighbourhood_cleansed"]metrics: ["detailed_listings.count"]filters: ["detailed_listings.room_type = 'Entire home/apt'"]order_by: ["\"detailed_listings.count\" DESC"]Cross-entity query — listings with host info:
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type", "dim_host.host_is_superhost"]metrics: ["detailed_listings.count"]order_by: ["\"detailed_listings.count\" DESC"]Using aliases — rename fields or ad-hoc expressions:
You can alias any field or ad-hoc expression using AS "alias_name". This controls the column name in the output.
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type"]metrics: ["detailed_listings.count AS \"total_listings\"", "AVG(detailed_listings.price) AS \"avg_price\""]order_by: ["\"total_listings\" DESC"]Once aliased, use the alias (not the original expression) in order_by.
Pagination — large result sets:
Call get_data_from_fields with:
attributes: (your fields)metrics: (your metrics)limit: 50 (max rows to return)offset: 100 (skip first 100 rows)Finding duplicate values:
Call get_data_from_fields with:
attributes: ["detailed_listings.host_name"]metrics: ["COUNT(detailed_listings.host_name)"]filters: ["COUNT(detailed_listings.host_name) > 1"]order_by: ["\"COUNT(detailed_listings.host_name)\" DESC"]This groups by the attribute, counts occurrences, and filters to only rows that appear more than once — surfacing duplicates.
SQL preview only:
Call get_sql_from_fields with the same field parameters to see the generated SQL without executing.
Filters use standard comparison expressions: =, >, <, IN (...), ILIKE, SEARCH(...), IS NULL, booleans, date ranges, and AND/OR combinations.
For the complete filter expression reference — including SEARCH, date handling, and type casting — see the filtering skill.
Deep analysis is a two-step async process:
Step 1 — start the analysis:
Call initiate_analysis with:
question (required): the analysis questionagent (required for new conversations): agent name — use list_agents to discover available agents and their associated domains. Workspace and branch must be set before calling list_agents (agents are workspace-scoped)conversation_id (optional): ID from a previous call, for follow-up questionsReturns a conversation_id immediately. New conversations also return a ui_url — always display this URL to the user so they can follow the analysis in the Honeydew application.
Step 2 — poll until done:
Call monitor_analysis repeatedly with the conversation_id until status is "DONE". Each call returns only new messages since the last call.
stop_reason values — what they mean:
stop_reason | Meaning | What to do |
|---|---|---|
null | Still running | Keep polling |
"DONE" | Finished normally | Read responses array for the final report |
"ASK" | Analysis paused to ask a clarifying question | Read responses for the question, answer with a new initiate_analysis on the same conversation_id |
"ABORTED" | This execution run was stopped | Not terminal. Conversation state is preserved — continue with a new initiate_analysis on the same conversation_id |
Report progress when it's meaningful to the user — not every poll, but not silently either. Use judgement:
interpretation / plan: tell the user what the analysis intends to dostep_insight with a substantive finding: post a one-liner describing what was found — e.g. "58 menu items found, top is The King Combo at $431M"When status is "DONE", the final user-facing report is in the responses array. If the user then expresses satisfaction or dissatisfaction with the result, call provide_analysis_feedback with the conversation_id:
"Good"<Reason>: <details> where Reason is one of Chart Issue, Data Issue, Wrong Judgement, or Other — e.g. "Data Issue: revenue figures don't match our BI tool"nullTrigger this when the user says things like: "that was correct", "that's wrong", "the data looks off", "great analysis", "this is incorrect because..."
# Example
initiate_analysis(question="Analyze revenue by cuisine type", agent="my_agent")
→ { conversation_id: "abc123", ui_url: "https://..." }
# Poll loop — report to user after each call that has new content
monitor_analysis(conversation_id="abc123")
→ interpretation + plan received → tell user what the analysis will do
monitor_analysis(conversation_id="abc123")
→ step insight received → report: "Top item is Indian at $1B"
monitor_analysis(conversation_id="abc123")
→ step insight received → report: "% contribution: Indian leads at 34%, followed by Italian at 22%"
...
monitor_analysis(conversation_id="abc123")
→ { status: "DONE", responses: [{ text: "..." }] }
Use conversation_id from the previous analysis to ask follow-up questions that build on prior state. Start a new conversation (omit conversation_id) when the topic changes completely — reusing a conversation for an unrelated question carries stale context into the new analysis and can skew results.
initiate_analysis(
question="Now break this down by room type — does the pattern hold?",
conversation_id="abc123"
)
Aborting stops the current execution run but preserves all conversation state — groups already computed remain available. Resume by sending a new initiate_analysis with the same conversation_id; the analysis will reuse prior results rather than restarting from scratch.
Always abort when the user asks. Initiating an abort on your own (without the user asking) is reserved for extreme cases where the analysis is clearly going badly off track — normally let it finish and redirect via a follow-up question.
When the user asks to explain, review, or drill into a specific step (e.g., "how was that calculated?", "explain the cuisine breakdown", "what happened in step 5"):
step_id from monitor_analysis progress messages (e.g. "STEP/5")get_analysis_step_details with conversation_id and step_idReturns:
References in the response can be expanded for deeper inspection:
entity.field_name) → get_field — shows the full definition, useful if the user wants to understand how a metric or attribute is calculatedget_context_item — shows the instruction or knowledge applied during that stepSimple natural language:
Complex analysis:
Step explanation (use get_analysis_step_details, not a new question):
Returns a paginated list of past analysis conversations for the current workspace, sorted newest first.
conversation_id, title, domain, agent, creation time, user feedback, and the display name of the user who created it.limit and offset for pagination.list_analysis_chats(limit=20, offset=0)
Use this when the user asks to:
Returns all messages from a past analysis conversation. Use this to read the full content of a specific conversation identified via list_analysis_chats.
is_complete in the response indicates whether the conversation reached a terminal state.with_step_ids: false (default) — returns only final responses and plan/interpretation messages. Use this for a readable summary.with_step_ids: true — also includes step_start and step_insight progress entries. Use this when you need to identify specific steps to pass to get_analysis_step_details.get_stored_conversation(conversation_id="abc123")
get_stored_conversation(conversation_id="abc123", with_step_ids=true)
For complex tasks, combine methods in sequence:
list_entities / get_entity to understand the modelget_data_from_fields for precise, targeted queriesinitiate_analysis + monitor_analysis for root cause or trend analysisUser: "Help me understand pricing patterns for Airbnb listings."
list_entities → find detailed_listingsget_entity for detailed_listings → find price, room_type, neighbourhood_cleansedget_data_from_fields → price distribution by neighbourhood for Entire homes onlyinitiate_analysis → "What factors most influence listing price? Analyze correlations with room type, location, amenities, and reviews."Use the honeydew-docs MCP tools to search the Honeydew documentation when:
Search for topics like: "queries", "perspectives", "dynamic datasets", "parameters", "query API".
To retrieve the distinct (unique) values of a field, include it in attributes and add a count metric in metrics.
Use the entity's built-in count metric (e.g., entity.count) if available, or an ad-hoc count metric using COUNT(entity.field)
on the field whose distinct values you want — never use COUNT(*).
The metric forces aggregation, which groups by the attribute and returns one row per distinct value.
Example — distinct room types:
Call get_data_from_fields with:
attributes: ["detailed_listings.room_type"]metrics: ["COUNT(detailed_listings.room_type)"]order_by: ["\"COUNT(detailed_listings.room_type)\" DESC"]This returns each unique room_type along with its count, ordered by frequency. The count is a useful bonus — it tells you how common each value is — but the key point is that the query returns one row per distinct value.
This pattern is useful for:
Present analysis results clearly — format tables, highlight key numbers, and surface what's notable, unexpected, or actionable.
If your environment has visualization tools, render visualizations when they would reveal patterns faster than text — e.g. trends, distributions, or ranked lists of 5+ items. Skip it for 2–3 numbers or a single yes/no conclusion. When it might help but isn't a clear win, offer rather than render.
list_entities / get_entity before building queries, so you reference real fieldsget_data_from_fields gives you full control and reproducible resultsget_analysis_step_details with the step_id; the response includes the semantic query, data results, and SQL for that steplimit and offset in get_data_from_fields to avoid overwhelming outputget_sql_from_fields to inspect the generated queryentity.field_name syntax in field parametersnpx claudepluginhub honeydew-ai/honeydew-ai-coding-agents-plugins --plugin honeydew-aiRuns queries against Omni Analytics' semantic layer via the Omni CLI, interprets results, and chains queries for multi-step analysis. Useful for extracting data from Omni dashboards or workbooks programmatically.
Explores Honeydew semantic layer: discovers entities/fields, sets workspace/branch context, runs simple structured queries to inspect data.
Assesses if business questions can be answered with Bauplan lakehouse data by mapping concepts to tables/columns, checking subset quality, validating semantic fit, and producing verdict with feasibility report.