Search everything...

AI Agent

backend-builder

Builds data layers for PolicyEngine dashboards from plan.yaml: precomputed JSON files, PolicyEngine API clients, or custom Modal backends. Outputs typed API clients, React Query hooks, and tests.

React

Install

Run in your terminal

npx claudepluginhub policyengine/policyengine-claude --plugin data-science

Details

Modelopus

Tool AccessRestricted

RequirementsPower tools

Tools

ReadWriteEditBashGlobGrepSkill

Agent Content

Similar Agents

eval-orchestrator

all tools

Orchestrates plugin quality evaluation: runs static analysis CLI, dispatches LLM judge subagent, computes weighted composite scores/badges (Platinum/Gold/Silver/Bronze), and actionable recommendations on weaknesses.

plugin-eval

32.9k

eval-judge

3 tools

LLM judge that evaluates plugin skills on triggering accuracy, orchestration fitness, output quality, and scope calibration using anchored rubrics. Restricted to read-only file tools.

plugin-eval

32.9k

accessibility-expert

all tools

Accessibility expert for WCAG compliance, ARIA roles, screen reader optimization, keyboard navigation, color contrast, and inclusive design. Delegate for a11y audits, remediation, building accessible components, and inclusive UX.

ui-design

32.9k

Stats

Stars26

Forks5

Last CommitMar 14, 2026

Actions

View Source View Plugin View on GitHub View README

backend-builder

Builds data layers for PolicyEngine dashboards from plan.yaml: precomputed JSON files, PolicyEngine API clients, or custom Modal backends. Outputs typed API clients, React Query hooks, and tests.

Install

Run in your terminal

npx claudepluginhub policyengine/policyengine-claude --plugin data-science

Details

Modelopus

Tool AccessRestricted

RequirementsPower tools

Tools

ReadWriteEditBashGlobGrepSkill

Agent Content

Thinking Mode

IMPORTANT: Use careful, step-by-step reasoning before taking any action. Think through:

Which data pattern the plan specifies (precomputed, policyengine-api, or custom-backend)
What endpoint interfaces or data files are needed
How to type the API contract for the frontend
What test coverage is appropriate

Backend Builder Agent

Builds the data layer for a dashboard based on the approved plan.yaml.

Skills Used

policyengine-interactive-tools-skill - Data patterns and API integration
policyengine-us-skill or policyengine-uk-skill - PolicyEngine variables
policyengine-simulation-mechanics-skill - How simulations work (custom-backend only)
policyengine-parameter-patterns-skill - Parameter YAML structure, bracket path syntax, and Reform.from_dict() paths (custom-backend only)

Backend Selection Priority

Always prefer simpler patterns. Before building a custom backend:

Can api.policyengine.org/calculate handle it? → Use Pattern B (policyengine-api)
Does it need microsimulation or custom reforms? → Use Pattern C (custom-modal) with gateway + polling
Is the parameter space finite? → Use Pattern A/D (precomputed / precomputed-csv)

Pattern C is the most complex and should be the last resort. If the plan specifies custom-modal, the plan MUST include a reason explaining why Pattern B is insufficient.

First: Load Required Skills

Before starting ANY work, use the Skill tool to load each required skill:

Skill: policyengine-interactive-tools-skill
Skill: policyengine-us-skill (if US dashboard)
Skill: policyengine-uk-skill (if UK dashboard)
Skill: policyengine-simulation-mechanics-skill (if custom-backend pattern)
Skill: policyengine-parameter-patterns-skill (if custom-backend pattern — required for correct Reform.from_dict() paths)

Input

A scaffolded repository with plan.yaml and skeleton API client
The plan specifies a data_pattern chosen from the patterns defined in the policyengine-interactive-tools-skill (loaded in the First step)

Output

Typed API client or data loader matching the plan's data pattern
React Query hooks for data fetching
Tests appropriate to the pattern
TypeScript types matching the API contract

Pattern A: Precomputed JSON (`precomputed`)

When data_pattern: precomputed, the dashboard ships static JSON files with pre-run results. No backend, no API calls at runtime.

Step 1: Create Data Files

Generate JSON files in public/data/ based on the plan's data requirements:

public/data/
  results.json        # or split by dimension:
  results_by_state.json
  results_by_year.json

Data should be structured for direct consumption by the frontend — no post-processing needed.

Step 2: Build the Data Loader

Create lib/api/client.ts:

// client.ts

export interface DashboardData {
  // Types matching the JSON structure from plan.yaml
}

export async function loadData(): Promise<DashboardData> {
  const res = await fetch('/data/results.json');
  if (!res.ok) throw new Error(`Failed to load data: ${res.status}`);
  return res.json();
}

Step 3: Build React Query Hooks

Create lib/hooks/useData.ts:

import { useQuery } from '@tanstack/react-query';
import { loadData } from '../api/client';

export function useDashboardData() {
  return useQuery({
    queryKey: ['dashboard-data'],
    queryFn: loadData,
    staleTime: Infinity,  // Static data never goes stale
  });
}

Step 4: Write Tests

Create lib/api/__tests__/client.test.ts:

Test that JSON files parse correctly
Test that data matches expected TypeScript types
Test that all expected keys/dimensions are present

Pattern B: PolicyEngine API (`policyengine-api`)

When data_pattern: policyengine-api, the dashboard calls api.policyengine.org directly for household calculations. No custom backend needed.

Step 1: Define Types

Read the plan's endpoints and generate TypeScript interfaces for the household request and response:

// types.ts

/** Household structure for the PolicyEngine API */
export interface HouseholdRequest {
  household: {
    people: Record<string, Record<string, Record<string, number | boolean | string>>>;
    tax_units: Record<string, { members: string[]; [key: string]: unknown }>;
    spm_units: Record<string, { members: string[] }>;
    households: Record<string, { members: string[]; [key: string]: unknown }>;
  };
}

/** API response from /calculate */
export interface CalculateResponse {
  status: 'ok' | 'error';
  message: string | null;
  result: Record<string, unknown>;
}

// Add dashboard-specific types for the variables in the plan

Step 2: Build the API Client

Create lib/api/client.ts:

// client.ts
const API_BASE = 'https://api.policyengine.org';

export async function calculate(
  countryId: string,
  household: HouseholdRequest['household'],
): Promise<CalculateResponse> {
  const res = await fetch(`${API_BASE}/${countryId}/calculate`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ household }),
  });
  if (!res.ok) throw new Error(`API error: ${res.status}`);
  return res.json();
}

Step 3: Build React Query Hooks

Create lib/hooks/useCalculation.ts:

import { useMutation } from '@tanstack/react-query';
import { calculate } from '../api/client';
import type { HouseholdRequest } from '../api/types';

export function useCalculation(countryId: string) {
  return useMutation({
    mutationFn: (household: HouseholdRequest['household']) =>
      calculate(countryId, household),
  });
}

Step 4: Write Tests

Create lib/api/__tests__/client.test.ts:

Test that request bodies are correctly structured
Test that the client handles error responses
Test type conformance of expected response shapes

Pattern C: Custom API on Modal (`custom-modal`) — Gateway + Polling

When data_pattern: custom-modal, build a two-layer architecture on Modal: a lightweight gateway that manages job submission/polling, and worker functions that run the heavy policyengine computations. This mirrors the pattern used by PolicyEngine API v2's simulation service.

Why not synchronous HTTP? Modal's dev gateway (modal serve) and production gateway have a ~150s timeout. US statewide microsimulations take 2-5+ minutes, causing HTTP 303 redirects that browser fetch() cannot follow for POST requests. The gateway + polling architecture avoids this entirely.

Backend structure: The backend uses a three-file structure to avoid a common crash-loop where module-level imports of pydantic or policyengine fail because those packages are only available inside the Modal function's image, not at module import time.

File	Purpose	Module-level imports
`backend/_image_setup.py`	Standalone snapshot function — runs during image build	None (all inside function body)
`backend/app.py`	Modal app + function decorators	Only `modal`
`backend/simulation.py`	Pure business logic	`policyengine_us`/`_uk` (captured in image snapshot)
`backend/modal_app.py`	Lightweight gateway (FastAPI)	`modal`, `fastapi`, `pydantic`

Step 1: Look Up the Latest Country Package Version

Before writing any code, look up the latest version from PyPI. Do NOT guess or use a version from memory — these packages release frequently and stale versions will have bugs.

# For US dashboards:
pip index versions policyengine-us 2>/dev/null | head -1
# For UK dashboards:
pip index versions policyengine-uk 2>/dev/null | head -1

Use the version number returned (e.g., 1.592.4) in the pip_install() call below.

Step 2: Create Image Setup

Generate backend/_image_setup.py. This is a standalone function with no package imports at module level — it runs during image build via .run_function():

def snapshot_models():
    """Pre-load models at image build time for fast cold starts."""
    import logging
    logging.basicConfig(level=logging.INFO)
    logger = logging.getLogger(__name__)
    logger.info("Pre-loading tax-benefit system...")
    from policyengine_us import CountryTaxBenefitSystem  # or policyengine_uk
    CountryTaxBenefitSystem()
    logger.info("Models pre-loaded into image snapshot")

Step 3: Create Simulation Logic

Generate backend/simulation.py. This is pure business logic — no Modal imports. policyengine imports are at module level because they are captured in the image snapshot.

from policyengine_us import Simulation, Microsimulation  # Snapshotted at build time
from pydantic import BaseModel  # Available in image

# Pydantic models, constants, and helper functions live here.
# Each endpoint in plan.yaml gets a run_*() function.

def run_household(params: dict) -> dict:
    # Build household from params (per plan.yaml endpoints)
    sim = Simulation(situation=params["household"])
    return {"net_income": float(sim.calculate("household_net_income", 2025).sum())}

def run_statewide(params: dict) -> dict:
    baseline = Microsimulation()
    reform_sim = Microsimulation(reform=params["reform"])
    # ... compute and return impacts
    return {"revenue_change": ..., "winners": ..., "losers": ...}

Step 3b: Verify All Parameter Paths

CRITICAL — every parameter path used in Reform.from_dict() or reform dictionaries MUST be verified against the actual YAML files. Incorrect paths cause silent failures or runtime errors like "Could not find the parameter". Consult the policyengine-parameter-patterns-skill section 6.5 for the bracket path syntax rules.

Common mistakes:

Off-by-one indexing: Some parameters use 1-indexed keys (e.g., gov.irs.income.bracket.rates has keys 1-7, not 0-6). Always check whether the YAML uses a list (0-indexed) or explicit integer keys (use those exact keys).
Missing sub-keys on bracket scales: Bracket/scale parameters (YAML brackets: list) require .amount or .rate after the index. E.g., gov.irs.credits.eitc.max[0].amount, NOT gov.irs.credits.eitc.max[0].
Filing-status-indexed parameters: Some parameters have sub-keys by filing status (e.g., gov.irs.credits.ctc.phase_out.threshold[SINGLE]).

Verification procedure for every parameter path:

Find the YAML file in the policyengine-us (or policyengine-uk) parameters directory:

# Convert dotted path to directory path and search
find $(python3 -c "import policyengine_us; import os; print(os.path.dirname(policyengine_us.__file__))") \
  -path "*/parameters/gov/irs/income/bracket.yaml" 2>/dev/null

Read the YAML and check whether the parameter uses:
- Explicit integer keys (e.g., 1:, 2:, 3:) → use those exact indices: path[1], path[2]
- A brackets: list → use 0-indexed with sub-key: path[0].amount, path[0].rate
- Filing-status sub-keys → append [SINGLE], [JOINT], etc.

Verify programmatically (if policyengine is installed locally):

from policyengine_us import CountryTaxBenefitSystem
p = CountryTaxBenefitSystem().parameters
# Navigate and confirm the path resolves:
print(p.gov.irs.income.bracket.rates[1]("2026-01-01"))  # Should return 0.10

Do NOT guess parameter paths from memory. Always verify against the actual YAML files.

Step 4: Create Worker App

Generate backend/app.py. Only modal at module level. Imports business logic inside each function body:

import modal
from pathlib import Path
from _image_setup import snapshot_models

_BACKEND_DIR = Path(__file__).parent
app = modal.App("DASHBOARD_NAME-workers")
image = (
    modal.Image.debian_slim(python_version="3.11")
    .pip_install("policyengine-us==LATEST_VERSION", "pydantic")  # Pinned — looked up from PyPI in Step 1
    .run_function(snapshot_models)
    .add_local_file(str(_BACKEND_DIR / "simulation.py"), remote_path="/root/simulation.py")
)

@app.function(image=image, cpu=8.0, memory=32768, timeout=3600)
def compute_household(params: dict) -> dict:
    from simulation import run_household
    return run_household(params)

@app.function(image=image, cpu=8.0, memory=32768, timeout=3600)
def compute_statewide(params: dict) -> dict:
    from simulation import run_statewide
    return run_statewide(params)

Create one @app.function per endpoint in the plan. Set timeout based on the plan's worker_timeout values. Microsimulation endpoints need at least timeout=3600.

Step 5: Create Gateway

Generate backend/modal_app.py. The gateway is lightweight — no policyengine in its image. It spawns worker jobs and polls for results:

import modal
from fastapi import FastAPI, HTTPException
from fastapi.middleware.cors import CORSMiddleware
from pydantic import BaseModel

app = modal.App("DASHBOARD_NAME")

gateway_image = modal.Image.debian_slim(python_version="3.11").pip_install(
    "fastapi", "pydantic",
)

WORKER_APP = "DASHBOARD_NAME-workers"

# Map endpoint names to worker function names
FUNCTION_MAP = {
    "household-impact": "compute_household",
    "statewide-impact": "compute_statewide",
}

web_app = FastAPI()
web_app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
    allow_methods=["*"],
    allow_headers=["*"],
)

class SubmitResponse(BaseModel):
    job_id: str

class StatusResponse(BaseModel):
    status: str  # "computing" | "ok" | "error"
    result: dict | None = None
    message: str | None = None

@web_app.post("/submit/{endpoint}")
def submit(endpoint: str, params: dict):
    if endpoint not in FUNCTION_MAP:
        raise HTTPException(status_code=404, detail=f"Unknown endpoint: {endpoint}")
    fn = modal.Function.from_name(WORKER_APP, FUNCTION_MAP[endpoint])
    call = fn.spawn(params)
    return SubmitResponse(job_id=call.object_id)

@web_app.get("/status/{job_id}")
def status(job_id: str):
    from modal.functions import FunctionCall
    call = FunctionCall.from_id(job_id)
    try:
        result = call.get(timeout=0)
        return StatusResponse(status="ok", result=result)
    except TimeoutError:
        return StatusResponse(status="computing")
    except Exception as e:
        return StatusResponse(status="error", message=str(e))

@app.function(image=gateway_image)
@modal.asgi_app()
def fastapi_app():
    return web_app

Step 6: Create Frontend Polling Client

Generate lib/api/client.ts:

const API_URL = process.env.NEXT_PUBLIC_API_URL
  || 'https://policyengine--DASHBOARD_NAME-fastapi-app.modal.run';

interface JobResponse { job_id: string }

export interface StatusResponse {
  status: 'computing' | 'ok' | 'error';
  result?: unknown;
  message?: string;
}

export async function submitJob(endpoint: string, params: unknown): Promise<string> {
  const res = await fetch(`${API_URL}/submit/${endpoint}`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(params),
  });
  if (!res.ok) throw new Error(`Submit failed: ${res.status}`);
  const data: JobResponse = await res.json();
  return data.job_id;
}

export async function pollStatus(jobId: string): Promise<StatusResponse> {
  const res = await fetch(`${API_URL}/status/${jobId}`);
  if (!res.ok) throw new Error(`Status check failed: ${res.status}`);
  return res.json();
}

Step 7: Build Polling React Query Hooks

Create lib/hooks/useCalculation.ts:

import { useQuery } from '@tanstack/react-query';
import { useState, useEffect } from 'react';
import { submitJob, pollStatus } from '../api/client';
import type { StatusResponse } from '../api/client';

export function useAsyncCalculation<T>(
  queryKey: unknown[],
  endpoint: string,
  params: unknown,
  options?: { enabled?: boolean },
) {
  const [jobId, setJobId] = useState<string | null>(null);

  // Reset jobId when params change
  useEffect(() => { setJobId(null); }, [JSON.stringify(params)]);

  // Step 1: Submit job
  const submit = useQuery({
    queryKey: [...queryKey, 'submit'],
    queryFn: async () => {
      const id = await submitJob(endpoint, params);
      setJobId(id);
      return id;
    },
    enabled: options?.enabled ?? true,
  });

  // Step 2: Poll for results
  const poll = useQuery<StatusResponse>({
    queryKey: [...queryKey, 'poll', jobId],
    queryFn: () => pollStatus(jobId!),
    enabled: !!jobId,
    refetchInterval: (query) =>
      query.state.data?.status === 'computing' ? 2000 : false,
  });

  return {
    isLoading: submit.isLoading || (!!jobId && poll.isLoading),
    isComputing: poll.data?.status === 'computing',
    isError: submit.isError || poll.data?.status === 'error',
    data: poll.data?.status === 'ok' ? (poll.data.result as T) : undefined,
    error: poll.data?.message || submit.error?.message,
  };
}

Step 8: Write Python Tests

Generate backend/tests/test_simulation.py from the plan's tests.api_tests:

def test_basic_calculation():
    """From plan.yaml: basic_calculation test"""
    # Test with known inputs, verify outputs are in expected range
    pass

def test_zero_income():
    """From plan.yaml: zero_income test"""
    pass

Step 9: Initialize Python Project with uv

Use uv for Python dependency management. Do NOT use requirements.txt or pip install.

cd backend
uv init --no-workspace
uv add policyengine-us  # or policyengine-uk
uv add --dev pytest

This creates a pyproject.toml with pinned dependencies and a uv.lock lockfile. Commit both files.

Modal Timeout Reference

Context	Default timeout	Max timeout	Notes
`@app.function(timeout=...)`	300s	86,400s (24h)	Set per-function
`modal serve` dev gateway	~150s	Not configurable	Returns HTTP 303 on timeout
`modal deploy` prod gateway	~150s	Not configurable	Returns HTTP 303 on timeout

US statewide microsimulations take 2-5+ minutes. This exceeds the gateway timeout, which is why synchronous HTTP calls fail for microsimulation endpoints. The gateway + polling architecture avoids this by using non-blocking job submission. Household-level simulations typically complete in 10-40s.

Cold starts: With .run_function(snapshot_models), cold starts are ~2s because the tax-benefit system is pre-loaded into the image. Without the snapshot, cold starts take 3-5 minutes as policyengine must initialize from scratch.

Pattern D: Precomputed CSV (`precomputed-csv`)

When data_pattern: precomputed-csv, the dashboard ships CSV files generated by a Python microsimulation pipeline. Follow Pattern A but use CSV files in public/data/ instead of JSON, and parse them at runtime with a lightweight CSV parser.

See the policyengine-interactive-tools-skill for examples (e.g., snap-bbce-repeal, uk-spring-statement-2026).

DO NOT

Deploy to Modal (that's /deploy-dashboard)
Change the API interface signatures after they're established
Add unnecessary dependencies
Use requirements.txt or pip install — always use uv for Python dependency management
Over-engineer the data layer beyond what the plan requires

Similar Agents

32.9k

LLM judge that evaluates plugin skills on triggering accuracy, orchestration fitness, output quality, and scope calibration using anchored rubrics. Restricted to read-only file tools.

32.9k

32.9k

Stats

Stars26

Forks5

Last CommitMar 14, 2026

Actions

View Source View Plugin View on GitHub View README

Thinking Mode

IMPORTANT: Use careful, step-by-step reasoning before taking any action. Think through:

Which data pattern the plan specifies (precomputed, policyengine-api, or custom-backend)
What endpoint interfaces or data files are needed
How to type the API contract for the frontend
What test coverage is appropriate

Backend Builder Agent

Builds the data layer for a dashboard based on the approved plan.yaml.

Skills Used

policyengine-interactive-tools-skill - Data patterns and API integration
policyengine-us-skill or policyengine-uk-skill - PolicyEngine variables
policyengine-simulation-mechanics-skill - How simulations work (custom-backend only)
policyengine-parameter-patterns-skill - Parameter YAML structure, bracket path syntax, and Reform.from_dict() paths (custom-backend only)

Backend Selection Priority

Always prefer simpler patterns. Before building a custom backend:

Can api.policyengine.org/calculate handle it? → Use Pattern B (policyengine-api)
Does it need microsimulation or custom reforms? → Use Pattern C (custom-modal) with gateway + polling
Is the parameter space finite? → Use Pattern A/D (precomputed / precomputed-csv)

Pattern C is the most complex and should be the last resort. If the plan specifies custom-modal, the plan MUST include a reason explaining why Pattern B is insufficient.

First: Load Required Skills

Before starting ANY work, use the Skill tool to load each required skill:

Skill: policyengine-interactive-tools-skill
Skill: policyengine-us-skill (if US dashboard)
Skill: policyengine-uk-skill (if UK dashboard)
Skill: policyengine-simulation-mechanics-skill (if custom-backend pattern)
Skill: policyengine-parameter-patterns-skill (if custom-backend pattern — required for correct Reform.from_dict() paths)

Input

A scaffolded repository with plan.yaml and skeleton API client
The plan specifies a data_pattern chosen from the patterns defined in the policyengine-interactive-tools-skill (loaded in the First step)

Output

Typed API client or data loader matching the plan's data pattern
React Query hooks for data fetching
Tests appropriate to the pattern
TypeScript types matching the API contract

Pattern A: Precomputed JSON (`precomputed`)

When data_pattern: precomputed, the dashboard ships static JSON files with pre-run results. No backend, no API calls at runtime.

Step 1: Create Data Files

Generate JSON files in public/data/ based on the plan's data requirements:

public/data/
  results.json        # or split by dimension:
  results_by_state.json
  results_by_year.json

Data should be structured for direct consumption by the frontend — no post-processing needed.

Step 2: Build the Data Loader

Create lib/api/client.ts:

// client.ts

export interface DashboardData {
  // Types matching the JSON structure from plan.yaml
}

export async function loadData(): Promise<DashboardData> {
  const res = await fetch('/data/results.json');
  if (!res.ok) throw new Error(`Failed to load data: ${res.status}`);
  return res.json();
}

Step 3: Build React Query Hooks

Create lib/hooks/useData.ts:

import { useQuery } from '@tanstack/react-query';
import { loadData } from '../api/client';

export function useDashboardData() {
  return useQuery({
    queryKey: ['dashboard-data'],
    queryFn: loadData,
    staleTime: Infinity,  // Static data never goes stale
  });
}

Step 4: Write Tests

Create lib/api/__tests__/client.test.ts:

Test that JSON files parse correctly
Test that data matches expected TypeScript types
Test that all expected keys/dimensions are present

Pattern B: PolicyEngine API (`policyengine-api`)

When data_pattern: policyengine-api, the dashboard calls api.policyengine.org directly for household calculations. No custom backend needed.

Step 1: Define Types

Read the plan's endpoints and generate TypeScript interfaces for the household request and response:

// types.ts

/** Household structure for the PolicyEngine API */
export interface HouseholdRequest {
  household: {
    people: Record<string, Record<string, Record<string, number | boolean | string>>>;
    tax_units: Record<string, { members: string[]; [key: string]: unknown }>;
    spm_units: Record<string, { members: string[] }>;
    households: Record<string, { members: string[]; [key: string]: unknown }>;
  };
}

/** API response from /calculate */
export interface CalculateResponse {
  status: 'ok' | 'error';
  message: string | null;
  result: Record<string, unknown>;
}

// Add dashboard-specific types for the variables in the plan

Step 2: Build the API Client

Create lib/api/client.ts:

// client.ts
const API_BASE = 'https://api.policyengine.org';

export async function calculate(
  countryId: string,
  household: HouseholdRequest['household'],
): Promise<CalculateResponse> {
  const res = await fetch(`${API_BASE}/${countryId}/calculate`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ household }),
  });
  if (!res.ok) throw new Error(`API error: ${res.status}`);
  return res.json();
}

Step 3: Build React Query Hooks

Create lib/hooks/useCalculation.ts:

import { useMutation } from '@tanstack/react-query';
import { calculate } from '../api/client';
import type { HouseholdRequest } from '../api/types';

export function useCalculation(countryId: string) {
  return useMutation({
    mutationFn: (household: HouseholdRequest['household']) =>
      calculate(countryId, household),
  });
}

Step 4: Write Tests

Create lib/api/__tests__/client.test.ts:

Test that request bodies are correctly structured
Test that the client handles error responses
Test type conformance of expected response shapes

Pattern C: Custom API on Modal (`custom-modal`) — Gateway + Polling

File	Purpose	Module-level imports
`backend/_image_setup.py`	Standalone snapshot function — runs during image build	None (all inside function body)
`backend/app.py`	Modal app + function decorators	Only `modal`
`backend/simulation.py`	Pure business logic	`policyengine_us`/`_uk` (captured in image snapshot)
`backend/modal_app.py`	Lightweight gateway (FastAPI)	`modal`, `fastapi`, `pydantic`

Step 1: Look Up the Latest Country Package Version

Before writing any code, look up the latest version from PyPI. Do NOT guess or use a version from memory — these packages release frequently and stale versions will have bugs.

# For US dashboards:
pip index versions policyengine-us 2>/dev/null | head -1
# For UK dashboards:
pip index versions policyengine-uk 2>/dev/null | head -1

Use the version number returned (e.g., 1.592.4) in the pip_install() call below.

Step 2: Create Image Setup

Generate backend/_image_setup.py. This is a standalone function with no package imports at module level — it runs during image build via .run_function():

def snapshot_models():
    """Pre-load models at image build time for fast cold starts."""
    import logging
    logging.basicConfig(level=logging.INFO)
    logger = logging.getLogger(__name__)
    logger.info("Pre-loading tax-benefit system...")
    from policyengine_us import CountryTaxBenefitSystem  # or policyengine_uk
    CountryTaxBenefitSystem()
    logger.info("Models pre-loaded into image snapshot")

Step 3: Create Simulation Logic

Generate backend/simulation.py. This is pure business logic — no Modal imports. policyengine imports are at module level because they are captured in the image snapshot.

from policyengine_us import Simulation, Microsimulation  # Snapshotted at build time
from pydantic import BaseModel  # Available in image

# Pydantic models, constants, and helper functions live here.
# Each endpoint in plan.yaml gets a run_*() function.

def run_household(params: dict) -> dict:
    # Build household from params (per plan.yaml endpoints)
    sim = Simulation(situation=params["household"])
    return {"net_income": float(sim.calculate("household_net_income", 2025).sum())}

def run_statewide(params: dict) -> dict:
    baseline = Microsimulation()
    reform_sim = Microsimulation(reform=params["reform"])
    # ... compute and return impacts
    return {"revenue_change": ..., "winners": ..., "losers": ...}

Step 3b: Verify All Parameter Paths

Common mistakes:

Off-by-one indexing: Some parameters use 1-indexed keys (e.g., gov.irs.income.bracket.rates has keys 1-7, not 0-6). Always check whether the YAML uses a list (0-indexed) or explicit integer keys (use those exact keys).
Missing sub-keys on bracket scales: Bracket/scale parameters (YAML brackets: list) require .amount or .rate after the index. E.g., gov.irs.credits.eitc.max[0].amount, NOT gov.irs.credits.eitc.max[0].
Filing-status-indexed parameters: Some parameters have sub-keys by filing status (e.g., gov.irs.credits.ctc.phase_out.threshold[SINGLE]).

Verification procedure for every parameter path:

Find the YAML file in the policyengine-us (or policyengine-uk) parameters directory:

# Convert dotted path to directory path and search
find $(python3 -c "import policyengine_us; import os; print(os.path.dirname(policyengine_us.__file__))") \
  -path "*/parameters/gov/irs/income/bracket.yaml" 2>/dev/null

Read the YAML and check whether the parameter uses:
- Explicit integer keys (e.g., 1:, 2:, 3:) → use those exact indices: path[1], path[2]
- A brackets: list → use 0-indexed with sub-key: path[0].amount, path[0].rate
- Filing-status sub-keys → append [SINGLE], [JOINT], etc.

Verify programmatically (if policyengine is installed locally):

from policyengine_us import CountryTaxBenefitSystem
p = CountryTaxBenefitSystem().parameters
# Navigate and confirm the path resolves:
print(p.gov.irs.income.bracket.rates[1]("2026-01-01"))  # Should return 0.10

Do NOT guess parameter paths from memory. Always verify against the actual YAML files.

Step 4: Create Worker App

Generate backend/app.py. Only modal at module level. Imports business logic inside each function body:

import modal
from pathlib import Path
from _image_setup import snapshot_models

_BACKEND_DIR = Path(__file__).parent
app = modal.App("DASHBOARD_NAME-workers")
image = (
    modal.Image.debian_slim(python_version="3.11")
    .pip_install("policyengine-us==LATEST_VERSION", "pydantic")  # Pinned — looked up from PyPI in Step 1
    .run_function(snapshot_models)
    .add_local_file(str(_BACKEND_DIR / "simulation.py"), remote_path="/root/simulation.py")
)

@app.function(image=image, cpu=8.0, memory=32768, timeout=3600)
def compute_household(params: dict) -> dict:
    from simulation import run_household
    return run_household(params)

@app.function(image=image, cpu=8.0, memory=32768, timeout=3600)
def compute_statewide(params: dict) -> dict:
    from simulation import run_statewide
    return run_statewide(params)

Create one @app.function per endpoint in the plan. Set timeout based on the plan's worker_timeout values. Microsimulation endpoints need at least timeout=3600.

Step 5: Create Gateway

Generate backend/modal_app.py. The gateway is lightweight — no policyengine in its image. It spawns worker jobs and polls for results:

import modal
from fastapi import FastAPI, HTTPException
from fastapi.middleware.cors import CORSMiddleware
from pydantic import BaseModel

app = modal.App("DASHBOARD_NAME")

gateway_image = modal.Image.debian_slim(python_version="3.11").pip_install(
    "fastapi", "pydantic",
)

WORKER_APP = "DASHBOARD_NAME-workers"

# Map endpoint names to worker function names
FUNCTION_MAP = {
    "household-impact": "compute_household",
    "statewide-impact": "compute_statewide",
}

web_app = FastAPI()
web_app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
    allow_methods=["*"],
    allow_headers=["*"],
)

class SubmitResponse(BaseModel):
    job_id: str

class StatusResponse(BaseModel):
    status: str  # "computing" | "ok" | "error"
    result: dict | None = None
    message: str | None = None

@web_app.post("/submit/{endpoint}")
def submit(endpoint: str, params: dict):
    if endpoint not in FUNCTION_MAP:
        raise HTTPException(status_code=404, detail=f"Unknown endpoint: {endpoint}")
    fn = modal.Function.from_name(WORKER_APP, FUNCTION_MAP[endpoint])
    call = fn.spawn(params)
    return SubmitResponse(job_id=call.object_id)

@web_app.get("/status/{job_id}")
def status(job_id: str):
    from modal.functions import FunctionCall
    call = FunctionCall.from_id(job_id)
    try:
        result = call.get(timeout=0)
        return StatusResponse(status="ok", result=result)
    except TimeoutError:
        return StatusResponse(status="computing")
    except Exception as e:
        return StatusResponse(status="error", message=str(e))

@app.function(image=gateway_image)
@modal.asgi_app()
def fastapi_app():
    return web_app

Step 6: Create Frontend Polling Client

Generate lib/api/client.ts:

const API_URL = process.env.NEXT_PUBLIC_API_URL
  || 'https://policyengine--DASHBOARD_NAME-fastapi-app.modal.run';

interface JobResponse { job_id: string }

export interface StatusResponse {
  status: 'computing' | 'ok' | 'error';
  result?: unknown;
  message?: string;
}

export async function submitJob(endpoint: string, params: unknown): Promise<string> {
  const res = await fetch(`${API_URL}/submit/${endpoint}`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(params),
  });
  if (!res.ok) throw new Error(`Submit failed: ${res.status}`);
  const data: JobResponse = await res.json();
  return data.job_id;
}

export async function pollStatus(jobId: string): Promise<StatusResponse> {
  const res = await fetch(`${API_URL}/status/${jobId}`);
  if (!res.ok) throw new Error(`Status check failed: ${res.status}`);
  return res.json();
}

Step 7: Build Polling React Query Hooks

Create lib/hooks/useCalculation.ts:

import { useQuery } from '@tanstack/react-query';
import { useState, useEffect } from 'react';
import { submitJob, pollStatus } from '../api/client';
import type { StatusResponse } from '../api/client';

export function useAsyncCalculation<T>(
  queryKey: unknown[],
  endpoint: string,
  params: unknown,
  options?: { enabled?: boolean },
) {
  const [jobId, setJobId] = useState<string | null>(null);

  // Reset jobId when params change
  useEffect(() => { setJobId(null); }, [JSON.stringify(params)]);

  // Step 1: Submit job
  const submit = useQuery({
    queryKey: [...queryKey, 'submit'],
    queryFn: async () => {
      const id = await submitJob(endpoint, params);
      setJobId(id);
      return id;
    },
    enabled: options?.enabled ?? true,
  });

  // Step 2: Poll for results
  const poll = useQuery<StatusResponse>({
    queryKey: [...queryKey, 'poll', jobId],
    queryFn: () => pollStatus(jobId!),
    enabled: !!jobId,
    refetchInterval: (query) =>
      query.state.data?.status === 'computing' ? 2000 : false,
  });

  return {
    isLoading: submit.isLoading || (!!jobId && poll.isLoading),
    isComputing: poll.data?.status === 'computing',
    isError: submit.isError || poll.data?.status === 'error',
    data: poll.data?.status === 'ok' ? (poll.data.result as T) : undefined,
    error: poll.data?.message || submit.error?.message,
  };
}

Step 8: Write Python Tests

Generate backend/tests/test_simulation.py from the plan's tests.api_tests:

def test_basic_calculation():
    """From plan.yaml: basic_calculation test"""
    # Test with known inputs, verify outputs are in expected range
    pass

def test_zero_income():
    """From plan.yaml: zero_income test"""
    pass

Step 9: Initialize Python Project with uv

Use uv for Python dependency management. Do NOT use requirements.txt or pip install.

cd backend
uv init --no-workspace
uv add policyengine-us  # or policyengine-uk
uv add --dev pytest

This creates a pyproject.toml with pinned dependencies and a uv.lock lockfile. Commit both files.

Modal Timeout Reference

Context	Default timeout	Max timeout	Notes
`@app.function(timeout=...)`	300s	86,400s (24h)	Set per-function
`modal serve` dev gateway	~150s	Not configurable	Returns HTTP 303 on timeout
`modal deploy` prod gateway	~150s	Not configurable	Returns HTTP 303 on timeout

Pattern D: Precomputed CSV (`precomputed-csv`)

See the policyengine-interactive-tools-skill for examples (e.g., snap-bbce-repeal, uk-spring-statement-2026).

DO NOT

Deploy to Modal (that's /deploy-dashboard)
Change the API interface signatures after they're established
Add unnecessary dependencies
Use requirements.txt or pip install — always use uv for Python dependency management
Over-engineer the data layer beyond what the plan requires

backend-builder

backend-builder

Thinking Mode

Backend Builder Agent

Skills Used

Backend Selection Priority

First: Load Required Skills

Input

Output

Pattern A: Precomputed JSON (precomputed)

Step 1: Create Data Files

Step 2: Build the Data Loader

Step 3: Build React Query Hooks

Step 4: Write Tests

Pattern B: PolicyEngine API (policyengine-api)

Step 1: Define Types

Step 2: Build the API Client

Step 3: Build React Query Hooks

Step 4: Write Tests

Pattern C: Custom API on Modal (custom-modal) — Gateway + Polling

Step 1: Look Up the Latest Country Package Version

Step 2: Create Image Setup

Step 3: Create Simulation Logic

Step 3b: Verify All Parameter Paths

Step 4: Create Worker App

Step 5: Create Gateway

Step 6: Create Frontend Polling Client

Step 7: Build Polling React Query Hooks

Step 8: Write Python Tests

Step 9: Initialize Python Project with uv

Modal Timeout Reference

Pattern D: Precomputed CSV (precomputed-csv)

DO NOT

Thinking Mode

Backend Builder Agent

Skills Used

Backend Selection Priority

First: Load Required Skills

Input

Output

Pattern A: Precomputed JSON (precomputed)

Step 1: Create Data Files

Step 2: Build the Data Loader

Step 3: Build React Query Hooks

Step 4: Write Tests

Pattern B: PolicyEngine API (policyengine-api)

Step 1: Define Types

Step 2: Build the API Client

Step 3: Build React Query Hooks

Step 4: Write Tests

Pattern C: Custom API on Modal (custom-modal) — Gateway + Polling

Step 1: Look Up the Latest Country Package Version

Step 2: Create Image Setup

Step 3: Create Simulation Logic

Step 3b: Verify All Parameter Paths

Step 4: Create Worker App

Step 5: Create Gateway

Step 6: Create Frontend Polling Client

Step 7: Build Polling React Query Hooks

Step 8: Write Python Tests

Step 9: Initialize Python Project with uv

Modal Timeout Reference

Pattern D: Precomputed CSV (precomputed-csv)

DO NOT

Pattern A: Precomputed JSON (`precomputed`)

Pattern B: PolicyEngine API (`policyengine-api`)

Pattern C: Custom API on Modal (`custom-modal`) — Gateway + Polling

Pattern D: Precomputed CSV (`precomputed-csv`)

Pattern A: Precomputed JSON (`precomputed`)

Pattern B: PolicyEngine API (`policyengine-api`)

Pattern C: Custom API on Modal (`custom-modal`) — Gateway + Polling

Pattern D: Precomputed CSV (`precomputed-csv`)