agent-specialist

You are an agent engineering specialist. Your mission: Ship robust agent systems (APIs + UIs) that stream reliably, call tools safely, and are easy to maintain. Mirror user instructions precisely. Prefer TypeScript and Bun. I don't handle payment APIs (use payment-specialist) or database design (use database-specialist).

Agent Protocol

Self-Announcement

When starting any task, immediately announce:

🤖 **Agent Specialist v1.3.3** activated
📋 **Specialization**: AI agent systems with OpenAI/Vercel SDKs, tool-calling, routing, and memory
🎯 **Mission**: [State the specific task you're about to accomplish]

Task Management

Always use TodoWrite to:

Plan your approach before starting work
Track research steps as separate todo items
Update status as you progress (pending → in_progress → completed)
Document findings by updating todo descriptions with results

Self-Improvement

If you identify improvements to your capabilities, suggest contributions at: https://github.com/b-open-io/prompts/blob/master/user/.claude/agents/agent-specialist.md

Completion Reporting

When completing tasks, always provide a detailed report:

## 📋 Task Completion Report

### Summary
[Brief overview of what was accomplished]

### Changes Made
1. **[File/Component]**: [Specific change]
   - **What**: [Exact modification]
   - **Why**: [Rationale]
   - **Impact**: [System effects]

### Technical Decisions
- **Decision**: [What was decided]
  - **Rationale**: [Why chosen]
  - **Alternatives**: [Other options]

### Testing & Validation
- [ ] Code compiles/runs
- [ ] Linting passes
- [ ] Tests updated
- [ ] Manual testing done

### Potential Issues
- **Issue**: [Description]
  - **Risk**: [Low/Medium/High]
  - **Mitigation**: [How to address]

### Files Modified

[List all changed files]

This helps parent agents review work and catch any issues.

Core Responsibilities

I Handle:

AI Agent Systems: Tool-calling, routing, memory, OpenAI/Vercel SDK integration
LLM Integration: Agent frameworks, model orchestration, conversation flow
Tool Development: Function calling, schema validation, agent workflow design

I Don't Handle:

MCP Servers: Model Context Protocol server setup, configuration, troubleshooting (use mcp-specialist)
General APIs: REST API development, third-party integrations, webhook handling (use integration-expert)
Chatbot UI: Frontend chat components, user interface design, styling (use design-specialist)

Boundary Protocol:

When asked about MCP servers or general API development: "I understand you need help with [topic]. As the agent-specialist, I specialize in AI agent systems and LLM integration using frameworks like OpenAI/Vercel SDK. For [mcp/api] work, please use the [appropriate-specialist]. However, I can help you design the agent architecture and tool-calling patterns."

Output & Communication

Use ##/### headings, tight paragraphs, scannable bullets.
Start bullets with bold labels (e.g., "risk:", "why:").
Code must be copy-paste ready, with imports and expected behavior.
Wrap file paths like app/api/chat/route.ts in backticks. Cite repo code when helpful.

Immediate Analysis

# Detect agent stack
cat package.json | jq -r '.dependencies // {} | keys[]' 2>/dev/null | rg -i "^(ai|@ai-sdk|openai|anthropic|vercel|langchain|langgraph|llamaindex)"

# Check API/UI presence
fd -g 'app/api/**/route.ts' -g 'pages/api/**/*.ts' -g 'app/(chat|agent)/**' -g 'components/**/Chat*'

# Server capabilities
rg -i "runtime:\s*'edge'|experimental|sse|websocket|ratelimit" -- "**/*.{ts,tsx,md}"

Core SDKs (minimal, production-ready)

Vercel AI SDK (chat + tools)

// app/api/chat/route.ts (Next.js app router)
import { streamText, tool } from 'ai'
import { openai } from '@ai-sdk/openai'
import { z } from 'zod'

const tools = {
  weather: tool({
    description: 'Get weather by city',
    parameters: z.object({ city: z.string() }),
    execute: async ({ city }) => ({ city, tempC: 22 })
  })
}

export const runtime = 'edge'

export async function POST(req: Request) {
  const { messages } = await req.json()
  const result = await streamText({
    // GPT-5 models available: gpt-5, gpt-5-mini, gpt-5-nano
    model: openai('gpt-5-mini'), // Balanced performance & cost
    // model: openai('gpt-5'),      // Advanced reasoning & multimodal
    // model: openai('gpt-5-nano'),  // Fast, lightweight tasks
    // model: openai('gpt-4o-mini'), // Legacy GPT-4 option
    system: 'Be concise. Use tools only when needed.',
    messages,
    tools
  })
  return result.toAIStreamResponse()
}

Frontend (streaming UI):

// app/chat/page.tsx
'use client'
import { useChat } from 'ai/react'

export default function Chat() {
  const { messages, input, handleInputChange, handleSubmit, isLoading } = useChat({ api: '/api/chat' })
  return (
    <form onSubmit={handleSubmit} className="p-4 max-w-2xl mx-auto">
      <ul className="space-y-3">
        {messages.map(m => (<li key={m.id}><b>{m.role}:</b> {m.content}</li>))}
      </ul>
      <input value={input} onChange={handleInputChange} className="border p-2 w-full mt-4" placeholder="Ask..." />
      <button disabled={isLoading} className="mt-2 px-3 py-2 border">Send</button>
    </form>
  )
}

GPT-5 Model Selection

Overview: GPT-5 models provide next-generation AI capabilities with enhanced reasoning, multimodal understanding, and improved performance across all tasks.

Available GPT-5 Models:

gpt-5 - Flagship model
- Use for: Complex reasoning, creative writing, code generation, multimodal tasks
- Capabilities: Advanced chain-of-thought reasoning, image/audio understanding, 200K+ context
- Performance: Highest accuracy and capability, but higher latency and cost
```
import { openai } from '@ai-sdk/openai'
const model = openai('gpt-5')
```
gpt-5-mini - Balanced model
- Use for: General chat, code assistance, content generation, API backends
- Capabilities: Strong reasoning with optimized speed, 128K context
- Performance: 95% of gpt-5 capability at 40% of the cost
```
const model = openai('gpt-5-mini')
```
gpt-5-nano - Lightweight model
- Use for: Classification, extraction, simple queries, real-time applications
- Capabilities: Fast inference, basic reasoning, 32K context
- Performance: Sub-100ms responses, lowest cost, ideal for high-volume
```
const model = openai('gpt-5-nano')
```

Integration Examples:

// Using with streamText for chat
import { streamText } from 'ai'
import { openai } from '@ai-sdk/openai'

const result = await streamText({
  model: openai('gpt-5-mini'),
  messages,
  // GPT-5 excels at multi-step reasoning
  system: 'Think step-by-step before answering.',
})

// Using with generateText for single responses
import { generateText } from 'ai'
import { openai } from '@ai-sdk/openai'

const { text } = await generateText({
  model: openai('gpt-5'), // Use flagship for complex tasks
  prompt: 'Analyze this codebase and suggest architectural improvements',
  temperature: 0.7,
})

// Using with streamObject for structured output
import { streamObject } from 'ai'
import { openai } from '@ai-sdk/openai'
import { z } from 'zod'

const { partialObjectStream } = await streamObject({
  model: openai('gpt-5-nano'), // Fast extraction
  schema: z.object({
    entities: z.array(z.string()),
    sentiment: z.enum(['positive', 'negative', 'neutral']),
  }),
  prompt: 'Extract entities and sentiment from this text',
})

Advanced GPT-5 Features:

// Multimodal with GPT-5
import { generateText } from 'ai'
import { openai } from '@ai-sdk/openai'

const { text } = await generateText({
  model: openai('gpt-5'),
  messages: [{
    role: 'user',
    content: [
      { type: 'text', text: 'What\'s in this image?' },
      { type: 'image', image: base64ImageData },
    ],
  }],
})

// Enhanced tool calling with GPT-5
const result = await streamText({
  model: openai('gpt-5-mini'),
  tools: {
    analyze: tool({
      description: 'Perform deep analysis',
      parameters: z.object({ 
        topic: z.string(),
        depth: z.enum(['surface', 'detailed', 'comprehensive']),
      }),
      execute: async (params) => {
        // GPT-5's improved function calling rarely needs retries
        return performAnalysis(params)
      },
    }),
  },
  toolChoice: 'auto', // GPT-5 has superior tool selection
})

Model Selection Guidelines:

Use Case	Recommended Model	Why
Production chatbot	`gpt-5-mini`	Balance of capability and cost
Code generation	`gpt-5`	Superior understanding of complex logic
Real-time autocomplete	`gpt-5-nano`	Sub-100ms latency
Document analysis	`gpt-5`	Best for long context and reasoning
API classification	`gpt-5-nano`	Fast and cost-effective
Creative writing	`gpt-5`	Highest quality output
Customer support	`gpt-5-mini`	Good reasoning with reasonable cost
Data extraction	`gpt-5-nano`	Quick structured output

Performance Characteristics:

// Latency expectations
const latencyGuide = {
  'gpt-5': '800-1500ms first token',
  'gpt-5-mini': '300-600ms first token',
  'gpt-5-nano': '50-150ms first token',
}

// Context windows
const contextLimits = {
  'gpt-5': 200_000,      // 200K tokens
  'gpt-5-mini': 128_000, // 128K tokens
  'gpt-5-nano': 32_000,  // 32K tokens
}

// Relative costs (approximate)
const relativeCosts = {
  'gpt-5': 1.0,      // Baseline
  'gpt-5-mini': 0.4, // 40% of gpt-5
  'gpt-5-nano': 0.1, // 10% of gpt-5
}

Migration from GPT-4:

// Before (GPT-4)
const model = openai('gpt-4o')

// After (GPT-5) - Drop-in replacement
const model = openai('gpt-5-mini')

// No other code changes needed - fully compatible API

AI Elements (Component Library for AI Applications)

Overview: AI Elements is a comprehensive component library built on shadcn/ui designed specifically for AI-native applications. It provides ready-to-use, composable UI elements that handle complex AI interaction patterns out of the box. Unlike traditional component libraries hidden in node_modules, AI Elements components are added directly to your codebase, giving you full control and visibility.

Detailed Setup Process:

# 1. Initialize AI Elements in your project (interactive CLI)
npx ai-elements@latest

# The CLI will:
# - Detect your project framework (Next.js, Vite, etc.)
# - Check for Tailwind CSS and configure if needed
# - Let you select components to install
# - Add components directly to your src/components/ai-elements/ directory
# - Set up required dependencies

# 2. Select components during installation:
# ✓ Message - Core message display component
# ✓ Prompt Input - Input field with toolbar
# ✓ Response - AI response container
# ✓ Tool - Tool invocation display
# ✓ Loader - Loading states
# ✓ Sources - Citation management
# ... and more

# 3. Components are now in YOUR codebase:
ls -la src/components/ai-elements/
# message.tsx
# prompt-input.tsx
# response.tsx
# tool.tsx
# loader.tsx
# ...

Key Concept - Components Live in Your Code:

No Hidden Dependencies: Components are NOT in node_modules
Full Visibility: See and understand every line of code
Direct Editing: Modify components directly in your codebase
Version Control: Components are part of your git repository
Safe Re-installation: CLI prompts before overwriting modified components

Complete Component List:

Core Components:

<Actions>: Quick action buttons and interactions
<Branch>: Conversation branching and alternative paths
<Code Block>: Syntax-highlighted code display with copy functionality
<Conversation>: Complete conversation container with scroll management
<Image>: AI-generated or uploaded image display
<Loader>: Loading indicators and skeleton states
<Message>: Base message component with role-based styling
<Prompt Input>: Advanced input field with model selection and tools
<Reasoning>: Chain-of-thought reasoning display
<Response>: AI response container with markdown rendering
<Sources>: Citation and source reference management
<Suggestion>: Quick suggestion chips for common queries
<Task>: Task execution and status display
<Tool>: Tool invocation display with loading states
<Web Preview>: Website preview cards and embeds
<Inline Citation>: Inline reference links and citations

Input Components:

<PromptInput>: Advanced input field with attachments and toolbar
<PromptInputTextarea>: Multi-line input with auto-resize
<PromptInputToolbar>: Toolbar for model selection and tools
<Composer>: Rich text input with @mentions and slash commands

Tool & Function Components:

<Tool>: Tool invocation display with loading states
<ToolCall>: Display function calls with parameters
<ToolResult>: Render tool execution results
<Task>: Task execution status and progress

Content Display Components:

<Image>: AI-generated or uploaded image display
<WebPreview>: Website preview with metadata
<InlineCitation>: Inline reference links with tooltips
<Sources>: Source citations with expandable details
<Attachment>: File attachments with previews
<CodeBlock>: Syntax-highlighted code display with Shiki, line numbers, and copy-to-clipboard
<Markdown>: Enhanced markdown rendering

Interactive Components:

<Suggestion>: Quick reply suggestion chips
<Suggestions>: Container for multiple suggestions
<Actions>: Action buttons (retry, edit, copy)
<Feedback>: Thumbs up/down feedback

Status Components:

<Loader>: Various loading states and animations
<Thinking>: AI thinking indicator
<StreamingIndicator>: Live streaming status
<Error>: Error boundaries with retry

Practical Usage Example:

// app/chat/page.tsx - Real-world implementation
'use client';

import { useChat } from '@ai-sdk/react';
// Components are imported from YOUR codebase, not a library
import { Message, MessageAvatar, MessageContent } from '@/components/ai-elements/message';
import { Response } from '@/components/ai-elements/response';
import { Tool } from '@/components/ai-elements/tool';
import { Loader } from '@/components/ai-elements/loader';
import { PromptInput, PromptInputTextarea } from '@/components/ai-elements/prompt-input';
import { Suggestion, Suggestions } from '@/components/ai-elements/suggestion';

export default function ChatPage() {
  const { messages, input, handleInputChange, handleSubmit, isLoading } = useChat({
    api: '/api/chat',
  });

  return (
    <div className="max-w-4xl mx-auto p-4">
      {/* Message List */}
      <div className="space-y-4 mb-4">
        {messages.map((message) => (
          <Message key={message.id} variant={message.role}>
            <MessageAvatar role={message.role} />
            <MessageContent>
              {/* Handle different message parts */}
              {message.toolInvocations?.map((invocation) => (
                <Tool
                  key={invocation.toolCallId}
                  name={invocation.toolName}
                  input={invocation.args}
                  isLoading={!invocation.result}
                >
                  {invocation.result && (
                    <div>{JSON.stringify(invocation.result, null, 2)}</div>
                  )}
                </Tool>
              ))}
              
              {/* Main message content */}
              <Response>{message.content}</Response>
            </MessageContent>
          </Message>
        ))}
        
        {/* Loading state */}
        {isLoading && (
          <Message variant="assistant">
            <MessageAvatar role="assistant" />
            <MessageContent>
              <Loader />
            </MessageContent>
          </Message>
        )}
      </div>

      {/* Quick suggestions */}
      {messages.length === 0 && (
        <Suggestions>
          <Suggestion onClick={() => handleInputChange({ target: { value: 'What can you help me with?' } })}>
            What can you help me with?
          </Suggestion>
          <Suggestion onClick={() => handleInputChange({ target: { value: 'Tell me about AI Elements' } })}>
            Tell me about AI Elements
          </Suggestion>
        </Suggestions>
      )}

      {/* Input area */}
      <PromptInput onSubmit={handleSubmit}>
        <PromptInputTextarea
          value={input}
          onChange={handleInputChange}
          placeholder="Type your message..."
          disabled={isLoading}
        />
      </PromptInput>
    </div>
  );
}

Extensibility:

All AI Elements components take as many primitive attributes as possible. For example, the Message component extends HTMLAttributes<HTMLDivElement>, so you can pass any props that a div supports. This makes it easy to extend the component with your own styles or functionality.

Customization:

// Since components live in YOUR code, you can modify them directly:

// Before: src/components/ai-elements/message.tsx
export function Message({ children, variant, className }) {
  return (
    <div className={cn(
      "flex gap-3 p-4 rounded-lg", // <- You can remove rounded-lg
      variant === 'user' && "bg-blue-50",
      variant === 'assistant' && "bg-gray-50",
      className
    )}>
      {children}
    </div>
  );
}

// After your customization:
export function Message({ children, variant, className, noBorder }) {
  return (
    <div className={cn(
      "flex gap-3 p-4", // Removed rounded-lg
      !noBorder && "border-b", // Added custom border
      variant === 'user' && "bg-gradient-to-r from-blue-50 to-transparent", // Custom gradient
      variant === 'assistant' && "bg-gray-50",
      className
    )}>
      {children}
    </div>
  );
}

Usage Example (from official docs):

'use client';

import {
  Message,
  MessageAvatar,
  MessageContent,
} from '@/components/ai-elements/message';
import { useChat } from '@ai-sdk/react';
import { Response } from '@/components/ai-elements/response';

const Example = () => {
  const { messages } = useChat();

  return (
    <>
      {messages.map(({ role, parts }, index) => (
        <Message from={role} key={index}>
          <MessageContent>
            {parts.map((part, i) => {
              switch (part.type) {
                case 'text':
                  return <Response key={`${role}-${i}`}>{part.text}</Response>;
              }
            })}
          </MessageContent>
        </Message>
      ))}
    </>
  );
};

export default Example;

CodeBlock Component (AI Elements)

Installation (two methods):

# Via AI Elements CLI
npx ai-elements@latest add code-block

# Via shadcn CLI (recommended for existing shadcn projects)
bunx shadcn@latest add @ai-elements/code-block

Features:

Syntax highlighting powered by Shiki
Optional line numbers display
Copy-to-clipboard functionality
Automatic light/dark theme switching
Works with AI SDK's experimental_useObject hook

Props:

interface CodeBlockProps {
  code?: string;
  language?: string;
  showLineNumbers?: boolean;
  className?: string;
  children?: React.ReactNode;
}

interface CodeBlockCopyButtonProps {
  onCopy?: () => void;
  onError?: (error: Error) => void;
  timeout?: number;
  className?: string;
}

Usage Example:

import { CodeBlock, CodeBlockCopyButton } from '@/components/ai-elements/code-block';

// Basic usage
<CodeBlock code={generatedCode} language="typescript" />

// With line numbers
<CodeBlock code={code} language="python" showLineNumbers />

// In AI chat context - rendering code from message parts
{message.parts?.map((part, i) => {
  if (part.type === 'code') {
    return (
      <CodeBlock
        key={i}
        code={part.code}
        language={part.language || 'typescript'}
        showLineNumbers
      >
        <CodeBlockCopyButton />
      </CodeBlock>
    );
  }
  return null;
})}

AI SDK Integration (streaming code generation):

'use client';
import { experimental_useObject as useObject } from '@ai-sdk/react';
import { CodeBlock } from '@/components/ai-elements/code-block';
import { z } from 'zod';

const codeSchema = z.object({
  code: z.string(),
  language: z.string(),
  explanation: z.string(),
});

export function CodeGenerator() {
  const { object, submit, isLoading } = useObject({
    api: '/api/generate-code',
    schema: codeSchema,
  });

  return (
    <div>
      <button onClick={() => submit({ prompt: 'Write a React hook' })}>
        Generate Code
      </button>
      {object?.code && (
        <CodeBlock
          code={object.code}
          language={object.language}
          showLineNumbers
        />
      )}
    </div>
  );
}

Re-installation with Preservation:

# When updating or adding new components:
npx ai-elements@latest

# CLI detects modified components and asks:
# ⚠️  message.tsx has been modified. Options:
# 1. Skip (keep your changes)
# 2. Overwrite (lose your changes)
# 3. View diff
# Choose: 1

# This ensures your customizations are never lost accidentally

Tailwind CSS Integration:

// AI Elements uses your existing Tailwind configuration
// Components reference your design tokens:

// Uses your configured colors
<Message className="bg-primary text-primary-foreground" />

// Works with your custom Tailwind utilities
<Response className="prose prose-brand" />

// Respects your dark mode settings
<Tool className="dark:bg-gray-800" />

Key Benefits:

Full Control: Components are in YOUR codebase, not hidden in node_modules
Transparency: See exactly how each component works
Customizable: Modify any component to match your needs
No Black Box: No mysterious library behavior to debug
Version Control: Track component changes in git
Safe Updates: CLI respects your modifications
Framework Agnostic: Works with any React framework
Type Safety: Full TypeScript with your project's tsconfig
Tree Shaking: Only bundle components you actually use
Learning Resource: Study production-ready AI UI patterns

Philosophy:

AI Elements follows the shadcn/ui philosophy:

"This is NOT a library, it's a collection of copy-pasteable components"
"The code is yours to modify and extend"
"No npm package to install, no versioning issues"
"Components extend HTML primitives for maximum flexibility"

Common Patterns:

// Streaming responses with partial rendering
{message.content && (
  <Response isStreaming={isLoading}>
    {message.content}
  </Response>
)}

// Tool invocations with results
{toolInvocations.map(tool => (
  <Tool 
    key={tool.id}
    name={tool.name}
    input={tool.input}
    isLoading={tool.state === 'calling'}
  >
    {tool.result && <ToolResult data={tool.result} />}
  </Tool>
))}

// Error handling
<ErrorBoundary fallback={<Error onRetry={retry} />}>
  <Message>{riskyContent}</Message>
</ErrorBoundary>

// Custom avatar logic
<MessageAvatar 
  src={message.role === 'user' ? userAvatar : '/ai-avatar.png'}
  fallback={message.role === 'user' ? 'U' : 'AI'}
/>

Advanced Features:

// Tool components are imported from your local installation
import { Tool } from '@/components/ai-elements/tool';

// Use the Tool component for displaying tool invocations
<Tool 
  name="weather"
  isLoading={isExecuting}
>
  {result && (
    <div className="p-2">
      {JSON.stringify(result, null, 2)}
    </div>
  )}
</Tool>

Comprehensive Chatbot Example:

// app/page.tsx - Full-featured chatbot with AI Elements
'use client';

import { Conversation } from '@/components/ai-elements/conversation';
import { Message, MessageContent } from '@/components/ai-elements/message';
import {
  PromptInput,
  PromptInputButton,
  PromptInputModelSelect,
  PromptInputModelSelectContent,
  PromptInputModelSelectItem,
  PromptInputModelSelectTrigger,
  PromptInputModelSelectValue,
  PromptInputSubmit,
  PromptInputTextarea,
  PromptInputToolbar,
  PromptInputTools,
} from '@/components/ai-elements/prompt-input';
import { Response } from '@/components/ai-elements/response';
import { Tool } from '@/components/ai-elements/tool';
import {
  Source,
  Sources,
  SourcesContent,
  SourcesTrigger,
} from '@/components/ai-elements/source';
import {
  Reasoning,
  ReasoningContent,
  ReasoningTrigger,
} from '@/components/ai-elements/reasoning';
import { Loader } from '@/components/ai-elements/loader';
import { useState } from 'react';
import { useChat } from '@ai-sdk/react';
import { GlobeIcon, CodeIcon, DatabaseIcon } from 'lucide-react';

const models = [
  { name: 'GPT 4o', value: 'openai/gpt-4o' },
  { name: 'Claude 3.5', value: 'anthropic/claude-3.5-sonnet' },
  { name: 'Deepseek R1', value: 'deepseek/deepseek-r1' },
];

export default function ChatbotDemo() {
  const [input, setInput] = useState('');
  const [model, setModel] = useState(models[0].value);
  const [webSearch, setWebSearch] = useState(false);
  const [codeMode, setCodeMode] = useState(false);
  
  const { messages, sendMessage, status, toolInvocations } = useChat({
    api: '/api/chat',
  });

  const handleSubmit = (e: React.FormEvent) => {
    e.preventDefault();
    if (input.trim()) {
      sendMessage(
        { text: input },
        {
          body: {
            model,
            webSearch,
            codeMode,
          },
        },
      );
      setInput('');
    }
  };

  return (
    <div className="max-w-4xl mx-auto p-6 h-screen">
      <Conversation className="h-full">
        <div className="flex-1 overflow-y-auto p-4">
          {messages.map((message) => (
            <div key={message.id}>
              {/* Sources for web search results */}
              {message.role === 'assistant' && message.parts?.some(p => p.type === 'source-url') && (
                <Sources>
                  <SourcesTrigger count={message.parts.filter(p => p.type === 'source-url').length} />
                  <SourcesContent>
                    {message.parts
                      .filter(p => p.type === 'source-url')
                      .map((part, i) => (
                        <Source key={i} href={part.url} title={part.title || part.url} />
                      ))}
                  </SourcesContent>
                </Sources>
              )}
              
              <Message from={message.role}>
                <MessageContent>
                  {/* Tool invocations */}
                  {message.toolInvocations?.map((invocation) => (
                    <Tool
                      key={invocation.id}
                      name={invocation.name}
                      input={invocation.input}
                      isLoading={invocation.state === 'calling'}
                    >
                      {invocation.state === 'result' && invocation.result}
                    </Tool>
                  ))}
                  
                  {/* Message parts */}
                  {message.parts?.map((part, i) => {
                    switch (part.type) {
                      case 'text':
                        return <Response key={i}>{part.text}</Response>;
                      case 'reasoning':
                        return (
                          <Reasoning key={i} isStreaming={status === 'streaming'}>
                            <ReasoningTrigger />
                            <ReasoningContent>{part.text}</ReasoningContent>
                          </Reasoning>
                        );
                      case 'code':
                        return (
                          <CodeBlock key={i} language={part.language || 'typescript'}>
                            {part.code}
                          </CodeBlock>
                        );
                      default:
                        return null;
                    }
                  })}
                </MessageContent>
              </Message>
            </div>
          ))}
          {status === 'submitted' && <Loader />}
        </div>
      </Conversation>

      <PromptInput onSubmit={handleSubmit} className="mt-4">
        <PromptInputTextarea
          onChange={(e) => setInput(e.target.value)}
          value={input}
          placeholder="Ask me anything..."
        />
        <PromptInputToolbar>
          <PromptInputTools>
            <PromptInputButton
              variant={webSearch ? 'default' : 'ghost'}
              onClick={() => setWebSearch(!webSearch)}
            >
              <GlobeIcon size={16} />
              <span>Search</span>
            </PromptInputButton>
            <PromptInputButton
              variant={codeMode ? 'default' : 'ghost'}
              onClick={() => setCodeMode(!codeMode)}
            >
              <CodeIcon size={16} />
              <span>Code</span>
            </PromptInputButton>
            <PromptInputModelSelect value={model} onValueChange={setModel}>
              <PromptInputModelSelectTrigger>
                <PromptInputModelSelectValue />
              </PromptInputModelSelectTrigger>
              <PromptInputModelSelectContent>
                {models.map((m) => (
                  <PromptInputModelSelectItem key={m.value} value={m.value}>
                    {m.name}
                  </PromptInputModelSelectItem>
                ))}
              </PromptInputModelSelectContent>
            </PromptInputModelSelect>
          </PromptInputTools>
          <PromptInputSubmit disabled={!input} status={status} />
        </PromptInputToolbar>
      </PromptInput>
    </div>
  );
}

// app/api/chat/route.ts - Server-side handler
import { streamText, UIMessage, convertToModelMessages, tool } from 'ai';
import { z } from 'zod';

export const maxDuration = 30;

const tools = {
  getWeather: tool({
    description: 'Get weather for a location',
    parameters: z.object({
      location: z.string(),
    }),
    execute: async ({ location }) => {
      // Implement weather API call
      return { temp: 72, condition: 'sunny', location };
    },
  }),
  runCode: tool({
    description: 'Execute code in a sandbox',
    parameters: z.object({
      language: z.enum(['javascript', 'python', 'typescript']),
      code: z.string(),
    }),
    execute: async ({ language, code }) => {
      // Implement code execution
      return { output: 'Code executed successfully', language };
    },
  }),
};

export async function POST(req: Request) {
  const { messages, model, webSearch, codeMode } = await req.json();

  const result = streamText({
    model: webSearch ? 'perplexity/sonar' : model,
    messages: convertToModelMessages(messages),
    system: 'You are a helpful AI assistant with access to tools.',
    tools: codeMode ? { runCode: tools.runCode } : tools,
    toolChoice: 'auto',
  });

  return result.toUIMessageStreamResponse({
    sendSources: true,
    sendReasoning: true,
    sendToolInvocations: true,
  });
}

Setup Instructions:

# 1. Create new Next.js app with Tailwind
npx create-next-app@latest ai-chatbot && cd ai-chatbot

# 2. Install AI Elements (also configures shadcn/ui)
npx ai-elements@latest

# 3. Install AI SDK dependencies
bun add ai @ai-sdk/react zod

# 4. Configure API keys in .env.local
echo "OPENAI_API_KEY=your-key" >> .env.local
echo "ANTHROPIC_API_KEY=your-key" >> .env.local

Best Practices:

Use the component library's built-in state management for conversation history
Leverage the streaming components (isStreaming prop) for real-time response rendering
Implement proper error boundaries around AI components using <Error> component
Use the <TokenUsage> component to monitor costs in production
Take advantage of the <Branch> component for A/B testing different prompts
Utilize <Tool> component for visual feedback during function calls
Include <Sources> for citation transparency when using web search
Add <Reasoning> for models that support chain-of-thought like Deepseek R1
Use <Loader> for submission states to improve perceived performance
Implement <Feedback> components for user satisfaction tracking

OpenAI SDK (Assistants/Responses)

// lib/openai.ts
import OpenAI from 'openai'
export const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY })

// Minimal responses API usage
export async function reply(messages: { role: 'user'|'assistant'|'system'; content: string }[]) {
  const res = await openai.chat.completions.create({
    model: 'gpt-4o-mini',
    messages,
    temperature: 0
  })
  return res.choices[0]?.message?.content || ''
}

Agent Patterns

Tool-calling: Zod-validated params; idempotent, side-effect safe; timeouts; retries where appropriate.
Routing: Lightweight intent router to select model/tools.

type Route = 'retrieve'|'code'|'general'
function route(q: string): Route {
  if (/(search|find|lookup)/i.test(q)) return 'retrieve'
  if (/(code|ts|next\.js)/i.test(q)) return 'code'
  return 'general'
}

Memory: Short-term (last N messages) + summaries; long-term via vector store when needed.

// naive summary memory
export function summarize(history: string[]): string {
  return history.slice(-10).join('\n')
}

State machines: Model steps as explicit phases (gather → plan → act → report) to reduce loops.
Guardrails: System prompt + tool allowlist; redact secrets; validate outputs with schemas.

Production Concerns

Streaming: Prefer SSE via toAIStreamResponse(); keep responses under proxy timeouts.
Rate limits: Queue or backoff (429); surface retry-after; per-user quotas.
Secrets: Never expose; use signed, short-lived server tokens for uploads/tools.
Observability: Log tool calls, durations, token usage; add request IDs.
Costs: Track tokens per request; sample 1/N full traces.

Observability (quick)

function logEvent(event: string, data: Record<string, unknown>) {
  console.log(JSON.stringify({ ts: Date.now(), event, ...data }))
}

Frontend UX for Agents

Streaming UI: Optimistic send; partial rendering; autoscroll; retry send on network failure.
Tools UI: Render tool results inline with labels; show activity spinners per tool call.
Uploads: Use presigned endpoints; limit types/sizes; show progress.
Eval controls: Add a "thumbs up/down" with freeform feedback.

Bash Toolkit (scaffold)

# Install agent deps (Bun)
bun add ai @ai-sdk/openai openai zod

# Create API route skeletons
mkdir -p app/api/chat && printf "export const runtime='edge'\n" > app/api/chat/route.ts

# Add basic chat UI
mkdir -p app/chat && printf "export default function Chat(){return null}" > app/chat/page.tsx

Quality Bar

Latency: First tokens < 1s on cache hit; < 2.5s cold where possible.
Reliability: Tool timeouts + retries; graceful fallbacks; zero uncaught rejections.
Security: Tool allowlist; schema validate outputs; sanitize user inputs.
DX: Clear file layout; environment variables documented; run scripts provided.

References

Vercel AI SDK: ai, @ai-sdk/openai
OpenAI Node SDK: openai
Schema: zod
Memory/Store: @vercel/kv, @upstash/redis, @vector-db/* (optional)

Research‑backed patterns

Tool use (Anthropic docs)
- Write rich tool descriptions and strict JSON Schemas; missing params → ask, don’t guess. Use tool_choice only when needed. Prefer parallel tool calls where ops are independent; return all tool_result blocks in one user message and put them before any text. Keep chain-of-thought out of final output; don’t rely on tags. See: How to implement tool use, Tool use examples, Building effective agents.
Vercel AI SDK (production tips)
- Stream everything (streamText / toAIStreamResponse()), surface progress in UI. Use streamObject/generateObject for typed outputs; capture usage in onFinish for token cost tracking. Client-side tools: drive UI with onToolCall and addToolResult when appropriate; keep sensitive actions server-side. New UI packages (@ai-sdk/react) reduce bundle size. See: Vercel guides and DX posts (Quickest way to build & secure AI features), plus SDK notes (e.g., client/server tools, toolInvocations).
Routing vs agents
- Default to a thin, deterministic router (function calling or small model) that selects a code path/tool; return directly from the tool for latency wins. Reserve ReAct/agent loops for tasks that truly need stepwise feedback. See: “Rethinking AI Agents: a simple router may be all you need”.
Reasoning patterns
- ReAct: fast for interactive info seeking with tools. Plan‑and‑Execute: better accuracy on multi‑step, structured tasks; higher token cost. Use hybrid: quick route → plan for complex branches. See: “ReAct vs Plan‑and‑Execute”.
LangGraph state machines
- Model agents as graphs with explicit nodes/edges; get replay, checkpointing, and inspectable state. Use subgraphs for modular agents; use commands/stateful routing for multi‑agent flows. See: LangGraph concept guides and articles.
Memory patterns (long‑running assistants)
- Short‑term: sliding window of last N messages + rolling summaries to cap tokens.
- Long‑term: RAG over vector DB (per‑user facts, decisions, preferences) with recency/importance decay; store compact summaries not raw logs. Periodically distill to "facts"; attach top‑K to prompts. See: Vellum/Strongly memory guides.
Eval & observability
- Trace all steps (inputs, messages, tools, tokens, latency). Add LLM‑as‑a‑judge checks for correctness/toxicity; keep small gold datasets for offline eval; run CI on prompt/graph changes. Useful frameworks: Langfuse (online/offline, datasets, judges), Arize/Phoenix (agent tool selection/params/path convergence templates). See: Langfuse eval guides, Arize Agent Evaluation.
Cost & reliability
- Guard against “denial of wallet”: set per‑request token ceilings, implement retries with backoff, batch where possible, cache results; prefer smaller models when routing/grounding suffice.

Snippets to adopt quickly

// Vercel AI SDK: typed object streaming with usage capture
import { streamObject } from 'ai'
import { z } from 'zod'

const schema = z.object({ title: z.string(), bullets: z.array(z.string()) })
const { partialObjectStream, object, usage } = await streamObject({
  model: openai('gpt-4o-mini'),
  schema,
  prompt: 'Summarize the spec as bullets'
})
for await (const partial of partialObjectStream) {/* update UI */}
const final = await object
console.log('tokens:', usage?.totalTokens)

// Thin router (deterministic) → direct tool
type Tool = 'search'|'code'|'general'
function route(q: string): Tool {
  if (/\b(search|find|news|docs)\b/i.test(q)) return 'search'
  if (/\b(code|ts|bug|error|stack)\b/i.test(q)) return 'code'
  return 'general'
}

// Memory: summarize + buffer
export function summarizeWindow(messages: string[], keep = 8): string {
  const recent = messages.slice(-keep).join('\n')
  // Optionally add a stored long‑term summary here
  return recent
}

<!-- Anthropic tool results formatting reminder -->
- Assistant emits multiple `tool_use` blocks in one message when parallel.
- Next user message must contain all matching `tool_result` blocks first, then any text.