
Banana Claude
AI image generation skill for Claude Code where Claude acts as Creative Director using Google's Gemini Nano Banana models.
Unlike simple API wrappers, Claude interprets your intent, selects domain expertise, constructs optimized prompts using Google's official 5-component formula, and orchestrates Gemini for the best possible results.

Blog: See banana-claude in action
Installation
Plugin Install (Recommended)
Add the marketplace and install:
/plugin marketplace add AgriciDaniel/banana-claude
/plugin install banana-claude@banana-claude-marketplace
Or test locally:
git clone --depth 1 https://github.com/AgriciDaniel/banana-claude.git
claude --plugin-dir ./banana-claude
Standalone Install (without plugin system)
git clone --depth 1 https://github.com/AgriciDaniel/banana-claude.git
bash banana-claude/install.sh
One-liner (curl):
curl -fsSL https://raw.githubusercontent.com/AgriciDaniel/banana-claude/main/install.sh | bash
With MCP Setup:
git clone --depth 1 https://github.com/AgriciDaniel/banana-claude.git
cd banana-claude
./install.sh --with-mcp YOUR_API_KEY
Get a free API key at Google AI Studio.
Quick Start
# Start Claude Code
claude
# Generate an image
/banana generate "a hero image for a coffee shop website"
# Edit an existing image
/banana edit ~/photo.png "remove the background"
# Multi-turn creative session
/banana chat
# Browse 2,500+ prompt database
/banana inspire
Claude will ask about your brand, select the right domain mode (Cinema, Product, Portrait, Editorial, UI, Logo, Landscape, Infographic, Abstract), construct a detailed prompt with lighting and composition, set the right aspect ratio, and generate.

Commands
| Command | Description |
|---|---|
| `/banana` | Interactive -- Claude detects intent and guides you |
| `/banana generate <idea>` | Full Creative Director pipeline |
| `/banana edit <path> <instructions>` | Intelligent image editing |
| `/banana chat` | Multi-turn visual session (maintains consistency) |
| `/banana inspire [category]` | Browse 2,500+ prompt database |
| `/banana batch <idea> [N]` | Generate N variations (default: 3) |
| `/banana setup` | Configure MCP and API key |
| `/banana preset [list\|create\|show\|delete]` | Manage brand/style presets |
| `/banana cost [summary\|today\|estimate]` | View cost tracking and estimates |
How It Works

What Makes This Different
- Intent Analysis -- Understands what you actually need (blog header? app icon? product shot?)
- Domain Expertise -- Selects the right creative lens (Cinema, Product, Portrait, Editorial, UI, Logo, Landscape, Infographic, Abstract)
- 5-Component Prompt Formula -- Constructs prompts with Subject + Action + Location/Context + Composition + Style (includes lighting)
- Prompt Adaptation -- Translates patterns from a curated database of 2,500+ prompts into Gemini's natural language format
- Post-Processing -- Crops, removes backgrounds, converts formats, resizes for platforms
- Batch Variations -- Generates N variations rotating different components
- Session Consistency -- Maintains character/style across multi-turn conversations
- 4K Resolution Output -- Up to 4096×4096 with imageSize control
- 14 Aspect Ratios -- Including ultra-wide 21:9 for cinematic compositions
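
As a rough illustration of two items in the list above, the 5-component formula and batch variations, here is a minimal Python sketch. It is not banana-claude's actual implementation: the `PromptComponents` class, the `variations` helper, and the example wording are all hypothetical, and the rotation strategy (change one component per variation, cycling through the supplied alternatives) is an assumption about what "rotating different components" could look like.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class PromptComponents:
    """Hypothetical container for the five components of the formula."""
    subject: str
    action: str
    location: str     # Location/Context
    composition: str
    style: str        # style, including lighting

    def render(self) -> str:
        """Join the five components into one natural-language prompt."""
        return (
            f"{self.subject} {self.action} {self.location}. "
            f"{self.composition}. {self.style}."
        )


def variations(
    base: PromptComponents,
    alternatives: dict[str, list[str]],
    n: int = 3,
) -> list[PromptComponents]:
    """Return n variants of `base`, each changing exactly one component.

    Fields are visited in the order they appear in `alternatives`; once every
    field has been used, the next option for each field is taken.
    """
    fields = list(alternatives)
    result = []
    for i in range(n):
        field = fields[i % len(fields)]
        options = alternatives[field]
        choice = options[(i // len(fields)) % len(options)]
        result.append(replace(base, **{field: choice}))
    return result


# Example values are invented for illustration.
base = PromptComponents(
    subject="A ceramic cup of latte art",
    action="steaming",
    location="on a walnut counter in a sunlit cafe",
    composition="Wide 16:9 hero shot with negative space on the left",
    style="Warm editorial photography, soft morning window light",
)

alternatives = {
    "composition": [
        "Overhead flat lay, centered subject",
        "Close-up at a 45-degree angle, shallow depth of field",
    ],
    "style": ["Moody low-key photography, single tungsten key light"],
}

batch = variations(base, alternatives, n=3)
for variant in batch:
    print(variant.render())
```

Keeping the components as separate fields until the final `render()` call is what makes rotation cheap: each variation swaps a single field and leaves the other four untouched.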
The 5-Component Prompt Formula

Instead of sending "a cat in space" to Gemini, Claude constructs: