Skill

nano-banana

AI image generation using Nano Banana PRO (Gemini 3 Pro Image) and Nano Banana (Gemini 2.5 Flash Image). Use this skill when: (1) Generating images from text prompts, (2) Editing existing images, (3) Creating professional visual assets like infographics, logos, product shots, stickers, (4) Working with character consistency across multiple images, (5) Creating images with accurate text rendering, (6) Any task requiring AI-generated visuals. Triggers on: 'generate image', 'create image', 'make a picture', 'design a logo', 'create infographic', 'AI image', 'nano banana', or any image generation request.

npx claudepluginhub horuz-ai/claude-plugins --plugin google

Tool Access

This skill uses the workspace's default tool permissions.

Preview

Generate professional AI images using Google's Nano Banana models via the Gemini API.

Supporting Assets

references/api-reference.mdreferences/prompting-guide.mdscripts/generate-image.shscripts/generate_image.py

SKILL.md

Similar Skills

generate-image

247

Generates AI images from text prompts, edits images, and composes from multiple references using Gemini models. Supports t2i, i2i, product mockups, and stickers.

4 files3 tools

claude-content

Nano Banana Prompting

111

Guides interactive prompt crafting for Nano Banana Pro Gemini image generation via reference gathering, intent-clarifying questions, and technique selection.

1 file

handbook-nano-banana

banana

234

Generates and edits AI images using Google Gemini Nano Banana models. Orchestrates text-to-image, image editing, batch workflows, presets, and creative sessions via /banana or auto-triggers on image requests.

13 files

banana-claude

Stats

Parent Repo Stars3

Parent Repo Forks2

Last CommitJan 1, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Nano Banana PRO Image Generation

Generate professional AI images using Google's Nano Banana models via the Gemini API.

Prerequisites

API key must be set as GEMINI_API_KEY environment variable
Uses curl for all API calls (no SDK required)

Model Selection

Model	Identifier	Best For
Nano Banana PRO	`gemini-3-pro-image-preview`	Professional assets, text rendering, infographics, 4K output, complex multi-turn editing
Nano Banana	`gemini-2.5-flash-image`	Fast generation, simple edits, lower cost

Default to PRO for quality work. Use Flash for rapid iterations or simple tasks.

CRITICAL: Prompt Engineering First

BEFORE calling the API, always craft an effective prompt. Read references/prompting-guide.md for comprehensive prompting strategies. Key principles:

The Golden Rules

Describe scenes, don't list keywords - Write narrative descriptions, not tag soup
Use natural language - Full sentences with proper grammar
Be specific - Define subject, setting, lighting, mood, materials
Provide context - The "why" helps the model make better artistic decisions
Edit, don't re-roll - If 80% correct, ask for specific changes

The ICS Framework (Quick Reference)

For any image, specify:

Image type: What kind of visual (photo, infographic, logo, sticker, etc.)
Content: Specific elements, data, or information to include
Style: Visual style, color palette, artistic approach

API Reference

Text-to-Image Generation

curl -s -X POST \
  "https://generativelanguage.googleapis.com/v1beta/models/gemini-3-pro-image-preview:generateContent" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [{
      "parts": [{"text": "YOUR_PROMPT_HERE"}]
    }],
    "generationConfig": {
      "responseModalities": ["TEXT", "IMAGE"],
      "imageConfig": {
        "aspectRatio": "16:9",
        "imageSize": "2K"
      }
    }
  }'

Image Editing (with input image)

curl -s -X POST \
  "https://generativelanguage.googleapis.com/v1beta/models/gemini-3-pro-image-preview:generateContent" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [{
      "parts": [
        {"text": "YOUR_EDIT_INSTRUCTION"},
        {"inline_data": {"mime_type": "image/png", "data": "BASE64_IMAGE_DATA"}}
      ]
    }],
    "generationConfig": {
      "responseModalities": ["TEXT", "IMAGE"]
    }
  }'

Configuration Options

Parameter	Values	Notes
`aspectRatio`	`1:1`, `2:3`, `3:2`, `3:4`, `4:3`, `4:5`, `5:4`, `9:16`, `16:9`, `21:9`	Match use case
`imageSize`	`1K`, `2K`, `4K`	Use uppercase K; PRO model only for 4K

Google Search Grounding (Real-time Data)

Add "tools": [{"google_search": {}}] to generate images based on current information:

curl -s -X POST \
  "https://generativelanguage.googleapis.com/v1beta/models/gemini-3-pro-image-preview:generateContent" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [{"parts": [{"text": "Create an infographic of current tech stock prices"}]}],
    "tools": [{"google_search": {}}],
    "generationConfig": {
      "responseModalities": ["TEXT", "IMAGE"],
      "imageConfig": {"aspectRatio": "16:9"}
    }
  }'

Workflow

Step 1: Craft the Prompt

Use the ICS framework and prompting guide. Examples:

Photorealistic:

A photorealistic close-up portrait of an elderly Japanese ceramicist with deep wrinkles and a warm smile, inspecting a tea bowl. Soft golden hour light from a window. 85mm lens, shallow depth of field. Serene mood.

Infographic:

Create a clean, modern infographic explaining photosynthesis as a recipe. Show "ingredients" (sunlight, water, CO2) and "finished dish" (energy). Style like a colorful kids' cookbook page.

Product Shot:

High-resolution studio photograph of a matte black ceramic coffee mug on polished concrete. Three-point softbox lighting, 45-degree angle, sharp focus on rising steam. Square format.

Step 2: Generate Image

Use scripts/generate-image.sh or call API directly:

./scripts/generate-image.sh "Your prompt here" output.png --ratio 16:9 --size 2K

Step 3: Process Response

The API returns base64-encoded image data. Extract and decode:

# Response contains: {"candidates":[{"content":{"parts":[{"inlineData":{"mimeType":"image/png","data":"BASE64..."}}]}}]}
# Extract with jq and decode:
cat response.json | jq -r '.candidates[0].content.parts[] | select(.inlineData) | .inlineData.data' | base64 -d > image.png

Common Use Cases

Landing Pages & Ads

Use 16:9 or 21:9 for hero images
Specify brand colors, modern/minimal style
Include text requirements in prompt

Logos & Icons

Use 1:1 aspect ratio
Request "minimalist", "clean lines", "vector-style"
Specify color scheme explicitly

Product Photography

Describe lighting setup (softbox, natural, studio)
Mention surface/background materials
Include camera angle and lens type

Infographics

Define data to visualize
Specify style (corporate, playful, technical)
Request clear text and labeled sections

Stickers & Illustrations

Request "bold outlines", "kawaii", "cel-shading"
Specify "white background" or "transparent background"
Define color palette

Character Consistency (Multiple Images)

PRO supports up to 14 reference images
Explicitly state: "Keep facial features exactly the same as Image 1"
Describe expression/pose changes while maintaining identity

Scripts

See scripts/generate-image.sh for a ready-to-use generation script.

Detailed Prompting Guide

For advanced techniques including:

Photorealistic scene templates
Text rendering best practices
Sequential art and storyboarding
Dimensional translation (2D↔3D)
Search grounding for real-time data

Read references/prompting-guide.md.