Generate and edit images using AI models via OpenRouter. Supports Nano Banana Pro (Gemini 3 Pro Image), FLUX, and other image generation models.
Generate and edit images using AI models like Gemini 3 Pro Image and FLUX via OpenRouter. Claude will use this when you need visual assets, concept art, or to modify existing images with natural language prompts.
/plugin marketplace add flight505/nano-banana/plugin install nano-banana@nano-bananaThis skill is limited to using the following tools:
__init__.pyscripts/__init__.pyscripts/generate_image.pyGenerate and edit images using state-of-the-art AI models. Perfect for creating visual assets, concept art, illustrations, and editing existing images.
Key Features:
Use this skill when you need:
Note: For technical diagrams (architecture, flowcharts, ERD), use the diagram skill insteadβit includes quality review and iteration.
# Generate a new image
python skills/image/scripts/generate_image.py "A beautiful sunset over mountains with orange and purple sky" -o sunset.png
# Edit an existing image
python skills/image/scripts/generate_image.py "Make the sky more dramatic with storm clouds" --input sunset.png -o dramatic_sunset.png
# Use a specific model
python skills/image/scripts/generate_image.py "Abstract geometric art in blue and gold" -m "black-forest-labs/flux.2-pro" -o abstract.png
| Model | ID | Capabilities | Best For |
|---|---|---|---|
| Gemini 3 Pro Image | google/gemini-3-pro-image-preview | Generation + Editing | High quality, versatile |
| FLUX Pro | black-forest-labs/flux.2-pro | Generation + Editing | Fast, artistic |
| FLUX Flex | black-forest-labs/flux.2-flex | Generation only | Development/testing |
# Photorealistic
python generate_image.py "Professional headshot of a business executive in modern office setting" -o headshot.png
# Artistic
python generate_image.py "Watercolor painting of a cozy coffee shop on a rainy day" -o coffee_shop.png
# Abstract
python generate_image.py "Abstract visualization of data flowing through neural networks, blue and cyan colors" -o neural_flow.png
# Product
python generate_image.py "Modern minimalist logo for a tech startup called 'Nexus', clean geometric design" -o logo.png
# Change colors
python generate_image.py "Change the car color to red" --input car.jpg -o red_car.png
# Add elements
python generate_image.py "Add a rainbow in the sky" --input landscape.jpg -o rainbow_landscape.png
# Remove elements
python generate_image.py "Remove the person from the background" --input photo.jpg -o clean_photo.png
# Style transfer
python generate_image.py "Make this look like a watercolor painting" --input photo.jpg -o watercolor.png
# PNG (default, best for graphics with transparency)
python generate_image.py "Icon of a rocket ship" -o rocket.png
# Output to specific directory
python generate_image.py "Banner image" -o assets/images/banner.png
export OPENROUTER_API_KEY='your_api_key_here'
Create a .env file in your project:
OPENROUTER_API_KEY=your_api_key_here
from skills.image.scripts.generate_image import generate_image
# Generate new image
result = generate_image(
prompt="A futuristic city at night with neon lights",
output_path="city.png",
model="google/gemini-3-pro-image-preview"
)
# Edit existing image
result = generate_image(
prompt="Add flying cars to the scene",
output_path="city_with_cars.png",
input_image="city.png"
)
# β Too vague
"A dog"
# β
Detailed
"A golden retriever puppy playing in autumn leaves, warm afternoon sunlight, shallow depth of field, professional pet photography"
# β
Specify artistic style
"A mountain landscape in the style of traditional Japanese ink painting, minimalist, black and white with subtle gray tones"
# β
Include framing
"Close-up portrait of an owl, centered composition, soft studio lighting, dark background, sharp focus on the eyes"
# β Vague edit
"Make it better"
# β
Specific edit
"Increase the contrast, make the colors more vibrant, and add a subtle vignette effect"
| Aspect | image Skill | diagram Skill |
|---|---|---|
| Use Case | Photos, art, illustrations | Technical diagrams |
| Quality Review | No | Yes (Gemini 3 Pro) |
| Iteration | Single pass | Smart iteration (1-2 passes) |
| Doc Types | N/A | 13 document types with thresholds |
| Image Editing | Yes | No |
| Best For | Creative visuals | Architecture, flowcharts, ERD |
Rule of thumb: If it's a technical diagram with boxes, arrows, and labels β use diagram. If it's a photo, illustration, or artistic image β use image.
Set the environment variable or create a .env file. See Configuration section.
Make sure the input image path is correct and the file exists.
Large or complex images may take up to 2 minutes. Timeout is set to 120 seconds.
Master authentication and authorization patterns including JWT, OAuth2, session management, and RBAC to build secure, scalable access control systems. Use when implementing auth systems, securing APIs, or debugging security issues.