Use when the user asks to "generate an image", "create an image", "make a picture of", "draw", "design", "illustrate", "edit this image", "modify this photo", "change the background", or needs any kind of image generation or editing. Also triggers for "create a logo", "make a thumbnail", "generate artwork", "mockup", or visual asset creation. Uses Google Gemini's native image generation (text-to-image and image editing).
npx claudepluginhub oliverames/ames-claude --plugin ames-standalone-skillsThis skill is limited to using the following tools:
Create and edit images using Google's Gemini image generation API. Supports text-to-image
Provides Ktor server patterns for routing DSL, plugins (auth, CORS, serialization), Koin DI, WebSockets, services, and testApplication testing.
Conducts multi-source web research with firecrawl and exa MCPs: searches, scrapes pages, synthesizes cited reports. For deep dives, competitive analysis, tech evaluations, or due diligence.
Provides demand forecasting, safety stock optimization, replenishment planning, and promotional lift estimation for multi-location retailers managing 300-800 SKUs.
Create and edit images using Google's Gemini image generation API. Supports text-to-image generation, image editing with reference images, and resolution control up to 4K.
# Text to image
uv run ~/.claude/skills/generate-image/generate_image.py -p "description" -f output.png
# Higher resolution
uv run ~/.claude/skills/generate-image/generate_image.py -p "description" -f output.png -r 2K
# Edit an existing image
uv run ~/.claude/skills/generate-image/generate_image.py -p "make the sky purple" -f output.png -i source.png
# Multiple reference images
uv run ~/.claude/skills/generate-image/generate_image.py -p "combine these styles" -f output.png -i ref1.png -i ref2.png
-i flag)Generate an image purely from a text description. Best for:
-i flag)Modify existing images using text instructions. Pass one or more reference images:
-i photo.png — edit a single image-i ref1.png -i ref2.png — combine or reference multiple imagesBest for:
When using reference images, resolution auto-detects from the first image (overridable with -r).
| Flag | Size | Best for |
|---|---|---|
-r 1K | ~1024px (default) | Quick drafts, social media, thumbnails |
-r 2K | ~2048px | Blog posts, presentations, detailed work |
-r 4K | ~4096px | Print, high-detail artwork, large displays |
Higher resolution takes longer and uses more API quota. Start with 1K for iteration, then regenerate at higher res when you're happy with the result.
The prompt is everything. A vague prompt gets a generic image; a specific prompt gets what you actually want.
"A [subject] in [style], [specific details], [mood/lighting/color]"
Weak: "a cat" Better: "a tabby cat sitting on a windowsill, watercolor style, warm afternoon light, soft pastel colors"
| Use case | Prompt pattern |
|---|---|
| Logo | "Minimal logo for [brand], [style], on white background, clean lines" |
| Thumbnail | "YouTube thumbnail showing [scene], bold and eye-catching, vibrant colors" |
| Social media | "[Subject], Instagram-style, bright and saturated, square composition" |
| Hero image | "Wide banner showing [scene], professional, [mood], 16:9 aspect ratio" |
| Icon | "App icon of [concept], flat design, rounded corners, single color palette" |
| Concept art | "[Scene description], concept art style, atmospheric, detailed environment" |
open output.png or let the user see it via MEDIA: outputThe script saves a PNG file to the specified path and prints:
MEDIA: /path/to/output.png — for automatic display in supported environmentsTo show the user the result:
open output.png # macOS: opens in Preview
GEMINI_API_KEY environment variable (Google Gemini API key)google-genai, pillow) are auto-installed by uv rungemini-3-pro-image-preview| Issue | Solution |
|---|---|
| "No API key provided" | Set GEMINI_API_KEY env var or pass --api-key |
| Empty response / safety filter | Rephrase the prompt — some content triggers safety filters |
| "No image was generated" | The model returned text only — try a more visual/descriptive prompt |
| Low quality output | Be more specific in the prompt; try higher resolution |
| Wrong style | Explicitly name the style you want ("photorealistic", "vector", etc.) |