From AI Image Generation
Provides AI image generation prompt engineering for DALL-E 3, Gemini Imagen, Stable Diffusion, and Midjourney. Activated when asked to generate, create, or draw images.
How this skill is triggered — by the user, by Claude, or both
Slash command
/image-gen:image-generationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Expert image prompt engineering for AI image generators (DALL-E 3, Midjourney, Stable Diffusion, Gemini/Nano Banana Pro).
Expert image prompt engineering for AI image generators (DALL-E 3, Midjourney, Stable Diffusion, Gemini/Nano Banana Pro).
See Also - Specialized API Skills:
- gemini-3-pro - Full Gemini suite: Imagen 3, Veo 2 video, TTS, Live API, embeddings
- nano-banana-pro - Photorealistic portrait templates for Gemini
- openai-dalle - Full OpenAI suite: DALL-E 3, Sora 2, Whisper, GPT-4o, TTS
Load reference prompts from: ${WORKSPACE}/.claude/prompts/image_prompts_reference.json
[Main subject] + [Detailed description] + [Pose/Action]
Examples:
[Clothing items] + [Colors] + [Style] + [Accessories]
Examples:
[Location type] + [Details] + [Atmosphere]
Examples:
[Light type] + [Direction] + [Quality] + [Color]
Examples:
[Lens/focal length] + [Angle] + [DOF] + [Film type if applicable]
Examples:
[Resolution] + [Style keywords] + [Mood]
Examples:
[Quality] photo of [subject description], wearing [detailed clothing],
[pose], [expression], [background/environment], [lighting style],
[camera settings], [style keywords]
[Quality] image of [real person description] with [fictional character],
[interaction/pose], [environment], [lighting], [style modifiers],
preserving face exactly as reference
[Product description] on [surface/background], [lighting setup],
[camera angle], [reflections/shadows], [quality keywords]
[Shot type] of [subject] in [dramatic setting], [lighting mood],
[color grading], film still aesthetic, [genre keywords]
--ar [aspect ratio] --v [version] --style raw --q [quality 0.25-2]
--ar 16:9 for widescreen--ar 9:16 for vertical/mobile--ar 1:1 for square--v 6.0 for latest version--style raw for less stylized{
"steps": 30-60,
"cfg_scale": 7-12,
"sampler": "DPM++ 2M Karras",
"width": 1024,
"height": 1024
}
gemini-3.1-flash-image-preview (Nano Banana 2 — fast, cheap, 4K)gemini-3-pro-image-preview (Nano Banana Pro — higher quality, slower)from google import genai + GOOGLE_API_KEYresponse_modalities=['IMAGE', 'TEXT']GEMINI_API_KEY from env if set (SDK conflict)os.environ.pop('GEMINI_API_KEY', None)
client = genai.Client(api_key=os.getenv('GOOGLE_API_KEY'))
response = client.models.generate_content(
model='gemini-3.1-flash-image-preview', # or 'gemini-3-pro-image-preview' for max quality
contents=prompt,
config=types.GenerateContentConfig(response_modalities=['IMAGE', 'TEXT']),
)
blur, low quality, low resolution, grainy, pixelated, jpeg artifacts
extra limbs, deformed hands, extra fingers, distorted face, ugly,
disfigured, bad anatomy, wrong proportions
cartoon, anime, illustration, painting, sketch, 3d render
(when photorealistic is needed)
watermark, text, logo, frame, border, signature, username
Add these for better results:
Hyper-realistic fashion photo of a confident young woman with natural
makeup, wearing an oversized charcoal grey blazer over white t-shirt,
high-waisted black trousers, minimal gold jewelry. Standing with one
hand in pocket, direct eye contact with camera. Seamless white studio
background, soft diffused studio lighting with clean highlights.
Shot on medium format camera, 85mm lens, shallow depth of field.
Professional fashion photography, editorial style, 8K ultra-detailed.
Negative: blur, grainy, extra fingers, deformed, cartoon, watermark
Photorealistic 8K image of a smiling young Asian woman taking a selfie
with Judy Hopps from Zootopia. Both characters standing side by side
in a dark cinema hall, large movie screen visible behind them.
The woman has long black hair, wearing white strapless top with stars.
Judy in her police uniform, smiling. Cinematic lighting, ultra-detailed,
preserve human face exactly as uploaded reference.
Negative: cartoon style on human, deformed face, blurry, low quality
Hyperrealistic 8k photo in bright cozy living room. Subject standing
with humanoid robot partner behind them. Robot: tall athletic build,
silver and gunmetal plates, visible cable muscles, glowing blue eyes,
V-shaped torso. Robot's arm wrapped protectively around subject.
Natural sunlight through white curtains, green houseplants, warm neutral
walls. Realistic skin texture, detailed metal surfaces with micro-scratches,
accurate global illumination. Photoreal, cinematic lighting.
Negative: cartoon, anime, low res, horror, grotesque, human skin on robot
Sleek black smartphone floating at 45-degree angle above polished
dark marble surface. Dramatic side lighting creating elegant shadows
and specular highlights on screen. Subtle reflection on marble.
Pure black background, product photography style, 8K ultra-detailed,
sharp focus, professional commercial quality.
Negative: blur, reflections showing environment, dust, fingerprints, text
When user wants their face in the image:
"Use the exact same face from the uploaded photo without altering
any facial features or identity. Preserve the face, hairstyle,
body type, clothing, and overall style exactly as in the reference."
| Issue | Solution |
|---|---|
| Blurry output | Add "sharp focus", "high resolution" |
| Wrong hands | Add "anatomically correct hands" to prompt, "deformed hands" to negative |
| Cartoon-ish | Add "photorealistic", "photograph", add "cartoon, illustration" to negative |
| Wrong style | Be more specific about style, use negative prompts |
| Text appearing | Add "no text, no watermark" to negative |
npx claudepluginhub jhamidun/claude-code-config-pack --plugin image-genBuilds a throwaway prototype to answer a design question about UI appearance or state/logic behavior. Guides you through two branches: interactive terminal app for logic validation, or multiple UI variations for visual exploration.