Help us improve
Share bugs, ideas, or general feedback.
From gr
Enhances image generation prompts with Subject-Context-Style structure, lighting physics, camera terminology, and character consistency patterns. Useful for creating detailed, physically coherent image prompts.
npx claudepluginhub galbaz1/video-research-mcpHow this skill is triggered — by the user, by Claude, or both
Slash command
/gr:image-generationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Enhance every image generation prompt around three core elements:
Generates optimized prompts for Gemini 2.5 Flash Image (Nano Banana) using best practices for photorealistic shots, art styles, and multi-turn editing workflows.
Translates visual style descriptions, artistic references, or art direction into precise Midjourney prompts with camera/lens specs, quality parameters, and aspect ratios.
Generates images from text, edits images with references, performs product placement, style transfer, and multi-image composition using OpenAI DALL-E or Google Gemini.
Share bugs, ideas, or general feedback.
Enhance every image generation prompt around three core elements:
The main focus of the image.
The environment and conditions.
The visual treatment.
Add concrete visual details where the user left gaps:
When a photographic look is appropriate:
Convey mood through environmental details:
When the image should contain readable text (signs, labels, titles, typography):
"OPEN 24 HOURS" in bold sans-serifWhen the same character must be recognizable across multiple images:
inputImagePath to iterate on a base character image until all markers are locked in, then use that locked image as the reference for subsequent generationsWhen combining multiple visual elements in one scene:
When depicting real places, cultures, or historical elements:
Tailor the prompt to the intended use:
| Purpose | Emphasis |
|---|---|
| Product photo | Clean background, studio lighting, commercial appeal |
| UI mockup | Flat design elements, consistent spacing, screen-appropriate |
| Presentation slide | Bold composition, clear focal point, text-friendly layout |
| Social media | Eye-catching, vibrant, crop-friendly aspect ratio |
| Book/album cover | Typography space, dramatic mood, symbolic elements |
| Video style anchor | Highest quality, 4K resolution, named physical light source, fine surface textures, film-like grain. This image becomes the visual reference for downstream video generation -- maximize detail and lighting consistency |
When generating images that will serve as style references for AI video production:
inputImagePath to refine the hero image until lighting, texture, and composition are exactly rightWhen modifying an existing image:
Input: "A happy dog in a park"
Enhanced: "Golden retriever mid-leap catching a red frisbee, ears flying, tongue out in joy, in a sunlit urban park. Soft morning light filtering through oak trees creates dappled shadows on emerald grass. Background shows families on picnic blankets, slightly out of focus. Shot from low angle emphasizing the dog's athletic movement, with motion blur on the paws suggesting speed."