From skillkit-creative
Transforms creative ideas into production-ready screenplays with scene breakdowns, visual descriptions, proper formatting, and XML-tagged markdown for AI video pipelines like imagine and arch-v.
npx claudepluginhub rfxlamia/skillkit --plugin skillkit-creativeThis skill uses the workspace's default tool permissions.
This skill transforms creative concepts into professional screenplay documents optimized for AI-powered video production pipelines. It bridges the gap between raw story ideas and production-ready scripts by generating structured, visual-rich narratives in industry-standard screenplay format.
Conducts multi-round deep research on GitHub repos via API and web searches, generating markdown reports with executive summaries, timelines, metrics, and Mermaid diagrams.
Dynamically discovers and combines enabled skills into cohesive, unexpected delightful experiences like interactive HTML or themed artifacts. Activates on 'surprise me', inspiration, or boredom cues.
Generates images from structured JSON prompts via Python script execution. Supports reference images and aspect ratios for characters, scenes, products, visuals.
This skill transforms creative concepts into professional screenplay documents optimized for AI-powered video production pipelines. It bridges the gap between raw story ideas and production-ready scripts by generating structured, visual-rich narratives in industry-standard screenplay format.
Pipeline Position: diverse-content-gen → screenwriter → imagine → arch-v
Key Capabilities:
Slugline Format:
INT/EXT. LOCATION - TIME
Components:
Examples:
EXT. WASTELAND - DAWN
INT. ABANDONED SUBWAY STATION - NIGHT
EXT. ROOFTOP GARDEN - GOLDEN HOUR
Guidelines:
Purpose: Describe what the audience sees on screen. This is CRITICAL for image generation.
Visual-Rich Writing Principles:
Example - Weak:
A robot walks through the city. It's sad.
Example - Strong:
A BOXY ROBOT (Unit-7, weathered chrome with a single blue optical sensor) rolls through fog-shrouded streets. Neon signs flicker overhead, casting pink and cyan reflections on wet pavement. The robot's movements are slow, deliberate—almost hesitant.
Visual Enhancement Checklist:
First Appearance - Detailed:
SARAH (28, sharp eyes, wearing a weathered leather jacket over faded jeans) enters the frame. Her dark hair is pulled back, revealing a small scar above her left eyebrow.
Subsequent Appearances - Brief:
Sarah checks her watch.
Guidelines:
Format:
CHARACTER NAME
(parenthetical - optional)
Dialogue goes here.
Guidelines for Short Films:
Example:
UNIT-7 (robotic voice, soft)
Organic life form detected.
Probability of survival: low.
Common Transitions:
FADE IN: - Opening of screenplay onlyCUT TO: - Scene change (usually implied, use for emphasis)SMASH CUT TO: - Abrupt, jarring transitionDISSOLVE TO: - Passage of timeFADE OUT. - End of screenplayModern Best Practice: Most transitions are IMPLIED. Use sparingly, only for specific narrative effect.
Each scene wrapped in XML with metadata for pipeline processing:
<scene number="1" duration="30-45s">
<slugline>EXT. WASTELAND - DAWN</slugline>
<location>Wasteland</location>
<time>Dawn</time>
<characters>Unit-7</characters>
<mood>desolate, lonely</mood>
<key_visuals>
<visual>post-apocalyptic wasteland with ruined skyscrapers</visual>
<visual>boxy robot with single blue optical sensor</visual>
<visual>dust and smog atmosphere, weak pale sun</visual>
</key_visuals>
<action>
Gray dust covers everything. Skeletal remains of skyscrapers pierce the horizon. The sun, pale and weak, struggles through thick smog.
A ROBOT (Unit-7, boxy frame with single blue optical sensor) rolls across cracked asphalt. Its treads leave marks in the dust—the only sign of life.
The robot stops at a pile of rubble, extending a mechanical arm to sort through debris. Methodical. Purposeful. Lonely.
</action>
</scene>
number: Scene sequence number (1, 2, 3...)duration: Estimated screen time (for 5-10 min total)slugline: Master scene headinglocation: Extracted location nametime: Time of daycharacters: Comma-separated character listmood: Emotional tone/atmospherekey_visuals: Array of specific visual elements for image generationaction: The full action/description textdialogue (optional): Character dialogue if presentAverage: ~30-60 seconds per scene
Act 1 - Setup (20-25%): 2-3 scenes
Act 2 - Confrontation (50-60%): 4-8 scenes
Act 3 - Resolution (20-25%): 2-3 scenes
For detailed guidance on metadata standards, visual optimization, and integration with imagine/arch-v:
For sophisticated screenwriting techniques, camera movement hints, and pacing optimization: