Skill

Kinetic Video Creator

Generates kinetic typography videos from scripts using AI TTS speech with emotional dynamics, music, word-level timing sync, and Remotion animations. For promos, explainers, social content.

React

ai-ml

npx claudepluginhub aviz85/claude-skills-library

Tool Access

This skill uses the workspace's default tool permissions.

Supporting Assets

examples/complete-workflow.md
templates/music-prompt.md
templates/remotion-composition.md
templates/single-word-composition.tsx
templates/speech-template.md

SKILL.md

Stats

Stars: 27

Forks: 8

Last Commit: Jan 22, 2026

Used By: 2 plugins

Kinetic Video Creator

Create stunning kinetic typography videos with AI-generated speech, music, and dynamic animations.

Workflow Overview

1. Script → Craft emotionally compelling speech text
2. Speech → Use /speech-generator skill for TTS
3. Transcribe → Use /transcribe skill for word timing
4. Music → Use /music-generator skill for background
5. Merge → Combine speech + music
6. Animate → Create kinetic typography in Remotion
7. Render → Produce final video
8. Publish → Use /youtube-uploader skill (optional)

Step 1: Craft the Script

Language Selection

Hebrew (Recommended for aviz's voice):

Use Hebrew emotional directions in brackets
Add natural Hebrew filler words
See "Hebrew Script Guidelines" section below

English:

Use English emotional directions
See "English Script Guidelines" section below

Hebrew Script Guidelines

aviz's cloned voice is optimized for Hebrew. Use these Hebrew directions:

Hebrew Emotional Directions

Direction	Effect
`[נשימה עמוקה]`	Deep breath, pause
`[בהתלהבות]`	Enthusiastic
`[ברצינות]`	Serious tone
`[בעצב]`	Sad, emotional
`[בשקט]`	Quiet, intimate
`[מהר]`	Fast pace
`[לאט ובבירור]`	Slow and clear
`[שאלה]`	Question tone
`[הפתעה]`	Surprise
`[צחוק קל]`	Light laugh
`[בחום]`	Warm tone
`[בכוח]`	Powerful, emphatic

Hebrew Filler Words (for natural flow)

אממ... - hesitation
אהה... - thinking
כאילו... - like
נו... - well
יאללה... - come on
בקיצור... - in short
... - pause

Hebrew Script Example

[נשימה עמוקה] יש רגע...
[לאט ובבירור] רגע שהכל משתנה.

[ברצינות] אבא שלי חלה בפוליו כשהיה תינוק.
כל חייו הוא היה על כיסא גלגלים.

[בהתלהבות] אבל אבא שלי? הוא היה ספורטאי מצטיין!
[בחום] הוא תמיד האמין... שאפשר להגשים כל חלום.

[בעצב] כשהייתי בן חמש עשרה... אבא נפטר.

[בכוח] והכאב הזה? הפך למשימה שלי.
[בחום] לעזור לאנשים אחרים להגשים את החלומות שלהם.

English translation of the example (for reference only; the Hebrew text above is what goes to the speech generator):

[deep breath] There is a moment...
[slowly and clearly] A moment when everything changes.

[seriously] My father contracted polio when he was a baby.
All his life he was in a wheelchair.

[enthusiastically] But my father? He was an outstanding athlete!
[warmly] He always believed... that any dream can be fulfilled.

[sadly] When I was fifteen... Dad passed away.

[powerfully] And that pain? It became my mission.
[warmly] To help other people fulfill their dreams.

English Script Guidelines

English Emotional Directions

Direction	Effect
`[pause]`	Brief pause
`[long pause]`	Extended pause
`[slowly]`	Slower delivery
`[faster]`	Quickened pace
`[whisper]`	Softer, intimate
`[emphatic]`	Strong emphasis
`[building]`	Increasing intensity
`[warm]`	Friendly tone
`[dramatic]`	Theatrical
`[matter-of-fact]`	Conversational

English Script Template

[HOOK - 5-10 seconds]
[dramatic pause] Opening line that grabs attention.
[slowly, with weight] The provocative statement.

[BUILD - 20-40 seconds]
[building intensity] Establish the context.
[pause for effect] Key insight moment.

[PEAK - 20-30 seconds]
[powerful, emphatic] The main message.
[pause] Let it land.

[RESOLVE - 15-25 seconds]
[warm, inspiring] Paint the vision.
[final beat] Memorable closing.
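
When reviewing a script, it can help to separate the bracketed directions from the words that will be spoken, for example to count words per section. The helper below is a minimal sketch, not part of the skill: `parseScript` and `ScriptSegment` are hypothetical names, and it assumes directions are always written as `[ ... ]` and never nested.

```typescript
// One spoken segment together with the direction that precedes it (if any).
interface ScriptSegment {
  direction: string | null; // text inside the brackets, e.g. "pause"
  text: string;             // words that follow that direction
}

// Split a script into (direction, text) segments.
// Assumption: directions are written as [ ... ] and are never nested.
function parseScript(script: string): ScriptSegment[] {
  const segments: ScriptSegment[] = [];
  const pattern = /\[([^\]]+)\]/g;
  let direction: string | null = null;
  let cursor = 0;
  let match: RegExpExecArray | null;

  // Collapse whitespace and store the segment unless it is completely empty.
  const pushText = (raw: string): void => {
    const text = raw.replace(/\s+/g, ' ').trim();
    if (text !== '' || direction !== null) {
      segments.push({ direction, text });
    }
  };

  while ((match = pattern.exec(script)) !== null) {
    const before = script.slice(cursor, match.index);
    // Skip the empty prefix when the script starts with a direction.
    if (cursor > 0 || before.trim() !== '') {
      pushText(before);
    }
    direction = (match[1] ?? '').trim();
    cursor = match.index + match[0].length;
  }
  pushText(script.slice(cursor));
  return segments;
}
```

Under these assumptions, a line such as `[pause] Let it land.` becomes one segment with direction `pause`, while a standalone label such as `[HOOK - 5-10 seconds]` becomes a segment with empty text. The same pattern works for the Hebrew directions, since it only looks at the brackets.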

Step 2: Generate Speech

Use the speech-generator skill:

/speech-generator [path/to/script.txt] -o [path/to/speech.mp3]

Or invoke directly:

cd ~/.claude/skills/speech-generator/scripts
npx ts-node generate_speech.ts -f script.txt -o speech.mp3

Important: The speech-generator uses aviz's cloned voice, which works best with Hebrew text and Hebrew emotional directions.

Step 3: Transcribe for Timing

Use the transcribe skill:

/transcribe [path/to/speech.mp3] --json

Or invoke directly:

cd ~/.claude/skills/transcribe/scripts
npx ts-node transcribe.ts -i speech.mp3 -o transcript.srt --json

Output: transcript_transcript.json with word-level timing data.
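
Before animating, a quick sanity check of the timing data may save a failed render. The sketch below assumes the JSON has a `words` array whose entries carry `word`, `start`, and `end` (the fields the Remotion step reads) and that times are in seconds; both the unit and the helper name `findTimingProblems` are assumptions, not something the transcribe skill documents here.

```typescript
// Shape assumed from the fields used in the Remotion step: word, start, end.
interface TranscriptWord {
  word: string;
  start: number; // assumed: seconds from the start of the audio
  end: number;   // assumed: seconds from the start of the audio
}

// Return human-readable problems found in the timings (empty array = none found).
function findTimingProblems(words: TranscriptWord[]): string[] {
  const problems: string[] = [];
  let previousEnd = 0;

  words.forEach((w, i) => {
    // Whitespace-only entries are skipped, as the composition code filters them too.
    if (w.word.trim() === '') return;

    if (w.end < w.start) {
      problems.push(`word ${i} ("${w.word}") ends before it starts`);
    }
    if (w.start < previousEnd) {
      problems.push(`word ${i} ("${w.word}") starts before the previous word ends`);
    }
    previousEnd = Math.max(previousEnd, w.end);
  });

  return problems;
}
```

An overlap is not necessarily an error, but it is worth a look because a one-word-per-screen layout would show two words competing for the same frames.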

Step 4: Generate Background Music

Use the music-generator skill:

/music-generator [composition.json] -o background_music.mp3

Music Composition Template

{
  "duration_ms": 75000,
  "instrumental": true,
  "positive_global_styles": ["cinematic", "inspirational"],
  "negative_global_styles": ["aggressive", "chaotic"],
  "sections": [
    {
      "section_name": "Hook - Mysterious",
      "duration_ms": 12000,
      "positive_local_styles": ["suspenseful", "soft"],
      "negative_local_styles": ["loud"],
      "lines": []
    },
    {
      "section_name": "Build - Rising",
      "duration_ms": 25000,
      "positive_local_styles": ["hopeful", "building"],
      "negative_local_styles": ["slow"],
      "lines": []
    },
    {
      "section_name": "Peak - Triumphant",
      "duration_ms": 20000,
      "positive_local_styles": ["triumphant", "uplifting"],
      "negative_local_styles": ["quiet"],
      "lines": []
    }
  ]
}
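
Note that the three sections in the template add up to 57,000 ms, while the top-level duration_ms is 75,000. Whether the music-generator requires the two to match is not stated here. If it does, a small check like the following sketch can catch the mismatch before generating; the type and function names are hypothetical, and only the `duration_ms` and `sections` fields from the template are used.

```typescript
// Only the fields needed for the check; the template has more.
interface MusicSection {
  section_name: string;
  duration_ms: number;
}

interface MusicComposition {
  duration_ms: number;
  sections: MusicSection[];
}

// Milliseconds of the total duration not covered by any section.
// Positive: sections are too short. Negative: sections overrun the total.
function uncoveredDurationMs(composition: MusicComposition): number {
  const covered = composition.sections.reduce(
    (sum, section) => sum + section.duration_ms,
    0,
  );
  return composition.duration_ms - covered;
}
```

For the template above this returns 18000, which would leave room for a fourth section matching the script's resolve part.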

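Step 5: Merge Audio

Mix the speech at full volume with the music at 15% volume; the output ends when the speech ends. Remotion reads the merged file from public/[project]/ (see Project Structure), so copy final_audio.mp3 there after merging.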

ffmpeg -y \
  -i speech.mp3 \
  -i background_music.mp3 \
  -filter_complex "[0:a]volume=1.0[speech];[1:a]volume=0.15[music];[speech][music]amix=inputs=2:duration=first[out]" \
  -map "[out]" -c:a libmp3lame -q:a 2 \
  final_audio.mp3
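
If the merge is driven from a script, the same command can be assembled as an argument list so that the music level is easy to adjust. This is a sketch only: `buildMixArgs` is a hypothetical helper, and the filter graph is copied from the command above with the music volume made a parameter.

```typescript
// Build the ffmpeg argument list for mixing speech over background music.
// musicVolume is the linear gain applied to the music track (0.15 = 15%).
function buildMixArgs(
  speechPath: string,
  musicPath: string,
  outputPath: string,
  musicVolume: number = 0.15,
): string[] {
  if (musicVolume < 0 || musicVolume > 1) {
    throw new RangeError('musicVolume must be between 0 and 1');
  }

  // Same filter graph as the shell command: scale each input, then mix,
  // ending the output when the first input (the speech) ends.
  const filter =
    `[0:a]volume=1.0[speech];[1:a]volume=${musicVolume}[music];` +
    '[speech][music]amix=inputs=2:duration=first[out]';

  return [
    '-y',
    '-i', speechPath,
    '-i', musicPath,
    '-filter_complex', filter,
    '-map', '[out]',
    '-c:a', 'libmp3lame',
    '-q:a', '2',
    outputPath,
  ];
}
```

The returned array can be handed to Node's `child_process.spawn('ffmpeg', args)`. One thing to keep in mind: amix rescales its inputs by default, so the effective levels may be lower than the volume values suggest. Newer FFmpeg versions accept `normalize=0` on amix to turn that off.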

Step 6: Create Remotion Composition

Project Location

cd /Users/aviz/remotion-assistant

Default Template: SequenceComposition (One Word Per Screen)

Recommended: Use SequenceComposition for maximum impact - displays one word at a time with full-screen typography.

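import React from 'react'; // needed for the React.FC type used below
import { SequenceComposition } from '../templates/SequenceComposition';
// Word-level timings produced in Step 3
import transcriptData from '../../projects/[project]/transcript_transcript.json';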

const WORD_TIMINGS = transcriptData.words
  .filter((w) => w.word.trim() !== '')
  .map((w) => ({
    word: w.word,
    start: w.start,
    end: w.end,
  }));

export const MyVideo: React.FC = () => {
  return (
    <SequenceComposition
      wordTimings={WORD_TIMINGS}
      audioFile="[project]/final_audio.mp3"
      baseFontSize={200}
      dustEnabled={true}
      lightBeamsEnabled={true}
      centerGlowEnabled={true}
      glowIntensity={1}
      anticipationFrames={5}
      colorSchemeStart={0}
    />
  );
};
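
Remotion needs a `durationInFrames` and an `fps` value when the composition is registered with `<Composition>`. One way to derive the duration from the word timings is sketched below. It assumes `start` and `end` are in seconds, and `compositionDurationInFrames` and the one-second tail are illustrative choices, not part of the templates.

```typescript
interface WordTiming {
  word: string;
  start: number; // assumed: seconds
  end: number;   // assumed: seconds
}

// Frames needed to show every word, plus a short tail after the last one.
function compositionDurationInFrames(
  wordTimings: WordTiming[],
  fps: number,
  tailSeconds: number = 1,
): number {
  if (fps <= 0) {
    throw new RangeError('fps must be positive');
  }
  // End time of the last-ending word (0 when there are no words).
  const lastEnd = wordTimings.reduce((max, w) => Math.max(max, w.end), 0);
  // Round up so the final word is never cut off; Remotion requires at least 1 frame.
  return Math.max(1, Math.ceil((lastEnd + tailSeconds) * fps));
}
```

For example, timings that end at 2.5 seconds give 105 frames at 30 fps with the default tail. If the merged audio runs longer than the last word, using the audio length instead would be the safer basis.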

Alternative: MultiWordComposition (Word Cloud)

Use for faster-paced content with multiple words on screen:

import { MultiWordComposition } from '../templates/MultiWordComposition';

Hebrew Font Support

For Hebrew text, use Heebo font:

import { loadFont } from '@remotion/google-fonts/Heebo';

const { fontFamily } = loadFont('normal', {
  weights: ['400', '600', '700', '900'],
  subsets: ['hebrew', 'latin'],
});

Add RTL styling:

style={{
  direction: 'rtl',
  fontFamily,
}}
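
For scripts that mix Hebrew and English words, the direction could be chosen per word rather than hard-coded. The following is a minimal sketch under the assumption that any character from the Unicode Hebrew block (U+0590 to U+05FF) marks a word as right-to-left; `textDirection` and `wordStyle` are hypothetical helpers, not part of the templates.

```typescript
// 'rtl' if the text contains at least one character from the Hebrew block.
function textDirection(text: string): 'rtl' | 'ltr' {
  return /[\u0590-\u05FF]/.test(text) ? 'rtl' : 'ltr';
}

// Style object in the same shape as the RTL snippet above.
function wordStyle(
  text: string,
  fontFamily: string,
): { direction: 'rtl' | 'ltr'; fontFamily: string } {
  return { direction: textDirection(text), fontFamily };
}
```

The result of `wordStyle(word, fontFamily)` can be passed as the `style` prop of the element that renders the word.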

Step 7: Render

cd /Users/aviz/remotion-assistant
npx remotion render CompositionName output.mp4

Step 8: Upload (Optional)

Use the youtube-uploader skill:

/youtube-uploader [video.mp4] --title "Title" --description "Description"

Project Structure

remotion-assistant/
├── public/[project]/
│   └── final_audio.mp3      # Audio for Remotion
├── projects/[project]/
│   ├── speech.txt           # Script
│   ├── speech.mp3           # TTS output
│   ├── transcript_transcript.json  # Word timings
│   ├── music_composition.json
│   ├── background_music.mp3
│   ├── final_audio.mp3      # Merged audio
│   └── output.mp4           # Final video
└── src/compositions/
    └── [ProjectName].tsx    # Composition

Quick Reference

Step	Skill/Command
Speech	`/speech-generator script.txt -o speech.mp3`
Transcribe	`/transcribe speech.mp3 --json`
Music	`/music-generator composition.json -o background_music.mp3`
Merge	`ffmpeg` (see above)
Render	`npx remotion render Name output.mp4`
Upload	`/youtube-uploader output.mp4`