Skill

Make Lip Sync

Lip-sync a face image or video clip to a user-uploaded audio track, producing a 9:16 talking-head video. No TTS or voice cloning.

ai-ml

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/agent-media:make-lip-sync

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

mcp__agent-media__make_lip_sync

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Bring your own audio: lip-sync a face (an R2-hosted image / character sheet, OR an existing clip) to a provided audio track. No text-to-speech or voice cloning — the character speaks your uploaded recording. Output is a 9:16 talking-head video.

SKILL.md

59 lines · ~601 tokens

Stats

Stars49

Forks16

MaintenanceExcellent

Last CommitJun 18, 2026

Actions

View Source View Plugin View on GitHub View README

Make Lip Sync

When to use this

Call this skill when the user asks for the outcome described above. It runs on the agent-media vNext primitive runtime via the mcp__agent-media__make_lip_sync MCP tool. Authentication is the user's existing agent-media Bearer token (issued by agent-media login).

How to call it

Preferred path: MCP tool mcp__agent-media__make_lip_sync. Schema is auto-published via tools/list against the same MCP server, so don't restate the schema here — trust the server's response.

Fallback path: REST.

POST https://api.agent-media.ai/v1/skills/make_lip_sync/run
Authorization: Bearer $AGENT_MEDIA_API_KEY
Content-Type: application/json
Idempotency-Key: <any unique string per intent>

{
  "image_url": "https://pub-...r2.dev/vnext/primitive-runs/<id>/character-sheet.png",
  "audio_url": "https://pub-...r2.dev/vnext/<your-uploaded-audio>.mp3",
  "duration": 10,
  "aspect_ratio": "9:16"
}

What it costs and how long it takes

Credits: 140/280/420 (5s/10s/15s)
Wall time (typical): 420–480s
Deducted at submit; refunded on terminal failure.

Polling the result

GET https://api.agent-media.ai/v1/primitives/runs/<run_id>
Authorization: Bearer $AGENT_MEDIA_API_KEY

House rules baked into this skill

See reference/realism-rubric.md for the realism doctrine baked into every prompt.
See reference/auth.md for first-time install and agent-media login.

Source of truth

This file is auto-generated by scripts/generate-public-skill.ts from the registry at services/api-v2/src/skills/registry.ts. Do not hand-edit; CI rejects drift.

Make Lip Sync

Popularity

Invocation

Tool Access

Context Preview

SKILL.md

Make Lip Sync

Popularity

Invocation

Tool Access

Context Preview

SKILL.md

Make Lip Sync

When to use this

How to call it

What it costs and how long it takes

Polling the result

House rules baked into this skill

Source of truth

Similar Skills

Make Lip Sync

When to use this

How to call it

What it costs and how long it takes

Polling the result

House rules baked into this skill

Source of truth

Similar Skills