Skill

fal-model-guide

Guides fal.ai model selection for image (FLUX, SDXL), video (Kling, LTX, Sora), audio (Whisper, ElevenLabs) generation with quality-speed-cost comparisons, benchmarks, decision trees, and parameters.

TypeScript

JavaScript

ai-ml

npx claudepluginhub josiahsiegel/claude-plugin-marketplace --plugin fal-ai-master

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/fal-ai-master:fal-model-guide

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

SKILL.md

391 lines · ~2.9k tokens

Similar Skills

fal-text-to-image

Generates images from text prompts using fal.ai FLUX.1/2 Pro/Dev/Schnell and SDXL models. Configures image sizes, guidance scale, inference steps, LoRA, seeds, batching via TypeScript/Python APIs.

fal-ai-master

fal-generate

37.9k

Generates images and videos using fal.ai AI models. Provides guidance and patterns for integrating fal.ai in development workflows requiring AI media generation.

antigravity-awesome-skills

ComfyUI Workflow Generator

Generates ComfyUI workflow JSON files from natural language descriptions for txt2img, img2img, txt2vid, img2vid, upscale, inpaint, audio, and 3D tasks. Outputs valid, importable JSON with model download links and custom node requirements.

20 files

workflow-skill

Stats

LanguageShell

Parent stars37

Parent forks7

MaintenanceGood

Last CommitJan 29, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Quick Reference

Category	Fast/Cheap	Balanced	Best Quality
Image	FLUX Schnell	FLUX.1 Dev	FLUX.2 Pro
Video	Runway Turbo	LTX-2 Pro	Kling 2.6 Pro
Audio STT	Whisper Turbo	Whisper	Whisper Large v3
Audio TTS	Kokoro	XTTS	ElevenLabs

FLUX Model	Endpoint	Steps	Use Case
FLUX.2 Pro	`fal-ai/flux-2-pro`	28	Production
FLUX.1 Dev	`fal-ai/flux/dev`	28	High quality
FLUX Schnell	`fal-ai/flux/schnell`	4	Fast iteration

Video Model	Duration	Audio	Speed
Kling 2.6 Pro	5-10s	Yes	Slow
LTX-2 Pro	5s	Yes	Medium
Runway Turbo	5-10s	No	Fast

When to Use This Skill

Use for model selection decisions:

Comparing FLUX vs SDXL for image generation
Choosing video models by quality tier
Optimizing costs with appropriate model selection
Understanding model-specific parameters
Building production vs development pipelines

Related skills:

For text-to-image: see fal-text-to-image
For text-to-video: see fal-text-to-video
For audio models: see fal-audio

fal.ai Model Selection Guide

Comprehensive guide to selecting the right fal.ai model for your use case.

Image Generation Models

FLUX Family

Model	Endpoint	Best For	Speed	Quality	Cost
FLUX.2 [pro]	`fal-ai/flux-2-pro`	Production, best quality	Medium	Highest	$$$
FLUX.1 [dev]	`fal-ai/flux/dev`	High quality, open-source	Medium	High	$$
FLUX Schnell	`fal-ai/flux/schnell`	Fast iteration	Fast	Good	$
FLUX LoRA	`fal-ai/flux-lora`	Custom trained styles	Medium	High	$$
FLUX Realism	`fal-ai/flux-realism`	Photorealistic images	Medium	High	$$

FLUX.2 Pro - Latest and best quality

const result = await fal.subscribe("fal-ai/flux-2-pro", {
  input: {
    prompt: "Professional product photo of a watch",
    image_size: "square_hd",
    num_inference_steps: 28,
    guidance_scale: 3.5
  }
});

FLUX.1 Dev - Best open-source option

const result = await fal.subscribe("fal-ai/flux/dev", {
  input: {
    prompt: "A serene mountain landscape",
    image_size: "landscape_16_9",
    num_inference_steps: 28
  }
});

FLUX Schnell - Fast 4-step generation

const result = await fal.subscribe("fal-ai/flux/schnell", {
  input: {
    prompt: "Quick concept sketch",
    num_inference_steps: 4  // Optimized for 4 steps
  }
});

Stable Diffusion Models

Model	Endpoint	Best For
Fast SDXL	`fal-ai/fast-sdxl`	Speed, lower cost
SDXL	`fal-ai/stable-diffusion-xl`	Classic SDXL
SD 1.5	`fal-ai/stable-diffusion-v15`	Legacy compatibility
SDXL Turbo	`fal-ai/sdxl-turbo`	Ultra-fast

// Fast SDXL - Good balance of speed and quality
const result = await fal.subscribe("fal-ai/fast-sdxl", {
  input: {
    prompt: "A colorful abstract painting",
    image_size: "square_hd",
    num_inference_steps: 25
  }
});

Specialized Image Models

Model	Endpoint	Use Case
Recraft V3	`fal-ai/recraft-v3`	Design assets, vectors
Ideogram	`fal-ai/ideogram`	Text in images
Playground v2.5	`fal-ai/playground-v25`	Creative/artistic
Kandinsky 3	`fal-ai/kandinsky-3`	Russian model

Image-to-Image Models

Model	Endpoint	Use Case
FLUX i2i	`fal-ai/flux/dev/image-to-image`	Transform images
FLUX Inpaint	`fal-ai/flux/dev/inpainting`	Edit regions
FLUX ControlNet	`fal-ai/flux/dev/controlnet`	Guided generation
IP-Adapter	`fal-ai/ip-adapter-flux`	Style transfer

// Image-to-Image transformation
const result = await fal.subscribe("fal-ai/flux/dev/image-to-image", {
  input: {
    image_url: "https://example.com/photo.jpg",
    prompt: "Transform into watercolor painting style",
    strength: 0.75  // 0-1, how much to change
  }
});

// Inpainting (edit specific regions)
const result = await fal.subscribe("fal-ai/flux/dev/inpainting", {
  input: {
    image_url: "https://example.com/photo.jpg",
    mask_url: "https://example.com/mask.png",
    prompt: "A golden retriever"
  }
});

// ControlNet (structural guidance)
const result = await fal.subscribe("fal-ai/flux/dev/controlnet", {
  input: {
    prompt: "Modern house design",
    control_image_url: "https://example.com/edges.png",
    controlnet_conditioning_scale: 0.8
  }
});

Video Generation Models

Text-to-Video

Model	Endpoint	Quality	Duration	Audio	Best For
Kling 2.6 Pro	`fal-ai/kling-video/v2.6/pro`	Highest	5-10s	Native	Cinematic
Sora 2	`fal-ai/sora`	Highest	5-20s	Optional	Advanced
LTX-2 Pro	`fal-ai/ltx-video-2-pro`	High	5s	Yes	Fast HQ
Runway Gen-3	`fal-ai/runway/gen3/turbo`	High	5-10s	No	Fast
Luma	`fal-ai/luma-dream-machine`	Good	5s	No	Creative
CogVideoX	`fal-ai/cogvideox`	Good	6s	No	Open source

Kling 2.6 Pro - Best overall quality

const result = await fal.subscribe("fal-ai/kling-video/v2.6/pro", {
  input: {
    prompt: "A majestic eagle soaring over mountains at golden hour",
    duration: 5,
    aspect_ratio: "16:9",
    negative_prompt: "blurry, distorted",
    cfg_scale: 0.5
  }
});

LTX-2 Pro - Fast with audio

const result = await fal.subscribe("fal-ai/ltx-video-2-pro", {
  input: {
    prompt: "Ocean waves crashing on rocks",
    resolution: "720p",
    enable_audio: true
  }
});

Image-to-Video

Model	Endpoint	Best For
MiniMax Hailuo	`fal-ai/minimax/video-01`	Image animation
Kling i2v	`fal-ai/kling-video/v2.6/pro/image-to-video`	HQ animation
Luma i2v	`fal-ai/luma-dream-machine`	Creative
Runway i2v	`fal-ai/runway/gen3/turbo/image-to-video`	Fast

// Animate a still image
const result = await fal.subscribe("fal-ai/minimax/video-01", {
  input: {
    image_url: "https://example.com/portrait.jpg",
    prompt: "Person slowly turns head and smiles",
    prompt_optimizer: true
  }
});

Video-to-Video (Editing)

// Edit/transform existing video
const result = await fal.subscribe("fal-ai/kling-video/o1", {
  input: {
    video_url: "https://example.com/video.mp4",
    prompt: "Change to anime style"
  }
});

Audio Models

Speech-to-Text

Model	Endpoint	Best For
Whisper	`fal-ai/whisper`	Accurate transcription
Whisper Turbo	`fal-ai/whisper-turbo`	Fast transcription

const result = await fal.subscribe("fal-ai/whisper", {
  input: {
    audio_url: "https://example.com/speech.mp3",
    task: "transcribe",  // or "translate"
    language: "en",
    chunk_level: "segment"
  }
});

console.log(result.text);
console.log(result.chunks);  // With timestamps

Text-to-Speech

Model	Endpoint	Best For
Elevenlabs	`fal-ai/elevenlabs`	Premium voices
F5-TTS	`fal-ai/f5-tts`	Voice cloning
Kokoro	`fal-ai/kokoro`	Multi-language

// Text-to-speech with Elevenlabs
const result = await fal.subscribe("fal-ai/elevenlabs", {
  input: {
    text: "Hello, welcome to our service.",
    voice_id: "voice_id_here"
  }
});

3D Generation Models

Model	Endpoint	Use Case
TripoSR	`fal-ai/triposr`	Image to 3D mesh
InstantMesh	`fal-ai/instantmesh`	Fast 3D generation
Stable Zero123	`fal-ai/stable-zero123`	Novel view synthesis

// Generate 3D mesh from image
const result = await fal.subscribe("fal-ai/triposr", {
  input: {
    image_url: "https://example.com/object.jpg"
  }
});

console.log(result.model_mesh.url);  // GLB/OBJ file

Model Selection Decision Tree

What do you want to create?
├── Image
│   ├── From text only?
│   │   ├── Need best quality? → FLUX.2 Pro
│   │   ├── Need fast iteration? → FLUX Schnell
│   │   ├── Need open-source? → FLUX.1 Dev
│   │   └── Budget conscious? → Fast SDXL
│   ├── Transform existing image?
│   │   ├── Style transfer → FLUX i2i
│   │   ├── Edit specific region → FLUX Inpainting
│   │   └── Follow structure → FLUX ControlNet
│   └── Need text in image? → Ideogram
│
├── Video
│   ├── From text only?
│   │   ├── Need best quality? → Kling 2.6 Pro
│   │   ├── Need fast preview? → Runway Gen-3 Turbo
│   │   └── Need audio? → LTX-2 Pro or Kling
│   ├── Animate image? → MiniMax Hailuo
│   └── Edit video? → Kling O1
│
├── Audio
│   ├── Speech to text?
│   │   ├── Accurate → Whisper
│   │   └── Fast → Whisper Turbo
│   └── Text to speech? → Elevenlabs / F5-TTS
│
└── 3D
    └── Image to 3D? → TripoSR

Performance Comparison

Image Generation Speed

Model	~Time (1024x1024)
FLUX Schnell	1-2s
Fast SDXL	2-3s
FLUX.1 Dev	5-8s
FLUX.2 Pro	8-12s

Video Generation Speed

Model	~Time (5s video)
Runway Gen-3 Turbo	30-60s
LTX-2 Pro	60-90s
Kling 2.6 Pro	120-180s

Cost Optimization Strategies

Development Phase
- Use FLUX Schnell for prompt iteration
- Use Fast SDXL for quick tests
- Use Runway Gen-3 Turbo for video previews
Production Phase
- FLUX.2 Pro for final images
- Kling 2.6 Pro for final videos
Batch Processing
- Process during off-peak hours
- Use appropriate image sizes (don't upscale unnecessarily)
- Cache results by seed for reproducibility
Resource Efficiency
- Generate at target resolution (don't generate larger and downscale)
- Use webhooks for high-volume (avoid polling overhead)
- Implement client-side caching

Model-Specific Tips

FLUX Tips

Guidance scale 3-4 works best
28 steps is optimal default
Schnell works best with 4 steps only
Detailed prompts produce better results

Video Tips

Include camera movement in prompts
Describe action/motion explicitly
Use negative prompts to avoid artifacts
Start with shorter duration for testing

Whisper Tips

Provide language hint for better accuracy
Use "translate" task for non-English to English
chunk_level="segment" gives timestamps

Explore More Models

Visit https://fal.ai/models for the complete catalog of 600+ models including:

Face restoration/enhancement
Background removal
Upscaling
Style transfer
OCR
Object detection
And many more specialized models

fal-model-guide

Popularity

Invocation

Context Preview

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

fal-model-guide

Popularity

Invocation

Context Preview

SKILL.md

Quick Reference

When to Use This Skill

fal.ai Model Selection Guide

Image Generation Models

FLUX Family

Stable Diffusion Models

Specialized Image Models

Image-to-Image Models

Video Generation Models

Text-to-Video

Image-to-Video

Video-to-Video (Editing)

Audio Models

Speech-to-Text

Text-to-Speech

3D Generation Models

Model Selection Decision Tree

Performance Comparison

Image Generation Speed

Video Generation Speed

Cost Optimization Strategies

Model-Specific Tips

FLUX Tips

Video Tips

Whisper Tips

Explore More Models

Similar Skills

Help us improve

Quick Reference

When to Use This Skill

fal.ai Model Selection Guide

Image Generation Models

FLUX Family

Stable Diffusion Models

Specialized Image Models

Image-to-Image Models

Video Generation Models

Text-to-Video

Image-to-Video

Video-to-Video (Editing)

Audio Models

Speech-to-Text

Text-to-Speech

3D Generation Models

Model Selection Decision Tree

Performance Comparison

Image Generation Speed

Video Generation Speed

Cost Optimization Strategies

Model-Specific Tips

FLUX Tips

Video Tips

Whisper Tips

Explore More Models