Skill

fal-vision

Analyze images using AI: segment objects, detect objects, extract text (OCR), describe images, and answer questions about images. Use when the user requests "Segment image", "Detect objects", "OCR", "Extract text from image", "Describe image", "What's in this image", or "Image analysis".

Install

npx claudepluginhub joshuarweaver/cascade-content-creation-misc-1 --plugin fal-ai-community-skills

Tool Access

This skill uses the workspace's default tool permissions.

Supporting Assets

scripts/analyze.sh

SKILL.md

Stats

Stars: 81

Forks: 14

Last Commit: Mar 6, 2026

fal-vision

Analyze and understand images using fal.ai vision models — segmentation, detection, OCR, captioning, and visual QA.

Scripts

| Script | Purpose |
| --- | --- |
| `analyze.sh` | Analyze an image (segment, detect, OCR, describe, QA) |

Usage

Segment Objects

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation segment --query "the red car"

Detect Objects

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation detect

Extract Text (OCR)

./scripts/analyze.sh --image-url "https://example.com/document.jpg" --operation ocr

Describe Image

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation describe

Visual QA

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation qa --query "How many people are in this image?"
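The commands above each analyze one image. To run the same operation over several images, a small wrapper loop is one option. The sketch below is illustrative only: `analyze_all` and the `ANALYZE_SH` variable are hypothetical names, and it assumes `analyze.sh` prints its result to stdout and exits non-zero on failure, which the source does not confirm.

```shell
#!/usr/bin/env bash
# Path to the script; override ANALYZE_SH if it lives elsewhere (assumption).
ANALYZE_SH="${ANALYZE_SH:-./scripts/analyze.sh}"

# Run one operation over every URL given as an argument.
# Usage: analyze_all <operation> <url>...
# Returns non-zero if any single call failed (assumes analyze.sh
# signals failure through its exit status).
analyze_all() {
  local operation="$1"
  shift
  local url failures=0
  for url in "$@"; do
    echo "== $url"
    if ! "$ANALYZE_SH" --image-url "$url" --operation "$operation"; then
      echo "failed: $url" >&2
      failures=$((failures + 1))
    fi
  done
  [ "$failures" -eq 0 ]
}
```

For example, `analyze_all ocr "https://example.com/a.jpg" "https://example.com/b.jpg"` would run OCR on both URLs in turn. Operations that need `--query` (segment, qa) are not covered by this sketch.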

Arguments

| Argument | Description | Required |
| --- | --- | --- |
| `--image-url` | URL of image to analyze | Yes |
| `--operation` | segment, detect, ocr, describe, qa | Yes |
| `--query` / `-q` | Text prompt for segment/qa operations | For segment/qa |
| `--model` / `-m` | Override model endpoint | No |
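The rules in the table can be checked before calling the script. The following is a minimal sketch of such a pre-flight check; `validate_args` is a hypothetical helper, not part of `analyze.sh`, and it only mirrors the table (a known operation, plus a query for segment and qa).

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check mirroring the Arguments table.
# It does not call analyze.sh; it only validates the inputs.

# Usage: validate_args <operation> [query]
# Returns 0 when the operation/query pair is acceptable, 1 otherwise.
validate_args() {
  local operation="$1" query="${2:-}"
  case "$operation" in
    segment|qa)
      # These two operations need a text prompt.
      if [ -z "$query" ]; then
        echo "error: --query is required for $operation" >&2
        return 1
      fi
      ;;
    detect|ocr|describe)
      # No query needed for these operations.
      ;;
    *)
      echo "error: unknown operation: $operation" >&2
      return 1
      ;;
  esac
  return 0
}
```

A caller might write `validate_args qa "$QUERY" && ./scripts/analyze.sh --image-url "$URL" --operation qa --query "$QUERY"`, where `QUERY` and `URL` are placeholders set by the caller.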

Finding Models

To find current vision and analysis models, use the model search script from the fal-generate skill:

# Search for segmentation models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "segmentation"

# Search for object detection models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "object detection"

# Search for OCR models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "ocr"

# Search for image captioning / visual QA models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "caption"
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "visual question"

Alternatively, use the `search_models` MCP tool with keywords such as "segmentation", "detection", "ocr", "caption", or "vision".
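The search keywords above line up with the `analyze.sh` operations, so the lookup can be written down once. This is a rough sketch: `search_keyword_for`, `print_search_command`, and the `SEARCH_SH` variable are hypothetical names, and the mapping is simply the pairing implied by the search examples in this section.

```shell
#!/usr/bin/env bash
# Location of the search script as given in this section; override if needed.
SEARCH_SH="${SEARCH_SH:-/mnt/skills/user/fal-generate/scripts/search-models.sh}"

# Map an analyze.sh operation to the search keyword used in this section.
search_keyword_for() {
  case "$1" in
    segment)  echo "segmentation" ;;
    detect)   echo "object detection" ;;
    ocr)      echo "ocr" ;;
    describe) echo "caption" ;;
    qa)       echo "visual question" ;;
    *)        echo "unknown operation: $1" >&2; return 1 ;;
  esac
}

# Print the search command for an operation without running it.
print_search_command() {
  local keyword
  keyword="$(search_keyword_for "$1")" || return 1
  printf 'bash %s --query "%s"\n' "$SEARCH_SH" "$keyword"
}
```

Once a model has been chosen from the search results, its endpoint can be passed to `analyze.sh` through `--model` as listed in the Arguments table. No specific endpoint is suggested here because the source does not name one.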