Analyze images using AI — segment objects, detect objects, extract text (OCR), describe images, ask questions about images. Use when the user requests "Segment image", "Detect objects", "OCR", "Extract text from image", "Describe image", "What's in this image", "Image analysis".
npx claudepluginhub joshuarweaver/cascade-content-creation-misc-1 --plugin fal-ai-community-skillsThis skill uses the workspace's default tool permissions.
Analyze and understand images using fal.ai vision models — segmentation, detection, OCR, captioning, and visual QA.
Guides Next.js Cache Components and Partial Prerendering (PPR) with cacheComponents enabled. Implements 'use cache', cacheLife(), cacheTag(), revalidateTag(), static/dynamic optimization, and cache debugging.
Guides building MCP servers enabling LLMs to interact with external services via tools. Covers best practices, TypeScript/Node (MCP SDK), Python (FastMCP).
Generates original PNG/PDF visual art via design philosophy manifestos for posters, graphics, and static designs on user request.
Analyze and understand images using fal.ai vision models — segmentation, detection, OCR, captioning, and visual QA.
| Script | Purpose |
|---|---|
analyze.sh | Analyze an image (segment, detect, OCR, describe, QA) |
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation segment --query "the red car"
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation detect
./scripts/analyze.sh --image-url "https://example.com/document.jpg" --operation ocr
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation describe
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation qa --query "How many people are in this image?"
| Argument | Description | Required |
|---|---|---|
--image-url | URL of image to analyze | Yes |
--operation | segment, detect, ocr, describe, qa | Yes |
--query / -q | Text prompt for segment/qa operations | For segment/qa |
--model / -m | Override model endpoint | No |
To discover the best and latest vision/analysis models, use the search API:
# Search for segmentation models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "segmentation"
# Search for object detection models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "object detection"
# Search for OCR models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "ocr"
# Search for image captioning / visual QA models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "caption"
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "visual question"
Or use the search_models MCP tool with keywords like "segmentation", "detection", "ocr", "caption", "vision".