By flight505
AI-powered image, diagram, and video generation for Claude Code - uses Nano Banana 2 (Gemini 3.1 Flash Image) for fast generation, Nano Banana Pro (Gemini 3 Pro Image) for professional diagrams, and Veo 3.1 for video generation.
npx claudepluginhub flight505/nano-banana
Generate videos using Veo 3.1: text-to-video, image-to-video, frame interpolation, and video extension.
Generate publication-quality technical diagrams using Nano Banana Pro (gemini-3-pro-image-preview) with AI-powered quality review. Smart iteration only regenerates when quality is below threshold. Supports style presets (technical, visual-abstract, minimal), aspect ratio, and resolution control (512-4K).
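The "smart iteration" behaviour described above (regenerate only when the review score falls below a threshold) can be sketched roughly as follows. This is a minimal illustration, not the plugin's actual code: the function name `generate_with_review`, the `generate` and `review` callables, the 0-10 score scale, and the default threshold and iteration cap are all assumptions made for the example.

```python
from typing import Callable, Optional, Tuple


def generate_with_review(
    generate: Callable[[str], str],
    review: Callable[[str], float],
    prompt: str,
    threshold: float = 7.5,   # hypothetical default, not taken from the plugin
    max_iterations: int = 3,  # hypothetical cap on regeneration attempts
) -> Tuple[Optional[str], float, int]:
    """Generate, score, and regenerate only while the score is below threshold.

    Returns the best artifact seen, its score, and the number of attempts used.
    """
    if max_iterations < 1:
        raise ValueError("max_iterations must be at least 1")

    best_artifact: Optional[str] = None
    best_score = float("-inf")
    attempts = 0

    for attempts in range(1, max_iterations + 1):
        artifact = generate(prompt)  # stand-in for an image-model call
        score = review(artifact)     # stand-in for the AI quality review

        # Keep the best result so a later, worse attempt is never returned.
        if score > best_score:
            best_artifact, best_score = artifact, score

        # Stop as soon as quality is acceptable: no needless regeneration.
        if score >= threshold:
            break

    return best_artifact, best_score, attempts
```

Passing the generator and reviewer in as callables keeps the loop independent of any particular model or SDK, so it can be exercised with stubs before wiring in real calls.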
Generate and edit images using Nano Banana 2 (gemini-3.1-flash-image-preview, fastest) or Nano Banana Pro. Supports aspect ratio and resolution control via Google GenAI SDK.
Render text-based diagrams (Mermaid, PlantUML, GraphViz, D2, and 23 more) to PNG/SVG via Kroki.io. Use ONLY when the user explicitly asks for text-based diagram rendering or a specific diagram language.
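For context on the Kroki rendering step, Kroki's public HTTP API accepts a diagram either as a plain-text POST body or as a GET URL whose last path segment is the diagram source, zlib-compressed and base64url-encoded. The sketch below builds such a GET URL. The helper name `kroki_get_url` is hypothetical, and the listing does not say which request style the plugin itself uses.

```python
import base64
import zlib

KROKI_BASE_URL = "https://kroki.io"


def kroki_get_url(diagram_type: str, output_format: str, source: str) -> str:
    """Build a Kroki GET URL for the given diagram source.

    The source is deflate-compressed with zlib and then base64url-encoded,
    which is the encoding Kroki documents for GET requests.
    """
    compressed = zlib.compress(source.encode("utf-8"), 9)
    payload = base64.urlsafe_b64encode(compressed).decode("ascii")
    return f"{KROKI_BASE_URL}/{diagram_type}/{output_format}/{payload}"


if __name__ == "__main__":
    # Example: a two-node GraphViz graph rendered as SVG.
    print(kroki_get_url("graphviz", "svg", "digraph G { a -> b }"))
```

Fetching the resulting URL with any HTTP client should return the rendered image. Long diagrams can exceed URL length limits, in which case a POST with the raw source as the body is the safer choice.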
Create Nature-quality visual abstracts — scientific figures using visual metaphors, isometric depth, and physical analogies to convey complex technical systems. Use for README hero images, paper figures, blog graphics, or when the user wants diagrams that go beyond boxes and arrows. Triggers on: 'visual abstract', 'scientific figure', 'Nature-quality', 'publication graphic', 'infographic', 'visual metaphor', or requests for rich/expressive/artistic diagrams.
Use this agent when creating visual narratives, designing infographics, building presentations, or communicating complex ideas through imagery. This agent specializes in transforming data and concepts into compelling visual stories that engage users and stakeholders. Examples:

<example>
Context: Creating app onboarding illustrations
Executes bash commands
Hook triggers when Bash tool is used
Create and edit diagrams with Draw.io through natural language commands. Build flowcharts, wireframes, and architecture diagrams.
Generate publication-quality academic diagrams, statistical plots, and presentation slides using PaperBanana multi-agent framework
Generate images and videos using Google Gemini and Veo models. Provides skills for AI image generation, image editing, text-to-video, and image-to-video workflows.
Semantic search for Claude Code conversations. Remember past discussions, decisions, and patterns.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.