gem

Description

Multimodal AI processing using Google Gemini. Use for analyzing PDFs, images, videos, YouTube links, and other large documents. Ideal when you need to extract information from files that require vision or multimodal understanding.

Install

Add the repository(one-time)

/plugin marketplace add hamelsmu/hamel

Install the plugin

/plugin install hamel-tools@hamel

Tool Access

This skill inherits all available tools. When active, it can use any tool Claude has access to.

Skill Content

Gemini Multimodal Tool

Use the ai-gem CLI tool for multimodal AI processing via Google's Gemini API.

Usage

# Text queries
ai-gem "Write a haiku about Python programming"

# Analyze documents
ai-gem "Summarize this document" document.pdf

# Analyze images
ai-gem "What's in this image?" photo.jpg

# Process YouTube videos
ai-gem "Create a 5-point summary" "https://youtu.be/VIDEO_ID"

# Compare multiple files
ai-gem "Compare these files" file1.pdf file2.png

# Web search
ai-gem "Current AI news" --search

Requirements

GEMINI_API_KEY environment variable must be set
The hamel package must be installed: pip install hamel

Supported Input Types

PDFs
Images (PNG, JPEG, GIF, WebP)
Videos (MP4, etc.)
YouTube URLs
Plain text files
Multiple files for comparison

Links

GitHub Stats

3 forks

Updated 5 hours ago

Similar Skills

algorithmic-art

3 files

Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use this when users request creating art using code, generative art, algorithmic art, flow fields, or particle systems. Create original algorithmic art rather than copying existing artists' work to avoid copyright violations.

anthropic-skills

31.2k

brand-guidelines

1 file

Applies Anthropic's official brand colors and typography to any sort of artifact that may benefit from having Anthropic's look-and-feel. Use it when brand colors or style guidelines, visual formatting, or company design standards apply.

anthropic-skills

31.2k

canvas-design

20 files

Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art, design, or other static piece. Create original visual designs, never copying existing artists' work to avoid copyright violations.

anthropic-skills

31.2k