Search everything...

Back to Plugins

Plugin

OCR Skill

Name: OCR Skill
Author: aotenjou

By aotenjou

使用 PaddleOCR 识别图片中的文字内容

Component Overview

/ocr

Commands

Agents

silicon-paddle-ocr

Skills

Hooks

MCP Servers

LSP Servers

Output Styles

Install

npx claudepluginhub aotenjou/silicon-paddleocr

Component Details

Commands (1)

Context

/ocr

使用 PaddleOCR 识别图片中的文字

Skills (1)

silicon-paddle-ocr

/silicon-paddle-ocr

OCR skill using PaddleOCR model via SiliconFlow API. This skill should be used when the user asks to "recognize text from an image", "extract text from a photo", "OCR this image", "read text from screenshot", or mentions "PaddleOCR", "image text recognition", "text extraction from images".

README

silicon-PaddleOCR

A Claude Code plugin that provides OCR (Optical Character Recognition) capabilities using PaddleOCR via the SiliconFlow API.

Features

Extract text from images (JPG, PNG, WebP, BMP, GIF)
Batch processing with glob patterns
JSON output format for programmatic use
Customizable recognition prompts
Support for custom models and parameters

Installation

Clone this repository to your local Claude Code plugins directory:

git clone https://github.com/aotenjou/silicon-PaddleOCR.git ~/.claude/plugins/silicon-PaddleOCR

Set up your SiliconFlow API key as an environment variable:

export SILICONFLOW_API_KEY="your_api_key_here"

Install required Python dependencies:

pip install openai

Usage

Via Claude Code Command

After installing the plugin, use the /ocr command in Claude Code:

/ocr /path/to/image.jpg

Direct Script Usage

You can also run the OCR script directly:

# Single image
python3 skills/ocr/scripts/ocr_skill.py /path/to/image.jpg

# Multiple images with glob pattern
python3 skills/ocr/scripts/ocr_skill.py /path/to/images/*.png

# JSON output
python3 skills/ocr/scripts/ocr_skill.py --json /path/to/image.jpg

# Custom prompt
python3 skills/ocr/scripts/ocr_skill.py -p "Extract as Markdown table" /path/to/table.jpg

# Save results to file
python3 skills/ocr/scripts/ocr_skill.py --json --output results.json /path/to/images/*.jpg

Script Arguments

Argument	Description
`images`	Image file path(s) or glob pattern (required)
`-k, --api-key`	API key (default: SILICONFLOW_API_KEY env)
`-m, --model`	OCR model (default: PaddlePaddle/PaddleOCR-VL-1.5)
`-p, --prompt`	Custom recognition prompt
`-j, --json`	Output in JSON format
`-o, --output`	Save results to file
`--max-tokens`	Max tokens in response (default: 300)

Configuration

Get your API key from SiliconFlow.

Supported Image Formats

JPG/JPEG
PNG
WebP
BMP
GIF

License

MIT License - see LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

Project Structure

silicon-PaddleOCR/
├── .claude-plugin/
│   └── plugin.json              # Plugin manifest
├── commands/
│   └── ocr.md                   # /ocr command definition
└── skills/
    └── ocr/
        ├── SKILL.md             # Skill documentation
        ├── scripts/
        │   └── ocr_skill.py     # Main implementation
        ├── references/
        │   └── api-configuration.md
        └── examples/
            └── sample-usage.sh

Similar Plugins

ui-design

32.9k

193

Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns

Stats

Version1.0.0

Stars1

Installs4

MaintenanceExcellent

AddedFeb 16, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

OCR Skill

By aotenjou

使用 PaddleOCR 识别图片中的文字内容

Component Overview

/ocr

Commands

Agents

silicon-paddle-ocr

Skills

Hooks

MCP Servers

LSP Servers

Output Styles

Install

npx claudepluginhub aotenjou/silicon-paddleocr

Component Details

Commands (1)

Context

/ocr

使用 PaddleOCR 识别图片中的文字

Skills (1)

silicon-paddle-ocr

/silicon-paddle-ocr

README

silicon-PaddleOCR

A Claude Code plugin that provides OCR (Optical Character Recognition) capabilities using PaddleOCR via the SiliconFlow API.

Features

Extract text from images (JPG, PNG, WebP, BMP, GIF)
Batch processing with glob patterns
JSON output format for programmatic use
Customizable recognition prompts
Support for custom models and parameters

Installation

Clone this repository to your local Claude Code plugins directory:

git clone https://github.com/aotenjou/silicon-PaddleOCR.git ~/.claude/plugins/silicon-PaddleOCR

Set up your SiliconFlow API key as an environment variable:

export SILICONFLOW_API_KEY="your_api_key_here"

Install required Python dependencies:

pip install openai

Usage

Via Claude Code Command

After installing the plugin, use the /ocr command in Claude Code:

/ocr /path/to/image.jpg

Direct Script Usage

You can also run the OCR script directly:

# Single image
python3 skills/ocr/scripts/ocr_skill.py /path/to/image.jpg

# Multiple images with glob pattern
python3 skills/ocr/scripts/ocr_skill.py /path/to/images/*.png

# JSON output
python3 skills/ocr/scripts/ocr_skill.py --json /path/to/image.jpg

# Custom prompt
python3 skills/ocr/scripts/ocr_skill.py -p "Extract as Markdown table" /path/to/table.jpg

# Save results to file
python3 skills/ocr/scripts/ocr_skill.py --json --output results.json /path/to/images/*.jpg

Script Arguments

Argument	Description
`images`	Image file path(s) or glob pattern (required)
`-k, --api-key`	API key (default: SILICONFLOW_API_KEY env)
`-m, --model`	OCR model (default: PaddlePaddle/PaddleOCR-VL-1.5)
`-p, --prompt`	Custom recognition prompt
`-j, --json`	Output in JSON format
`-o, --output`	Save results to file
`--max-tokens`	Max tokens in response (default: 300)

Configuration

Get your API key from SiliconFlow.

Supported Image Formats

JPG/JPEG
PNG
WebP
BMP
GIF

License

MIT License - see LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

Project Structure

silicon-PaddleOCR/
├── .claude-plugin/
│   └── plugin.json              # Plugin manifest
├── commands/
│   └── ocr.md                   # /ocr command definition
└── skills/
    └── ocr/
        ├── SKILL.md             # Skill documentation
        ├── scripts/
        │   └── ocr_skill.py     # Main implementation
        ├── references/
        │   └── api-configuration.md
        └── examples/
            └── sample-usage.sh

Similar Plugins

ui-design

32.9k

193

Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns

Stats

Version1.0.0

Stars1

Installs4

MaintenanceExcellent

AddedFeb 16, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

OCR Skill

Component Overview

Install

Component Details

Commands (1)

Skills (1)

README

silicon-PaddleOCR

Features

Installation

Usage

Via Claude Code Command

Direct Script Usage

Script Arguments

Configuration

Supported Image Formats

License

Contributing

Project Structure

Similar Plugins

ui-design

OCR Skill

Component Overview

Install

Component Details

Commands (1)

Skills (1)

README

silicon-PaddleOCR

Features

Installation

Usage

Via Claude Code Command

Direct Script Usage

Script Arguments

Configuration

Supported Image Formats

License

Contributing

Project Structure

Similar Plugins

ui-design

nanobanana