npx claudepluginhub aotenjou/silicon-paddleocrA Claude Code plugin that provides OCR (Optical Character Recognition) capabilities using PaddleOCR via the SiliconFlow API.
git clone https://github.com/aotenjou/silicon-PaddleOCR.git ~/.claude/plugins/silicon-PaddleOCR
export SILICONFLOW_API_KEY="your_api_key_here"
pip install openai
After installing the plugin, use the /ocr command in Claude Code:
/ocr /path/to/image.jpg
You can also run the OCR script directly:
# Single image
python3 skills/ocr/scripts/ocr_skill.py /path/to/image.jpg
# Multiple images with glob pattern
python3 skills/ocr/scripts/ocr_skill.py /path/to/images/*.png
# JSON output
python3 skills/ocr/scripts/ocr_skill.py --json /path/to/image.jpg
# Custom prompt
python3 skills/ocr/scripts/ocr_skill.py -p "Extract as Markdown table" /path/to/table.jpg
# Save results to file
python3 skills/ocr/scripts/ocr_skill.py --json --output results.json /path/to/images/*.jpg
| Argument | Description |
|---|---|
images | Image file path(s) or glob pattern (required) |
-k, --api-key | API key (default: SILICONFLOW_API_KEY env) |
-m, --model | OCR model (default: PaddlePaddle/PaddleOCR-VL-1.5) |
-p, --prompt | Custom recognition prompt |
-j, --json | Output in JSON format |
-o, --output | Save results to file |
--max-tokens | Max tokens in response (default: 300) |
Get your API key from SiliconFlow.
MIT License - see LICENSE file for details.
Contributions are welcome! Please feel free to submit issues or pull requests.
silicon-PaddleOCR/
├── .claude-plugin/
│ └── plugin.json # Plugin manifest
├── commands/
│ └── ocr.md # /ocr command definition
└── skills/
└── ocr/
├── SKILL.md # Skill documentation
├── scripts/
│ └── ocr_skill.py # Main implementation
├── references/
│ └── api-configuration.md
└── examples/
└── sample-usage.sh
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.