Skill

htr-transcription

Transcribes handwritten historical documents using HTRflow MCP tools. Provides interactive viewer artifact, per-line JSON transcriptions, and archival exports.

Python

ai-ml

npx claudepluginhub ai-riksarkivet/ra-mcp --plugin ra-mcp-tools


Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/ra-mcp-tools:htr-transcription

User invocable

Model invocable

Inline context

Default effort


SKILL.md

139 lines · ~1.1k tokens


Stats

Language: Python

Parent stars: 8

Parent forks: 2

Maintenance: Excellent

Last commit: Apr 1, 2026


HTR Transcription

Transcribe handwritten historical documents using the HTRflow MCP server. Returns an interactive viewer, per-line transcription JSON, and archival exports.

Tools

htr_transcribe — Transcribe images and return result URLs

Workflow

1. Determine image source

http/https URLs (IIIF links, public image URLs): Use directly — skip to step 2.
Local files or attachments: Must be uploaded first. Use the /upload-files skill, then continue to step 2.
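
The split described in step 1 can be sketched in Python as follows. This is a minimal illustration, not part of the skill itself: `split_sources` is a hypothetical helper name, the example URL and path are placeholders, and the sketch assumes that anything without an http or https scheme is a local file that still needs uploading.

```python
from urllib.parse import urlparse


def split_sources(sources):
    """Separate ready-to-use URLs from local paths that must be uploaded first."""
    remote, local = [], []
    for src in sources:
        # urlparse lowercases the scheme; plain paths have an empty scheme
        scheme = urlparse(src).scheme
        if scheme in ("http", "https"):
            remote.append(src)
        else:
            local.append(src)
    return remote, local


remote, local = split_sources([
    "https://example.org/iiif/page1/full/max/0/default.jpg",  # placeholder URL
    "/home/claude/uploads/letter.png",                        # placeholder path
])
print(remote)  # can go straight to step 2
print(local)   # must be uploaded before step 2
```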

2. Transcribe

Call htr_transcribe exactly once, passing ALL image URLs together.

Batching rule: Never call htr_transcribe multiple times for separate images. Each call runs an expensive GPU pipeline — batch everything.
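
One way to honour the batching rule is to gather every URL into a single ordered list before making the call. The sketch below assumes the URLs arrive from more than one place (for example, the user's message and an earlier upload step). `collect_image_urls` and the example URLs are hypothetical and only illustrate the idea.

```python
def collect_image_urls(*url_groups):
    """Merge URL groups into one ordered list without duplicates."""
    seen = set()
    merged = []
    for group in url_groups:
        for url in group:
            if url not in seen:
                seen.add(url)
                merged.append(url)
    return merged


# Hypothetical inputs: URLs given in the message and URLs returned by an upload.
from_message = ["https://example.org/a.jpg", "https://example.org/b.jpg"]
from_uploads = ["https://example.org/b.jpg", "https://example.org/c.jpg"]

all_urls = collect_image_urls(from_message, from_uploads)
# A single htr_transcribe call would then receive all_urls,
# instead of one call per image.
print(all_urls)
```

Removing duplicates before the call avoids running the GPU pipeline twice on the same image.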

3. Present results

After transcription, present results as an inline artifact for the viewer and downloadable links for data exports.

3a. Inline viewer artifact

Download the viewer HTML, then inline all external dependencies (OpenSeadragon JS and images) so the artifact is fully self-contained (the artifact sandbox blocks external requests).

curl -sL "{viewer_url}" -o /home/claude/viewer.html

Then run this Python script to embed dependencies:

import base64
import re
import urllib.request

# Load the viewer HTML that was downloaded with curl
with open("/home/claude/viewer.html", "r", encoding="utf-8") as f:
    html = f.read()

# Inline OpenSeadragon JS (CDN script -> inline script)
osd_match = re.search(r'<script src="(https://cdn[^"]+openseadragon[^"]+)">\s*</script>', html)
if osd_match:
    with urllib.request.urlopen(osd_match.group(1)) as resp:
        osd_js = resp.read().decode("utf-8")
    html = html.replace(osd_match.group(0), f"<script>{osd_js}</script>")

# Embed all Gradio image URLs as base64 data URIs
for url in set(re.findall(
    r'https://riksarkivet-htr-demo\.hf\.space/gradio_api/file=[^\s"]+\.(?:jpg|png)', html
)):
    with urllib.request.urlopen(url) as resp:
        img_data = resp.read()
    # The MIME subtype for .jpg files is "jpeg"
    ext = "jpeg" if url.endswith(".jpg") else "png"
    data_uri = f"data:image/{ext};base64,{base64.b64encode(img_data).decode()}"
    html = html.replace(url, data_uri)

# Write the self-contained viewer to the outputs directory
with open("/mnt/user-data/outputs/viewer.html", "w", encoding="utf-8") as f:
    f.write(html)

Then call present_files with /mnt/user-data/outputs/viewer.html to render the interactive viewer as an inline artifact.
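
Before presenting the file, it can be worth checking that nothing external is still referenced, since the sandbox would block it. The following is a rough sketch under stated assumptions: `find_external_refs` is a hypothetical helper, and it only inspects `src` and `href` attributes, so URLs built inside JavaScript would not be detected.

```python
import re


def find_external_refs(html):
    """Return http(s) URLs still referenced by src or href attributes."""
    pattern = r'(?:src|href)\s*=\s*["\'](https?://[^"\']+)["\']'
    return sorted(set(re.findall(pattern, html, flags=re.IGNORECASE)))


# Made-up sample: one inlined script, one embedded image, one leftover CDN link.
sample = (
    '<script>/* inlined */</script>'
    '<img src="data:image/png;base64,AAAA">'
    '<link href="https://cdn.example.org/style.css" rel="stylesheet">'
)
print(find_external_refs(sample))  # ['https://cdn.example.org/style.css']
```

An empty list suggests the viewer is self-contained as far as this check can tell.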

3b. Export links

Provide the remaining URLs as clickable download links:

Transcription data: [pages_url] (per-line JSON)

Export: [export_url] (archival export)

Do NOT reproduce document text as plain text in your response — present the artifact and links instead.
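
As an illustration, the two links could be assembled like this. The sketch assumes the chat renders Markdown links. `format_export_links` is a hypothetical helper, and the URLs are placeholders standing in for the `pages_url` and `export_url` values returned by the tool.

```python
def format_export_links(pages_url, export_url):
    """Build the two download links as Markdown, without any transcribed text."""
    return "\n".join([
        f"- Transcription data: [per-line JSON]({pages_url})",
        f"- Export: [archival export]({export_url})",
    ])


print(format_export_links(
    "https://example.org/results/pages.json",  # placeholder for pages_url
    "https://example.org/results/export.xml",  # placeholder for export_url
))
```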

Options

Language

Value	Use when
`swedish`	Swedish handwriting (default)
`norwegian`	Norwegian handwriting
`english`	English handwriting
`medieval`	Medieval scripts

Layout

Value	Use when
`single_page`	Single pages, snippets, cropped regions (default)
`spread`	Two-page book openings (Swedish only)

Export format

Value	Description
`alto_xml`	ALTO XML — standard archival (default)
`page_xml`	PAGE XML — alternative archival format
`json`	JSON — structured data format
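
The three tables above can be read as a small set of allowed values with defaults. The sketch below checks a combination before it is sent. The keyword names `language`, `layout`, and `export_format` are assumptions about how the tool names its parameters, and `check_options` is a hypothetical helper.

```python
LANGUAGES = {"swedish", "norwegian", "english", "medieval"}
LAYOUTS = {"single_page", "spread"}
EXPORT_FORMATS = {"alto_xml", "page_xml", "json"}


def check_options(language="swedish", layout="single_page", export_format="alto_xml"):
    """Return the options as a dict, or raise ValueError for an unlisted value."""
    if language not in LANGUAGES:
        raise ValueError(f"unknown language: {language!r}")
    if layout not in LAYOUTS:
        raise ValueError(f"unknown layout: {layout!r}")
    if export_format not in EXPORT_FORMATS:
        raise ValueError(f"unknown export format: {export_format!r}")
    # The layout table lists "spread" as Swedish only.
    if layout == "spread" and language != "swedish":
        raise ValueError("layout 'spread' is listed as Swedish only")
    return {"language": language, "layout": layout, "export_format": export_format}


print(check_options(language="english", export_format="page_xml"))
```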

Custom pipeline

custom_yaml accepts a raw HTRflow YAML config string. When set, it overrides the language and layout options. Use it only when the user explicitly provides one.

Example — English modern handwriting with a custom TrOCR model:

steps:
- step: Segmentation
  settings:
    model: yolo
    model_settings:
      model: Riksarkivet/yolov9-lines-within-regions-1
- step: TextRecognition
  settings:
    model: TrOCR
    model_settings:
      model: microsoft/trocr-base-handwritten
    generation_settings:
      batch_size: 16
- step: OrderLines
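
Because custom_yaml overrides language and layout, a caller might simply leave those two options out whenever a custom pipeline is supplied. The sketch below shows that design choice. The argument names `image_urls` and `export_format` are assumptions, `build_arguments` is a hypothetical helper, and the tool may well accept all options together.

```python
def build_arguments(image_urls, language="swedish", layout="single_page",
                    export_format="alto_xml", custom_yaml=None):
    """Assemble call arguments; a custom pipeline replaces language and layout."""
    args = {"image_urls": list(image_urls), "export_format": export_format}
    if custom_yaml is not None:
        # The YAML string is passed through untouched.
        args["custom_yaml"] = custom_yaml
    else:
        args["language"] = language
        args["layout"] = layout
    return args


user_yaml = "steps:\n- step: OrderLines\n"  # stand-in for a user-provided config
print(build_arguments(["https://example.org/a.jpg"], custom_yaml=user_yaml))
```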
