From oma
Converts HWP/HWPX/HWPML files to Markdown using kordoc, extracting text, headings, tables, lists, images, footnotes, hyperlinks. For Korean Hangul documents, government files, AI/RAG prep.
npx claudepluginhub first-fluke/oh-my-agent --plugin omaThis skill uses the workspace's default tool permissions.
Convert Korean HWP-family documents into readable Markdown or structured JSON while preserving document structure for LLM context, RAG, government-document review, or enterprise document processing.
Parses HWP 5.x, HWPX, and PDF Korean documents to Markdown and structured IRBlock data using kordoc TypeScript library and CLI. Supports table extraction, form fields, diffing, OCR, and MCP server.
Converts PDF, DOCX, PPTX, XLSX, HTML, images to structured Markdown or JSON using Docling. Preserves tables, layout, hierarchy for RAG pipelines and knowledge bases.
Converts PDF, Word, PPTX, PPT, and TXT documents to Markdown, preserving titles, lists, tables, structure, and PPT slide sections. Adds frontmatter, annotates images, suggests output path for archiving or import.
Share bugs, ideas, or general feedback.
Convert Korean HWP-family documents into readable Markdown or structured JSON while preserving document structure for LLM context, RAG, government-document review, or enterprise document processing.
.hwp, .hwpx, or .hwpml..hwp, .hwpx, .hwpml) to Markdownoma-pdf (OCR + Tagged PDF specialization)oma-docs)input_path: .hwp, .hwpx, or .hwpml file pathoutput_path or output_dir: optional explicit output targetformat: optional output format, default markdownpage_range: optional page or section rangekordoc_version: optional pinned kordoc versionbun and bunxbunx kordoc@latest or configured pinned kordoc versionresources/flatten-tables.ts for Markdown cleanupbunx and bun run.hwp, .hwpx, or .hwpml.bun is available.kordoc with explicit output target and requested options..pdf, stop and route to oma-pdf..xlsx or .docx, explain that this skill does not advertise those formats.bun is unavailable, stop and ask the user to install Bun.resources/flatten-tables.ts unless the caller explicitly needs HTML tables or PUA glyphs preserved.resources/troubleshooting.md.| Failure | Recovery |
|---|---|
bun or bunx unavailable | Ask user to install Bun |
| Unsupported or mismatched format | Check extension and magic bytes, then route or stop |
| Encrypted or DRM-locked document | Report limitation and request an accessible copy when needed |
| Empty Markdown output | Treat as possible scanned-image content and recommend OCR outside this skill |
| Complex merged tables | Accept flattened Markdown or HTML fallback as best effort |
| Stale kordoc cache | Use bunx kordoc@latest or configured pinned version |
| Action | SSL primitive | Evidence |
|---|---|---|
| Validate file path and extension | VALIDATE | Input preflight in execution protocol |
| Check runtime availability | VALIDATE | bun --version |
| Select output target and format | SELECT | Output behavior and config |
| Run converter | CALL_TOOL | bunx kordoc@latest |
| Write output artifact | WRITE | Markdown or JSON output |
| Flatten tables and strip PUA glyphs | CALL_TOOL | resources/flatten-tables.ts |
| Inspect extraction quality | VALIDATE | Verification step |
| Report result | NOTIFY | Final user-facing summary |
kordoc: primary HWP-family conversion CLIflatten-tables.ts: post-processing for GFM tables and Hancom PUA cleanupbun / bunx: runtime and CLI executorbunx kordoc@latest "{input_path}" -o "{output_path}"
bun run ".agents/skills/oma-hwp/resources/flatten-tables.ts" "{output_path}"
For batch conversion, use an explicit output directory:
bunx kordoc@latest "{input_pattern}" -d "{output_dir}"
| Scope | Resource target |
|---|---|
LOCAL_FS | Input HWP-family files and generated outputs |
PROCESS | bunx kordoc and bun run subprocesses |
MEMORY | Format decisions, validation notes, and final report |
bun is installed.kordoc can parse the document or fail with a reportable error.@latest or an explicit pinned version to avoid stale bunx cache.resources/execution-protocol.md and troubleshooting branches in resources/troubleshooting.md.| Format | Extension | Notes |
|---|---|---|
| HWP 5.x binary | .hwp | Full support (incl. DRM-locked via kordoc's rhwp-algorithm port) |
| HWPX | .hwpx | Full support incl. nested tables, merged cells |
| HWPML | .hwp (XML variant) | Auto-detected by signature |
kordoc also parses PDF / XLSX / DOCX. Those are intentionally outside this skill's scope — see "When NOT to use".
resources/execution-protocol.mdresources/troubleshooting.mdconfig/hwp-config.yaml../oma-pdf/SKILL.md (use for .pdf inputs)