Open source hwp viewer and parser library powered by web technology
Updated Jan 10, 2025 - TypeScript
Comprehensive Python engine for extracting data from Hancom Word Processor (HWP) binary documents.
Read, fill, and edit Korean HWP (Hancom Office) documents in Python. Extract text for LLM / RAG pipelines, fill government & university forms programmatically, and rewrite the binary without corrupting it.
📄 Convert Word, Markdown, and HTML files to HWPX format with pypandoc-hwpx.