pymupdf-pdf
SKILL.md
PyMuPDF PDF
Overview
Parse PDFs locally using PyMuPDF for fast, lightweight extraction into Markdown by default, with optional JSON and image/table outputs in a per-document directory.
Prereqs / when to read references
If you hit import errors (PyMuPDF not installed) or Nix libstdc++ issues, read:
references/pymupdf-notes.md
Quick start (single PDF)
# Run from the skill directory
./scripts/pymupdf_parse.py /path/to/file.pdf \
--format md \
--outroot ./pymupdf-output
Options
--format md|json|both(default:md)