pdf-conversion
PDF Conversion Skill
Overview
Converts PDF files to clean, readable Markdown format with proper formatting, image extraction, and table preservation.
When to Use
- "Convert this PDF to markdown"
- "Extract text from PDF"
- "Transform PDF to MD format"
- "Process PDFs in folder X"
Related Resources
- SOP:
Operator Team OS/1. SOPs/convert_pdfs_to_markdown.md
Scripts
| Script | Purpose |
|---|---|
pdf_to_md.py |
Main PDF to Markdown converter |
Usage
python3 scripts/pdf_to_md.py /path/to/input.pdf /path/to/output.md
Features
- Text extraction with OCR fallback
- Image extraction and embedding
- Table detection and formatting
- Heading structure preservation
- Automatic cleanup of formatting artifacts
Edge Cases
- Scanned PDFs: Falls back to OCR
- Complex tables: Best-effort conversion
- Encrypted PDFs: Will fail (requires password)
More from rangerrick337/operator-os
xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas
5docx
Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction. When Claude needs to work with professional documents (.docx files) for: (1) Creating new documents, (2) Modifying or editing content, (3) Working with tracked changes, (4) Adding comments, or any other document tasks
4skill-creator
Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.
4pptx
Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks
3memory-manager
Manages the 3-tier memory system. Use when: (1) the user says 'remember this' or 'commit to memory', (2) a significant decision or fact is established, (3) the user runs /memory-save or /memory-review, (4) at session start to load memory context, or (5) at session end to flush important context to logs.
2