documents
Customization
Before executing, check for user customizations at:
~/.claude/skills/CORE/USER/SKILLCUSTOMIZATIONS/Documents/
If this directory exists, load and apply any PREFERENCES.md, configurations, or resources found there. These override default behavior. If the directory does not exist, proceed with skill defaults.
🚨 MANDATORY: Voice Notification (REQUIRED BEFORE ANY ACTION)
You MUST send this notification BEFORE doing anything else when this skill is invoked.
-
Send voice notification:
curl -s -X POST http://localhost:8888/notify \ -H "Content-Type: application/json" \ -d '{"message": "Running the WORKFLOWNAME workflow in the Documents skill to ACTION"}' \ > /dev/null 2>&1 & -
Output text notification:
Running the **WorkflowName** workflow in the **Documents** skill to ACTION...
This is not optional. Execute this curl command immediately upon skill invocation.
Documents Skill
🎯 Load Full CORE Context
Before starting any task with this skill, load complete CORE context:
read ~/.claude/skills/CORE/SKILL.md
When to Activate This Skill
Word Documents (DOCX)
- User wants to create, edit, or analyze Word documents
- User mentions "tracked changes", "redlining", "document review"
- User needs to convert documents to other formats
- User wants to work with document structure, comments, or formatting
PDF Files
- User wants to create, merge, split, or manipulate PDFs
- User mentions "extract text from PDF", "PDF tables", "fill PDF form"
- User needs to convert PDFs to/from other formats
- User wants to add watermarks, passwords, or extract images
PowerPoint Presentations (PPTX)
- User wants to create or edit presentations
- User mentions "slides", "presentation template", "speaker notes"
- User needs to convert presentations to other formats
- User wants to work with slide layouts or design elements
Excel Spreadsheets (XLSX)
- User wants to create or edit spreadsheets
- User mentions "formulas", "financial model", "data analysis"
- User needs to work with Excel tables, charts, or pivot tables
- User wants to convert spreadsheets to/from other formats
🔀 Document Type Routing
This skill organizes document processing across 4 document types:
Word Documents (DOCX)
Reference Documentation:
docx/SKILL.md- Complete DOCX processing guidedocx/docx-js.md- Creating new documents with JavaScriptdocx/ooxml.md- Editing existing documents with OOXML
Routing Logic:
- "Create Word document", "new docx" → Create workflow (docx-js)
- "Edit Word document", "tracked changes", "redlining" → Edit workflow (OOXML)
- "Read Word document", "extract text from docx" → Read workflow (pandoc)
- "Document review", "track changes" → Redlining workflow
Supporting Resources:
- Scripts:
~/.claude/skills/documents/docx/Scripts/ - OOXML tools:
~/.claude/skills/documents/docx/ooxml/ - License:
~/.claude/skills/documents/docx/LICENSE.txt
Key Capabilities:
- Create professional documents with docx-js
- Edit with tracked changes (redlining workflow)
- Extract text/comments with pandoc
- Convert to images for visual inspection
- Work with raw OOXML for advanced features
PDF Processing
Reference Documentation:
pdf/SKILL.md- Complete PDF processing guidepdf/forms.md- Filling PDF formspdf/reference.md- Advanced features and troubleshooting
Routing Logic:
- "Create PDF" → Creation workflow (reportlab)
- "Merge PDFs", "split PDF" → Manipulation workflow (pypdf)
- "Extract text from PDF" → Extraction workflow (pdfplumber)
- "Fill PDF form" → Forms workflow (pdf-lib or pypdf)
- "Extract tables from PDF" → Table extraction (pdfplumber + pandas)
Supporting Resources:
- Scripts:
~/.claude/skills/documents/pdf/Scripts/ - License:
~/.claude/skills/documents/pdf/LICENSE.txt
Key Capabilities:
- Create PDFs with reportlab
- Extract text/tables with pdfplumber
- Merge/split with pypdf or qpdf
- Fill forms programmatically
- Add watermarks and password protection
- Extract images from PDFs
PowerPoint Presentations (PPTX)
Reference Documentation:
pptx/SKILL.md- Complete PPTX processing guidepptx/html2pptx.md- Creating presentations from HTMLpptx/ooxml.md- Editing existing presentations
Routing Logic:
- "Create presentation", "new slides" → Creation workflow (html2pptx)
- "Edit presentation", "modify slides" → Edit workflow (OOXML)
- "Use presentation template" → Template workflow
- "Extract slide text" → Read workflow (markitdown)
- "Create thumbnail grid" → Thumbnail workflow
Supporting Resources:
- Scripts:
~/.claude/skills/documents/pptx/Scripts/ - OOXML tools:
~/.claude/skills/documents/pptx/ooxml/ - License:
~/.claude/skills/documents/pptx/LICENSE.txt
Key Capabilities:
- Create presentations with html2pptx (HTML → PPTX)
- Professional design with color palettes and layouts
- Edit with OOXML for advanced features
- Work with templates (rearrange, inventory, replace)
- Generate thumbnail grids for visual analysis
- Convert to images for inspection
Excel Spreadsheets (XLSX)
Reference Documentation:
xlsx/SKILL.md- Complete XLSX processing guidexlsx/recalc.py- Formula recalculation script
Routing Logic:
- "Create spreadsheet", "new Excel file" → Creation workflow (openpyxl)
- "Edit spreadsheet", "modify Excel" → Edit workflow (openpyxl)
- "Analyze data", "read Excel" → Analysis workflow (pandas)
- "Financial model", "formulas" → Financial modeling workflow
- "Recalculate formulas" → Recalculation workflow (recalc.py)
Supporting Resources:
- Recalc script:
~/.claude/skills/documents/xlsx/recalc.py - License:
~/.claude/skills/documents/xlsx/LICENSE.txt
Key Capabilities:
- Create spreadsheets with formulas (openpyxl)
- Data analysis with pandas
- Financial modeling with color coding standards
- Formula recalculation with LibreOffice
- Error detection and validation
- Preserve formatting and formulas when editing
📋 Document Processing Principles
DOCX Best Practices
- Tracked Changes - Use redlining workflow for professional document review
- Minimal Edits - Only mark text that actually changes, preserve original RSIDs
- Batch Changes - Group related edits (3-10 changes) for efficient processing
- Verification - Always convert to markdown to verify changes applied correctly
PDF Best Practices
- Library Selection - pypdf for basic ops, pdfplumber for text/tables, reportlab for creation
- OCR for Scanned - Use pytesseract + pdf2image for scanned documents
- Form Filling - Follow forms.md for programmatic form completion
- Command Line - Use qpdf/pdftotext for simple operations
PPTX Best Practices
- Design First - Analyze content and choose appropriate colors/layouts before coding
- Web-Safe Fonts - Only use web-safe fonts (Arial, Helvetica, Times, etc.)
- Visual Verification - Always generate thumbnails to inspect layout issues
- Template Analysis - Create inventory before using templates to understand structure
XLSX Best Practices
- Use Formulas - ALWAYS use Excel formulas, NEVER hardcode calculated values
- Zero Errors - Deliver with zero formula errors (#REF!, #DIV/0!, etc.)
- Recalculate - Run recalc.py after creating/editing to update formula values
- Financial Standards - Follow color coding (blue inputs, black formulas, green links)
Examples
Example 1: Create proposal with tracked changes
User: "Create a consulting proposal doc with redlining"
→ Routes to DOCX workflows
→ Creates document with docx-js
→ Enables tracked changes for review workflow
→ Outputs professional .docx with revision marks
Example 2: Fill a PDF form programmatically
User: "Fill out this NDA PDF with my info"
→ Routes to PDF workflows
→ Reads form fields from PDF
→ Fills fields programmatically with pdf-lib
→ Outputs completed, flattened PDF
Example 3: Build financial model spreadsheet
User: "Create a revenue projection spreadsheet"
→ Routes to XLSX workflows
→ Creates workbook with openpyxl
→ Adds formulas (never hardcoded values)
→ Runs recalc.py to update calculations
🔗 Integration with Other Skills
Feeds Into:
- writing skill - Creating documents for blog posts and newsletters
- business skill - Creating consulting proposals and financial models
- research skill - Extracting data from research documents
Uses:
- media skill - Creating images for document illustrations
- development skill - Building document processing automation
- system skill - Command-line tools and scripting
🎯 Key Principles
Document Creation
- Quality First - Professional formatting and structure from the start
- Template Reuse - Leverage existing templates when available
- Validation - Always verify output (visual inspection, error checking)
- Automation - Use scripts for repetitive tasks
Document Editing
- Preserve Intent - Maintain original formatting and structure
- Track Changes - Use proper workflows for document review
- Batch Processing - Group related operations for efficiency
- Error Prevention - Validate before finalizing
Document Analysis
- Right Tool - Choose appropriate library/tool for the task
- Data Integrity - Preserve original data when extracting/converting
- Format Awareness - Understand document structure (OOXML, PDF structure, etc.)
- Performance - Use efficient methods for large documents
📚 Full Reference Documentation
Word Documents (DOCX):
- Main Guide:
~/.claude/skills/documents/docx/SKILL.md - Creation Reference:
~/.claude/skills/documents/docx/docx-js.md - Editing Reference:
~/.claude/skills/documents/docx/ooxml.md
PDF Processing:
- Main Guide:
~/.claude/skills/documents/pdf/SKILL.md - Forms Guide:
~/.claude/skills/documents/pdf/forms.md - Advanced Reference:
~/.claude/skills/documents/pdf/reference.md
PowerPoint Presentations (PPTX):
- Main Guide:
~/.claude/skills/documents/pptx/SKILL.md - Creation Reference:
~/.claude/skills/documents/pptx/html2pptx.md - Editing Reference:
~/.claude/skills/documents/pptx/ooxml.md
Excel Spreadsheets (XLSX):
- Main Guide:
~/.claude/skills/documents/xlsx/SKILL.md - Recalc Script:
~/.claude/skills/documents/xlsx/recalc.py
Summary
The documents skill provides comprehensive document processing:
- DOCX - Create, edit, analyze Word documents with tracked changes support
- PDF - Create, manipulate, extract from PDFs with form filling capabilities
- PPTX - Create, edit presentations with professional design and templates
- XLSX - Create, edit spreadsheets with formulas and financial modeling
Reference-based organization - Each document type has complete guides and tooling
Routing is automatic - Analyzes user intent and activates appropriate document type workflow
Professional quality - Standards and best practices for production-ready documents