nano-pdf
nano-pdf
Use nano-pdf to apply edits to PDFs using natural-language instructions. This skill provides PDF editing capabilities beyond basic reading.
Installation
# Install uv if not present
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install nano-pdf
uv pip install nano-pdf
Quick Start
Edit a specific page:
nano-pdf edit deck.pdf 1 "Change the title to 'Q3 Results' and fix the typo in the subtitle"
Commands
Edit PDF Page
nano-pdf edit <pdf_file> <page_number> "<instruction>"
page_number: 0-based or 1-based depending on versioninstruction: Natural language description of what to change
Example:
nano-pdf edit report.pdf 0 "Update the header to 'Annual Report 2026'"
Create PDF from Text
nano-pdf create --output output.pdf --text "Your content here"
Merge PDFs
nano-pdf merge --output merged.pdf file1.pdf file2.pdf file3.pdf
Split PDF
nano-pdf split input.pdf --output-dir ./pages/
Extract Pages
nano-pdf extract input.pdf --pages 1-5 --output extracted.pdf
Notes
- Page numbers may be 0-based or 1-based; if results look off, try the other
- Requires OPENAI_API_KEY for AI-powered edits
- Always verify the output PDF before distribution
- For complex edits, break into multiple smaller operations
When to Use
Use this skill when:
- User asks to edit or modify a PDF
- Need to update text in an existing PDF
- Need to merge, split, or extract pages
- Working with reports, presentations, or documents that need updates
Limitations
- Complex layout changes may require multiple attempts
- Image-heavy PDFs may have limited editability
- Always sanity-check the output before sending
More from winsorllc/upgraded-carnival
vector-memory
Vector-based semantic memory using embeddings for intelligent recall. Store and search memories by meaning rather than keywords. Use when you need semantic search, similar document retrieval, or context-aware memory.
131model-router
Route requests between different LLM providers and models. Configure routing rules, fallback providers, and model-specific parameters inspired by ZeroClaw and OpenClaw model routing systems.
63rss-monitor
Monitor RSS/Atom feeds and blogs for new content using feedparser.
59rss-reader
Read and parse RSS/Atom feeds. Use when: user wants to subscribe to feeds, get latest articles, or monitor news sources.
54video-frames
Production-grade video frame extraction with thumbnail grids, GIF creation, and batch frame processing. Includes intelligent quality presets, progress tracking, and comprehensive error handling.
39elevenlabs-tts
Convert text to speech using ElevenLabs API. Use when you need to generate voice audio for messages, narrations, or accessibility.
25