pp-firecrawl
Firecrawl — Printing Press CLI
Prerequisites: Install the CLI
This skill drives the firecrawl-pp-cli binary. You must verify the CLI is installed before invoking any command from this skill. If it is missing, install it first:
- Install via the Printing Press installer:
  npx -y @mvanhorn/printing-press install firecrawl --cli-only
- Verify:
  firecrawl-pp-cli --version
- Ensure $GOPATH/bin (or $HOME/go/bin) is on $PATH.
If the npx install fails (no Node, offline, etc.), fall back to a direct Go install (requires Go 1.23+):
go install github.com/mvanhorn/printing-press-library/library/developer-tools/firecrawl/cmd/firecrawl-pp-cli@latest
If --version reports "command not found" after install, the install step did not put the binary on $PATH. Do not proceed with skill commands until verification succeeds.
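The verification gate can be scripted roughly as follows. This is a minimal sketch: require_cli is a hypothetical helper name (not part of the CLI), and it only checks that a binary resolves on $PATH using the POSIX command -v builtin.

```shell
# Hypothetical helper: succeed only if the named binary resolves on $PATH.
require_cli() {
  if command -v "$1" >/dev/null 2>&1; then
    return 0
  fi
  echo "$1 not found on PATH; install it before running skill commands" >&2
  return 1
}

# Typical gate before any skill command:
#   require_cli firecrawl-pp-cli && firecrawl-pp-cli --version
```

Resolving on $PATH is necessary but not sufficient, so the sketch still pairs the check with firecrawl-pp-cli --version as the section requires.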
Command Reference
batch — Manage batch
- firecrawl-pp-cli batch cancel-scrape: Cancel a batch scrape job
- firecrawl-pp-cli batch get-scrape-errors: Get the errors of a batch scrape job
- firecrawl-pp-cli batch get-scrape-status: Get the status of a batch scrape job
- firecrawl-pp-cli batch scrape-and-extract-from-urls: Scrape multiple URLs and optionally extract information using an LLM
crawl — Manage crawl
- firecrawl-pp-cli crawl cancel: Cancel a crawl job
- firecrawl-pp-cli crawl get-active: Get all active crawls for the authenticated team
- firecrawl-pp-cli crawl get-status: Get the status of a crawl job
- firecrawl-pp-cli crawl urls: Crawl multiple URLs based on options
deep-research — Manage deep research
- firecrawl-pp-cli deep-research get-status: Get the status and results of a deep research operation
- firecrawl-pp-cli deep-research start: Start a deep research operation on a query
extract — Manage extract
- firecrawl-pp-cli extract data: Extract structured data from pages using LLMs
- firecrawl-pp-cli extract get-status: Get the status of an extract job
firecrawl-search — Manage firecrawl search
- firecrawl-pp-cli firecrawl-search: Search and optionally scrape search results
llmstxt — Manage llmstxt
- firecrawl-pp-cli llmstxt generate-llms-txt: Generate LLMs.txt for a website
- firecrawl-pp-cli llmstxt get-llms-txt-status: Get the status and results of an LLMs.txt generation job
map — Manage map
- firecrawl-pp-cli map: Map multiple URLs based on options
scrape — Manage scrape
- firecrawl-pp-cli scrape: Scrape a single URL and optionally extract information using an LLM
team — Manage team
- firecrawl-pp-cli team get-credit-usage: Get remaining credits for the authenticated team
- firecrawl-pp-cli team get-token-usage: Get remaining tokens for the authenticated team (Extract only)
Finding the right command
When you know what you want to do but not which command does it, ask the CLI directly:
firecrawl-pp-cli which "<capability in your own words>"
which resolves a natural-language capability query to the best matching command from this CLI's curated feature index. Exit code 0 means at least one match; exit code 2 means no confident match — fall back to --help or use a narrower query.
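One way to act on those exit codes from a script is sketched below. find_command is a hypothetical wrapper name; it takes the full lookup command as its arguments, prints the match on exit code 0, and reports the fallback on exit code 2. The format of the match output is not documented here, so the sketch simply passes it through.

```shell
# Hypothetical wrapper: run a lookup command and branch on its exit status.
# "$@" is the complete command, e.g. firecrawl-pp-cli which "scrape one page".
find_command() {
  fc_status=0
  match=$("$@" 2>/dev/null) || fc_status=$?
  case $fc_status in
    0) printf '%s\n' "$match" ;;                      # at least one match
    2) echo "no confident match: try --help or a narrower query" >&2
       return 2 ;;
    *) echo "lookup failed with exit code $fc_status" >&2
       return "$fc_status" ;;
  esac
}

# Example call (requires the CLI to be installed):
#   find_command firecrawl-pp-cli which "scrape one page"
```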
Auth Setup
Store your access token:
firecrawl-pp-cli auth set-token YOUR_TOKEN_HERE
Or set FIRECRAWL_BEARER_AUTH as an environment variable.
Run firecrawl-pp-cli doctor to verify setup.
Agent Mode
Add --agent to any command. Expands to: --json --compact --no-input --no-color --yes.
- Pipeable: JSON on stdout, errors on stderr
- Filterable: --select keeps a subset of fields. Dotted paths descend into nested structures; arrays traverse element-wise. Critical for keeping context small on verbose APIs:
  firecrawl-pp-cli batch cancel-scrape mock-value --agent --select id,name,status
- Previewable: --dry-run shows the request without sending
- Offline-friendly: sync/search commands can use the local SQLite store when available
- Non-interactive: never prompts, every input is a flag
Response envelope
Commands that read from the local store or the API wrap output in a provenance envelope:
{
"meta": {"source": "live" | "local", "synced_at": "...", "reason": "..."},
"results": <data>
}
Parse .results for data and .meta.source to know whether it's live or local. A human-readable N results (live) summary is printed to stderr only when stdout is a terminal — piped/agent consumers get pure JSON on stdout.
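A minimal sketch of splitting the envelope in a pipeline, assuming jq is installed (jq is not part of this CLI). envelope_source and envelope_results are hypothetical helper names, and the sample value is illustrative only, not real CLI output.

```shell
# Hypothetical helpers: read an envelope on stdin and print one part of it.
# Assumes jq is available on $PATH.
envelope_source()  { jq -r '.meta.source'; }   # prints "live" or "local"
envelope_results() { jq -c '.results'; }       # prints the data as compact JSON

# Illustrative envelope; real output comes from a firecrawl-pp-cli command.
sample='{"meta":{"source":"local","synced_at":"...","reason":"..."},"results":[{"id":1}]}'

# Example:
#   printf '%s' "$sample" | envelope_source    # prints: local
#   printf '%s' "$sample" | envelope_results   # prints: [{"id":1}]
```

Checking .meta.source first lets a caller decide whether local data is fresh enough before it reads .results.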
Agent Feedback
When you (or the agent) notice something off about this CLI, record it:
firecrawl-pp-cli feedback "the --since flag is inclusive but docs say exclusive"
firecrawl-pp-cli feedback --stdin < notes.txt
firecrawl-pp-cli feedback list --json --limit 10
Entries are stored locally at ~/.firecrawl-pp-cli/feedback.jsonl. They are never POSTed unless FIRECRAWL_FEEDBACK_ENDPOINT is set AND either --send is passed or FIRECRAWL_FEEDBACK_AUTO_SEND=true. Default behavior is local-only.
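The send rule reads as a small boolean condition. The sketch below restates it for illustration; should_send_feedback is a hypothetical function, not the CLI's implementation, and its first argument stands in for whether --send was passed.

```shell
# Hypothetical restatement of the documented rule.
# $1 is "yes" when --send was passed on the command line.
should_send_feedback() {
  # Without an endpoint, nothing is ever sent.
  [ -n "${FIRECRAWL_FEEDBACK_ENDPOINT:-}" ] || return 1
  # With an endpoint, either --send or auto-send is enough.
  [ "${1:-}" = "yes" ] || [ "${FIRECRAWL_FEEDBACK_AUTO_SEND:-}" = "true" ]
}
```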
Write what surprised you, not a bug report. Short, specific, one line: that is the part that compounds.
Output Delivery
Every command accepts --deliver <sink>. The output goes to the named sink in addition to (or instead of) stdout, so agents can route command results without hand-piping. Three sinks are supported:
| Sink | Effect |
|---|---|
| stdout | Default; write to stdout only |
| file:<path> | Atomically write output to <path> (tmp + rename) |
| webhook:<url> | POST the output body to the URL (application/json or application/x-ndjson when --compact) |
Unknown schemes are refused with a structured error naming the supported set. Webhook failures return non-zero and log the URL + HTTP status on stderr.
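An agent can validate a sink string before passing it to --deliver. The sketch below classifies a value against the three documented sinks; sink_kind is a hypothetical helper, and its error text is illustrative rather than the CLI's actual structured error.

```shell
# Hypothetical pre-check: classify a --deliver value by its scheme.
sink_kind() {
  case "${1:-}" in
    stdout)     echo "stdout" ;;
    file:?*)    echo "file" ;;       # file:<path>, path must be non-empty
    webhook:?*) echo "webhook" ;;    # webhook:<url>, url must be non-empty
    *)
      echo "unsupported sink: ${1:-} (supported: stdout, file:<path>, webhook:<url>)" >&2
      return 1 ;;
  esac
}
```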
Named Profiles
A profile is a saved set of flag values, reused across invocations. Use it when a scheduled agent calls the same command every run with the same configuration - HeyGen's "Beacon" pattern.
firecrawl-pp-cli profile save briefing --json
firecrawl-pp-cli --profile briefing batch cancel-scrape mock-value
firecrawl-pp-cli profile list --json
firecrawl-pp-cli profile show briefing
firecrawl-pp-cli profile delete briefing --yes
Explicit flags always win over profile values; profile values win over defaults. agent-context lists all available profiles under available_profiles so introspecting agents discover them at runtime.
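The precedence rule can be illustrated with a three-way fallback. resolve_setting is a hypothetical function used only to show the ordering; it treats an empty string as "not set", a simplification that the CLI may not share.

```shell
# Hypothetical illustration of precedence: explicit flag > profile > default.
# $1 = explicit flag value, $2 = profile value, $3 = built-in default.
resolve_setting() {
  if [ -n "${1:-}" ]; then
    echo "$1"
  elif [ -n "${2:-}" ]; then
    echo "$2"
  else
    echo "${3:-}"
  fi
}

# Example:
#   resolve_setting "" 10 25   # prints: 10 (profile value beats the default)
```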
Exit Codes
| Code | Meaning |
|---|---|
| 0 | Success |
| 2 | Usage error (wrong arguments) |
| 3 | Resource not found |
| 4 | Authentication required |
| 5 | API error (upstream issue) |
| 7 | Rate limited (wait and retry) |
| 10 | Config error |
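A calling script can branch on these codes with a case statement. The sketch below maps each documented code to a short label and marks only code 7 as retryable, following the table; describe_exit and should_retry are hypothetical helper names.

```shell
# Hypothetical helpers built from the exit code table above.
describe_exit() {
  case "${1:-}" in
    0)  echo "success" ;;
    2)  echo "usage error" ;;
    3)  echo "not found" ;;
    4)  echo "auth required" ;;
    5)  echo "api error" ;;
    7)  echo "rate limited" ;;
    10) echo "config error" ;;
    *)  echo "unknown" ;;
  esac
}

# Only rate limiting is documented as "wait and retry".
should_retry() { [ "${1:-0}" -eq 7 ]; }
```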
Argument Parsing
Parse $ARGUMENTS:
- Empty, help, or --help → show firecrawl-pp-cli --help output
- Starts with install → ends with mcp → MCP installation; otherwise → see Prerequisites above
- Anything else → Direct Use (execute as CLI command with --agent)
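The routing rules above can be sketched as a single case statement. route_arguments is a hypothetical function that takes the whole $ARGUMENTS string as one argument and prints a route label; the labels themselves are illustrative.

```shell
# Hypothetical router for the $ARGUMENTS string.
# Pattern order matters: the more specific install*mcp must precede install*.
route_arguments() {
  case "${1:-}" in
    ""|help|--help) echo "help" ;;           # show firecrawl-pp-cli --help
    install*mcp)    echo "mcp-install" ;;    # starts with install, ends with mcp
    install*)       echo "prerequisites" ;;  # any other install request
    *)              echo "direct-use" ;;     # run as a CLI command with --agent
  esac
}
```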
MCP Server Installation
- Install the MCP server:
  go install github.com/mvanhorn/printing-press-library/library/other/firecrawl-pp-cli/cmd/firecrawl-pp-mcp@latest
- Register with Claude Code:
  claude mcp add firecrawl-pp-mcp -- firecrawl-pp-mcp
- Verify:
  claude mcp list
Direct Use
- Check if installed:
  which firecrawl-pp-cli
  If not found, offer to install (see Prerequisites at the top of this skill).
- Match the user query to the best command from the Command Reference above.
- Execute with the --agent flag:
  firecrawl-pp-cli <command> [subcommand] [args] --agent
- If ambiguous, drill into subcommand help:
  firecrawl-pp-cli <command> --help