ctx by ethan-huo/ctx

Execution rules

ctx is a global CLI tool. Run it directly (ctx read ...), never cd into the skill directory first.
Always use 2>&1 so stderr diagnostics (auto-retry messages, warnings) are visible.
Never pipe through head, tail, cut, or any truncation — ctx already manages content length via structural summaries.
Never use 2>/dev/null — stderr carries actionable diagnostics, not noise.

Core Workflow: search → read

1. Find documentation sources

ctx docs <library-name> "<query>"

Returns relevant document URLs. Be specific with queries — include language/framework name when ambiguous.

ctx docs mlx-swift "GPU stream thread safety"
ctx docs react "useEffect cleanup async"
ctx docs /ml-explore/mlx-swift "lazy evaluation"   # direct library ID

Use ctx search <name> [query] first if you need to find the right library.

2. Read documents

ctx read <url>

Flag	Short	Default	Description
(positional)		optional	URL or local path (`https://`, `github://`, `file://`, `/path`, `./path`)
`--no-cache`		false	Bypass cache, always fetch fresh
`--toc`		false	Show heading outline with section numbers and line counts
`--section`	`-s`		Section(s) to extract (e.g. `1`, `1-3`, `1.2,3.1`)
`--data`	`-d`		CF API request body (JSON5, `@file`, or stdin)

Auto-detects URL type and fetches accordingly:

Local file / file:// → direct read (always full content, no summary)
github://owner/repo@ref/path → GitHub API (supports @ref for versioned docs)
https://github.com/.../blob/... → GitHub API (auto-converted)
https://... (markdown/text/JSON/XML/YAML) → direct fetch
https://... (HTML/SPA) → auto JS rendering fallback via Cloudflare

When rendering via Cloudflare, a content-density heuristic automatically strips navigation, sidebar, header, and footer noise. This works for most sites without any configuration.

If the default cleanup selects the wrong content block on a specific site, use -d with addScriptTag to replace it with your own extraction logic:

ctx read -d '{url: "https://example.com", addScriptTag: [{content: "document.body.innerHTML = document.querySelector(\".doc-body\").outerHTML"}]}'

To find the right selector, probe the page first:

ctx scrape <url> -s "main" -s "article" -s ".content" — try common selectors, see which returns the content you need
Use the matched selector in addScriptTag to override the default cleanup

Other useful -d parameters: cookies, waitForSelector, gotoOptions.waitUntil, viewport.

stdout/stderr contract: stdout = document content, stderr = diagnostics (auto-retry, warnings). Both matter — see execution rules above.

3. Navigate large documents

Documents over 2000 lines produce a structural summary instead of the full content. The output starts with [ctx:summary] and shows every section heading with line counts and a preview:

[ctx:summary] 5000 lines, 12 sections. Read sections: ctx read <url> -s <number>
Full content: ~/.cache/ctx/{hash}.md

# 1 Getting Started (45 lines)
This library provides a unified interface...
...

## 1.1 Installation (12 lines)
npm install ctx-render
...

How to work with summaries:

Read the summary to understand the full document structure
Use -s to read the sections you need: ctx read <url> -s 2.1
Combine sections: ctx read <url> -s "1-3,5.2"
Or read the cache file path directly for the full raw content

Use --toc for a compact outline without previews.

Choosing the right tool

Explore before acting. When facing an unfamiliar site, start with ctx read or ctx links to understand what's there, then decide which tool fits.

Situation	Tool	Example
Need one page's full content	`ctx read <url>`	Read a doc page
Page is too long (>2000 lines)	`ctx read <url> -s <section>`	Navigate via structural summary
Only need specific elements from a page	`ctx scrape <url> -s "selector"`	Extract an API table, skip sidebar noise
Need content from many pages on one site	`ctx crawl <url> --limit N`	Pull an entire docs section
Don't know which pages to read	`ctx links <url>` then `ctx read`	Explore site structure first
Need visual info (UI, charts, layouts)	`ctx screenshot <url>`	Inspect rendered page; `--scroll 900` for below-the-fold
Screenshot only content, skip nav/ads	`ctx screenshot <url> --selector "main"`	Common: `main`, `article`, `.content`, `#content`
Need structured data extraction	`ctx json <url> --prompt "..."`	Extract pricing tiers as JSON

Fast SPA detection and fallback

ctx read returning empty or near-empty content is a strong signal that the page is a heavy SPA (JS-rendered, no semantic HTML, hashed CSS classes). When this happens:

Skip selector-based approaches — ctx scrape -s "main" and ctx screenshot --selector will likely fail too, because the same rendering instability that blocks ctx read also means DOM elements are unreliable.
Go straight to ctx screenshot — the default (no --selector) always works because it captures the full viewport. Use the metadata output (page=N viewport=N screen=X/Y) to navigate with --scroll.
Do NOT try to probe the DOM via addScriptTag JS injection — on SPAs, scripts often execute before the app renders, producing empty results.

Quick decision tree:

ctx read <url> → got content?
  YES → use read/scrape/screenshot --selector as needed
  NO  → heavy SPA → ctx screenshot + --scroll (skip selector attempts)

Common compositions

Docs research: ctx docs <lib> "<query>" → ctx read <url> → ctx read <url> -s N for deep sections
Full-site understanding: ctx crawl <url> --limit 20 --depth 2 (replaces manual links + read loop)
Surgical extraction: ctx read <url> --toc to find target → ctx scrape <url> -s "table.params" to extract it
Content-focused screenshot (semantic HTML sites only): ctx scrape <url> -s "main" -s "article" to find the right selector → ctx screenshot <url> --selector "article"

Browser Rendering Commands

These commands require ctx auth login cloudflare. Each has a dedicated reference — read it before first use:

Command	Use when	Reference
`ctx screenshot <url>`	Need visual information (UI, charts, layouts)	references/screenshot.md
`ctx links <url>`	Explore a site's link structure before reading	references/links.md
`ctx scrape <url> -s "selector"`	Extract specific elements (tables, code blocks)	references/scrape.md
`ctx json <url> --prompt "..."`	Extract structured data as JSON	references/json.md
`ctx crawl <url>`	Pull multiple pages from a documentation site	references/crawl.md

All commands support -d for full API request body (JSON5, @file, or stdin). Flags override -d fields.

Configuration

When you need to manage site authentication (cookies, headers), cache TTL, or viewport defaults, read references/settings.md.

Common pattern: user provides cookies/auth for a site → store as site headers → all subsequent requests are authenticated:

ctx site set example.com Cookie "sid=abc; token=xyz"

Self-Improvement

When you encounter friction — a command that doesn't work as expected, a misleading instruction in this skill, or confusing parameter design — read references/feedback.md and file a GitHub issue.