defuddle: Web Page Cleaner

Defuddle extracts the meaningful content from a web page and drops everything else: ads, cookie banners, nav bars, related articles, footers, social sharing buttons. What remains is the article body as clean markdown.

Use this before any URL ingestion. It is optional but strongly recommended. It cuts token usage by 40-60% on typical web articles and produces cleaner wiki pages.

Substrate note (v1.7+): Unlike obsidian-markdown / obsidian-bases / json-canvas (where we defer to kepano/obsidian-skills as upstream), the defuddle skill is original to claude-obsidian — kepano's marketplace does not ship a defuddle skill. This is the canonical version. The underlying defuddle-cli is independent of either marketplace and lives at github.com/kepano/defuddle.

Install

npm install -g defuddle-cli

Verify: defuddle --version