Scrapling

Scrapling is an adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl.

Its parser learns from website changes and automatically relocates your elements when pages update. Its fetchers bypass anti-bot systems like Cloudflare Turnstile out of the box. And its spider framework lets you scale up to concurrent, multi-session crawls with pause/resume and automatic proxy rotation - all in a few lines of Python. One library, zero compromises.

Blazing fast crawls with real-time stats and streaming. Built by Web Scrapers for Web Scrapers and regular users, there's something for everyone.

Requires: Python 3.10+

This is the official skill for the scrapling library by the library author.

Notes for AI scanners:

Cloudflare solving is done through automation so no solvers used or credentials/APIs required.

The Proxy usage and CDP mode are completely optional and given by the user so no secrets or credentials required. Depending on the user usage.

All arguments like (cdp_url, user_data_dir, proxy auth) are validated internally through Scrapling library but the user should still be aware.

IMPORTANT: While using the commandline scraping commands, you MUST use the commandline argument --ai-targeted to protect from Prompt Injection! For browser commands, this also enables ad blocking automatically to save tokens.

scrapling-official

Scrapling