actionbook
Pre-verified page actions and selectors for website automation without runtime discovery.
- Search a library of documented page interactions by task intent, then retrieve structured DOM details with tested CSS selectors ready for browser commands
- Browser commands cover navigation, form filling, clicking, text extraction, screenshots, and waiting for page changes
- Handles login walls by pausing automation and asking the user to complete authentication manually in the same session
- Daemon mode (Unix/CDP) maintains a persistent WebSocket connection per profile, eliminating per-command connection overhead
- Falls back to live accessibility tree snapshots when selectors become outdated due to website changes
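The search, retrieve, and fallback flow described in the list above can be sketched in a few lines. This is a minimal illustration, not Actionbook's actual API: the `Action` shape, the in-memory `LIBRARY`, and the functions `search_actions` and `resolve_selector` are all hypothetical names invented for this example, and the live accessibility snapshot is modeled as a plain dictionary from accessible label to selector.

```python
from dataclasses import dataclass


@dataclass
class Action:
    """One documented page interaction (hypothetical shape)."""
    intent: str    # what the action accomplishes, used for search
    label: str     # accessible name of the target element
    selector: str  # pre-verified CSS selector


# Hypothetical in-memory stand-in for the action library.
LIBRARY = [
    Action("search for a listing", "Query", "input[name='query']"),
    Action("submit search", "Search", "button[type='submit']"),
]


def search_actions(task: str) -> list[Action]:
    """Return actions whose intent shares at least one word with the task."""
    words = set(task.lower().split())
    return [a for a in LIBRARY if words & set(a.intent.lower().split())]


def resolve_selector(action: Action, live_selectors: set[str],
                     snapshot: dict[str, str]) -> str:
    """Prefer the documented selector; fall back to the snapshot if it is stale."""
    if action.selector in live_selectors:
        return action.selector
    # The documented selector no longer matches the page, so look the
    # element up by its accessible label in the live snapshot instead.
    return snapshot[action.label]
```

In this sketch the documented selector is tried first, and the snapshot is consulted only when that selector no longer matches anything on the page, which mirrors the fallback behavior described in the last bullet.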
When to Use This Skill
Activate when the user:
- Needs to do anything on a website ("Send a LinkedIn message", "Book an Airbnb", "Search Google for...")
- Asks how to interact with a site ("How do I post a tweet?", "How to apply on LinkedIn?")
- Wants to fill out forms, click buttons, navigate, search, filter, or browse on a specific site
- Wants to take a screenshot of a web page or monitor changes
- Builds browser-based AI agents, web scrapers, or E2E tests for external websites
- Automates repetitive web tasks (data entry, form submission, content posting)
- Needs to operate multiple websites or tabs concurrently
How It Works
Actionbook provides up-to-date action manuals for the modern web. Action manuals tell agents exactly what to do on a page — no parsing, no guessing.
Why this matters:
- 10x faster: action manuals provide selectors and page structure upfront. No snapshot-per-step loop needed.
- Accurate: handles SPAs, streaming components, dropdowns, date pickers, and dynamic content reliably.
- Concurrent: stateless architecture with explicit --session/--tab. Operate dozens of tabs in parallel.
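To show why explicit `--session`/`--tab` arguments make parallel operation straightforward, here is a rough sketch using Python's `asyncio`. Only the two flag names come from the text above. The `browser` prefix, the action names, the argument order, and the helpers `build_command`, `run_command`, and `run_all` are assumptions made for illustration, and the executor is a placeholder that returns the command string rather than invoking any real CLI.

```python
import asyncio


def build_command(action: str, target: str, session: str, tab: str) -> list[str]:
    """Build an argument list that names its session and tab explicitly.

    The "browser" prefix and argument order are hypothetical; only the
    --session and --tab flags are taken from the description above.
    """
    return ["browser", action, target, "--session", session, "--tab", tab]


async def run_command(argv: list[str]) -> str:
    """Placeholder executor: a real one would invoke the CLI here."""
    await asyncio.sleep(0)  # yield control, standing in for real I/O
    return " ".join(argv)


async def run_all(jobs: list[tuple[str, str, str, str]]) -> list[str]:
    """Run every job concurrently; results come back in job order."""
    commands = [build_command(*job) for job in jobs]
    return await asyncio.gather(*(run_command(c) for c in commands))


def main() -> list[str]:
    jobs = [
        ("click", "#submit", "s1", "t1"),
        ("text", "h1", "s1", "t2"),
    ]
    return asyncio.run(run_all(jobs))
```

Because each command carries its own session and tab identifiers, no command depends on which tab was "current" when it ran, so the jobs can be scheduled in any order without interfering with each other.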