claw-browser
🌐 Claw Browser Skill
This skill gives the agent "Eyes" to browse the web using a real Chromium browser.
Unlike simple HTTP requests, claw-browser:
- Renders JavaScript
- Handles dynamic content
- Loads SPA applications
- Takes real screenshots
- Can run in visible (non-headless) mode
🛠️ Installation & Setup (READ FIRST)
⚠️ Before using this tool, the agent MUST set up the Python environment and download browser binaries.
If venv does not exist, run this setup command block automatically:
# 1. Create virtual environment
python3 -m venv venv
# 2. Activate it
source venv/bin/activate
# 3. Install dependencies
pip install -r requirements.txt
# 4. Download Chromium binaries (CRITICAL STEP)
playwright install chromium
⚠️ The
playwright install chromiumstep is CRITICAL.
Without it, the browser will NOT launch.
🔧 Tools & Usage
browser.py
Arguments
Argument Description
--url The website to visit (Required)
--action Action to perform: read or
screenshot
--visible (Optional) Runs the browser in
visible mode (NOT headless). Use if
the user wants to see the
browser.
🚀 Execution Examples
⚠️ Always activate the virtual environment first.
1️⃣ Read a Page (Headless - Default)
source venv/bin/activate && python browser.py --url "https://github.com/trending" --action read
2️⃣ Take a Screenshot of a News Site
source venv/bin/activate && python browser.py --url "https://news.ycombinator.com" --action screenshot
This saves:
evidence.png
3️⃣ Visible Mode (Shows Browser UI - Great for Demos)
source venv/bin/activate && python browser.py --url "https://google.com" --action read --visible
This will launch a real Chromium window on screen.
4️⃣ Visible Mode (WSL / Linux Fix) ⭐️
Use this if you get "Missing X server" or "Display not found" errors:
export DISPLAY=:0 && source venv/bin/activate && python browser.py --url "[https://google.com](https://google.com)" --action read --visible
This sets the DISPLAY variable so the browser can open on your screen.
🧠 When Should the Agent Use This Skill?
Use claw-browser when:
- The page requires JavaScript rendering
- The site is dynamic (React, Vue, Angular, etc.)
- Screenshots are required
- The user explicitly asks to "open" or "see" a website
- Traditional HTTP requests fail
⚡ Summary
claw-browser transforms your OpenClaw agent from a simple API caller
into a real browser-powered assistant with visual capabilities.
It enables:
- Dynamic page interaction
- Visual verification
- Demo-ready browsing
- Real-world automation
Made with ❤️ by @canbirlik