browser-automation
Installation
SKILL.md
Browser Automation with Real Browser MCP
Use this skill when you need to interact with the user's actual browser - clicking, typing, reading pages, taking screenshots, or navigating.
Before You Start
- Verify the extension is connected: try
browser_tabswith action "list" first - If disconnected, ask the user to check the extension icon (should show green "ON")
- Never close tabs you didn't create
Reading Pages
Start with browser_snapshot to get the accessibility tree. This gives you refs like "e12" that you use for interaction.
For large pages, scope with a CSS selector: browser_snapshot with selector: "main" or selector: ".content".
Use browser_text to extract raw text when you need the full content.
Interacting
Always snapshot first, then use refs:
browser_clickwithref: "e12"to clickbrowser_typewithref: "e5"andtext: "hello"to typebrowser_press_keywithkey: "Enter"to submitbrowser_scrollwithdirection: "down"to scroll
Dynamic Content (SPAs, social media)
browser_scrolldown to load more contentbrowser_waitwith a selector for lazy-loaded elements- Snapshot again after scrolling - refs are regenerated
- For virtual scroll containers (Twitter feeds, Reddit), pass the container's CSS selector to
browser_scroll
Debugging
browser_consolereads console.log/warn/error outputbrowser_networkshows XHR/fetch requests with status codesbrowser_screenshotcaptures what the user sees
Common Mistakes
- Using stale refs after navigation or scroll (always re-snapshot)
- Trying to click elements in iframes (scope snapshot to the iframe)
- Not waiting for page load after navigation
Weekly Installs
1
Repository
ofershap/real-b…wser-mcpGitHub Stars
4
First Seen
Mar 5, 2026
Security Audits
Installed on
mcpjam1
claude-code1
replit1
junie1
windsurf1
zencoder1