Browser Automation with Real Browser MCP

Use this skill when you need to interact with the user's actual browser - clicking, typing, reading pages, taking screenshots, or navigating.

Before You Start

Verify the extension is connected: try browser_tabs with action "list" first
If disconnected, ask the user to check the extension icon (should show green "ON")
Never close tabs you didn't create

Start with browser_snapshot to get the accessibility tree. This gives you refs like "e12" that you use for interaction.

For large pages, scope with a CSS selector: browser_snapshot with selector: "main" or selector: ".content".

Use browser_text to extract raw text when you need the full content.

Always snapshot first, then use refs:

browser_scroll down to load more content
browser_wait with a selector for lazy-loaded elements
Snapshot again after scrolling - refs are regenerated
For virtual scroll containers (Twitter feeds, Reddit), pass the container's CSS selector to browser_scroll