skills/clawdbot/skills/screen-monitor

screen-monitor

SKILL.md

Screen Monitor

This skill provides two ways for the agent to see and interact with your screen.

🟢 Path A: Fast Share (WebRTC)

Best for: Quick visual checks, restricted browsers, or non-technical environments.

Tools

  • screen_share_link: Generates a local WebRTC portal URL.
  • screen_analyze: Captures the current frame from the portal and analyzes it with vision.

Usage:

# Get the link
bash command:"{baseDir}/references/get-share-url.sh"

# Analyze
bash command:"{baseDir}/references/screen-analyze.sh"

šŸ”µ Path B: Full Control (Browser Relay)

Best for: Deep debugging, UI automation, and clicking/typing in tabs.

Setup

  1. Run clawdbot browser extension install.
  2. Load the unpacked extension from clawdbot browser extension path.
  3. Click the Clawdbot icon in your Chrome toolbar to Attach.

Tools

  • browser action:snapshot: Take a precise screenshot of the attached tab.
  • browser action:click: Interact with elements (requires profile="chrome").

Technical Details

  • Port: 18795 (WebRTC Backend)
  • Files:
    • web/screen-share.html: The sharing portal.
    • references/backend-endpoint.js: Frame storage server.
Weekly Installs
5
Repository
clawdbot/skills
Installed on
windsurf3
opencode3
codex3
clawdbot2
trae2
cursor2