Browser Automation

Help the user automate browser-based tasks using Yutori's Navigator agent.

Process

  1. Understand the task

    • What website needs to be automated?
    • What actions are required? (clicking, typing, extracting data)
    • Does it require login or authentication?
  2. Define the task clearly

    • Break complex workflows into clear steps
    • Specify what data to extract if applicable
    • Note any buttons or elements to interact with
  3. Start the browsing task

    Use run_browsing_task with:

    • task: Clear natural language instructions
    • start_url: The URL to begin browsing
    • max_steps: Step budget for the agent; 25 by default, up to 100 for complex flows
    • output_fields: For structured data extraction (e.g., ["name", "price", "url"])
  4. Poll for results

    • Browsing typically takes 30-120 seconds depending on complexity
    • Use get_browsing_task_result to check status
    • Poll every 10-15 seconds until complete
  5. Review and validate

    • Check the extracted data or confirmation
    • Verify the task completed as expected
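
The start, poll, and validate steps above can be sketched in Python. This is a minimal illustration, not Yutori's client: `run_browsing_task` and `get_browsing_task_result` are tools rather than Python functions, so the polling helper takes the result lookup as a plain callable. The `status` key, its `"succeeded"`/`"failed"` values, the 300-second timeout, and the helper names are all assumptions made for the example.

```python
import time
from typing import Any, Callable, Dict, List, Optional, Sequence

Json = Dict[str, Any]


def build_request(task: str, start_url: str, max_steps: int = 25,
                  output_fields: Optional[List[str]] = None) -> Json:
    """Assemble the arguments listed under step 3."""
    # Upper bound follows the range given above; the lower bound is assumed.
    if not 1 <= max_steps <= 100:
        raise ValueError("max_steps must be between 1 and 100")
    request: Json = {"task": task, "start_url": start_url, "max_steps": max_steps}
    if output_fields:
        request["output_fields"] = list(output_fields)
    return request


def poll_until_done(get_result: Callable[[str], Json], task_id: str,
                    interval_s: float = 10.0, timeout_s: float = 300.0,
                    done_statuses: Sequence[str] = ("succeeded", "failed"),
                    sleep: Callable[[float], None] = time.sleep) -> Json:
    """Call get_result every interval_s seconds until a terminal status appears."""
    waited = 0.0
    while True:
        result = get_result(task_id)
        # "status" and its values are hypothetical; adapt to the real response.
        if result.get("status") in done_statuses:
            return result
        if waited >= timeout_s:
            raise TimeoutError(f"task {task_id} still running after {waited:.0f}s")
        sleep(interval_s)
        waited += interval_s


def missing_fields(record: Json, output_fields: List[str]) -> List[str]:
    """Return the requested fields that are absent or empty in one record."""
    return [name for name in output_fields if record.get(name) in (None, "")]
```

Passing `sleep` as a parameter keeps the helper testable without real waiting, and `missing_fields` gives step 5 a concrete check when `output_fields` was set.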

Task Writing Tips

  • Be specific about UI elements: "Click the blue 'Submit' button"
  • Reference visible text when possible
  • For forms, specify which fields get which values
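
As an illustration of the tips above, a form-filling instruction can be assembled from field labels and values so that every field and the final button are named by their visible text. The `form_task` helper is hypothetical and only builds the `task` string; it does not call any tool.

```python
from typing import Dict


def form_task(fields: Dict[str, str], submit_label: str) -> str:
    """Build a task string that names each field and the button by visible text."""
    steps = [f'Type "{value}" into the "{label}" field.'
             for label, value in fields.items()]
    steps.append(f'Click the "{submit_label}" button.')
    return " ".join(steps)


print(form_task({"Email": "a@example.com"}, "Submit"))
# prints: Type "a@example.com" into the "Email" field. Click the "Submit" button.
```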

$ARGUMENTS
