playwright-cli
Warn
Audited by Gen Agent Trust Hub on Mar 10, 2026
Risk Level: MEDIUMREMOTE_CODE_EXECUTIONCOMMAND_EXECUTIONDATA_EXFILTRATIONPROMPT_INJECTIONCREDENTIALS_UNSAFE
Full Analysis
- [REMOTE_CODE_EXECUTION]: The skill includes a
run-codecommand that allows the execution of arbitrary JavaScript code strings within the browser context. This provides a powerful vector for code execution if the agent is subverted. - [COMMAND_EXECUTION]: The skill relies on the
Bashtool to executenpxcommands, which dynamically downloads and runs the@playwright/clipackage. This allows the agent to execute a wide range of browser-related system commands. - [DATA_EXFILTRATION]: The skill provides commands such as
state-save,cookie-get, andlocalstorage-getto interact with sensitive browser data. This information, which may include authentication tokens and session cookies, is saved to local files likeauth.json, creating a risk of credential exposure or exfiltration if these files are accessed by unauthorized entities. - [CREDENTIALS_UNSAFE]: The skill explicitly instructs and facilitates the storage of authentication state (cookies and local storage) in
auth.jsonfiles. While it provides warnings against committing these files, the mechanism itself creates a high-value target for credential theft. - [PROMPT_INJECTION]: The skill's core functionality involves navigating to external websites and extracting their content using
snapshotandeval. This creates a vulnerability to indirect prompt injection. 1. Ingestion points: Untrusted data enters the agent context via thegoto,snapshot, andevalcommands when interacting with external web pages. 2. Boundary markers: The instructions do not define any boundary markers or delimiters to isolate content retrieved from web pages from the agent's system instructions. 3. Capability inventory: The skill possesses extensive capabilities, including executing shell commands viaBashand arbitrary JavaScript via therun-codecommand. 4. Sanitization: There is no evidence of sanitization, filtering, or validation performed on the data extracted from external websites before it is processed by the agent.
Audit Metadata