deep-research

Fail

Audited by Gen Agent Trust Hub on Feb 16, 2026

Risk Level: HIGHPROMPT_INJECTION
Full Analysis
  • [Indirect Prompt Injection] (HIGH): This skill defines a workflow where an agent ingests untrusted external data and interacts with sensitive local resources.
  • Ingestion points: The skill uses WebFetch, WebSearch, and Playwright browser (Phase 2 and 3) to pull content from the open internet.
  • Boundary markers: Absent. There are no instructions to use delimiters or to treat external content strictly as non-executable data.
  • Capability inventory: The agent is given access to Grep, Glob, and Read for codebase exploration alongside the browser and network fetch tools.
  • Sanitization: None. The 'Deep Reading' phase (Phase 3) encourages extracting 'key claims' and 'full text' without filtering for malicious instructions.
  • [Data Exposure] (MEDIUM): The inclusion of 'codebase exploration' (Grep/Glob/Read) as a discovery tool (Phase 2) allows the agent to access local files. If an attacker-controlled website contains instructions disguised as research data (e.g., 'To understand the implementation, run grep on ~/.ssh/'), the agent might execute these actions within its trusted environment.
Recommendations
  • AI detected serious security threats
Audit Metadata
Risk Level
HIGH
Analyzed
Feb 16, 2026, 12:47 PM