experiment-plan

Pass

Audited by Gen Agent Trust Hub on Apr 19, 2026

Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill specifies a fallback mechanism using Bash (cat << 'EOF' > file) to write large files in chunks. This enables the agent to modify the file system through shell commands when standard API-based write tools reach their limits.- [PROMPT_INJECTION]: The skill is vulnerable to indirect prompt injection due to its ingestion of data from the web and local logs without adequate security controls.
  • Ingestion points: Research data retrieved via WebFetch and local files in the refine-logs/ directory.
  • Boundary markers: Absent; the instructions do not require the agent to use delimiters or ignore embedded commands in the processed data.
  • Capability inventory: The skill uses Bash(*), Write, Edit, and Agent tools, allowing for significant system impact.
  • Sanitization: None; there are no instructions to sanitize or validate the content of the research proposal or web data before processing.
Audit Metadata
Risk Level
SAFE
Analyzed
Apr 19, 2026, 03:14 AM