eval-harness
Warn
Audited by Snyk on Feb 27, 2026
Risk Level: MEDIUM
Full Analysis
MEDIUM W013: Attempt to modify system services in skill instructions.
- Attempt to modify system services in skill instructions detected (high risk: 0.90). The prompt explicitly includes an eval example "Can create new user account", which would instruct the agent to perform privileged, system-state–modifying actions (creating users), so it pushes the agent toward compromising the machine state.
Audit Metadata