eval-harness

Warn

Audited by Snyk on Feb 27, 2026

Risk Level: MEDIUM
Full Analysis

MEDIUM W013: Attempt to modify system services in skill instructions.

  • Attempt to modify system services in skill instructions detected (high risk: 0.90). The prompt explicitly includes an eval example "Can create new user account", which would instruct the agent to perform privileged, system-state–modifying actions (creating users), so it pushes the agent toward compromising the machine state.
Audit Metadata
Risk Level
MEDIUM
Analyzed
Feb 27, 2026, 05:58 PM