scenario

Fail

Audited by Snyk on Apr 7, 2026

Risk Level: CRITICAL
Full Analysis

CRITICAL E006: Malicious code pattern detected in skill scripts.

  • Malicious code pattern detected (high risk: 0.90). The skill allows arbitrary shell commands (acceptance_vectors.check) to be stored in a hidden, out-of-repo directory (.agents/holdout/) and executed by evaluator agents during validation. This hidden, writable scenario store, combined with the enforced “implementing agents must not see” hook, creates an effective covert channel or backdoor. If scenario authorship or the evaluator/validation runner is compromised, it can be abused for remote code execution, data exfiltration, or concealed malicious tests.
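To make the flagged pattern concrete, here is a minimal sketch of a pre-execution audit for stored check commands. It is not the skill's actual code. It assumes a scenario record is a dictionary whose `acceptance_vectors` entries each carry a `check` string; the audit names only the `acceptance_vectors.check` field, so that layout is a guess. The names `ALLOWED_PROGRAMS`, `SHELL_METACHARACTERS`, `audit_check`, and `audit_scenario` are hypothetical and chosen for illustration only.

```python
import shlex

# Hypothetical allowlist of programs a check may invoke (an assumption, not from the audit).
ALLOWED_PROGRAMS = {"pytest", "go", "npm"}

# Characters that only carry meaning when a command is run through a shell.
SHELL_METACHARACTERS = set(";|&$`<>\n")


def audit_check(command: str) -> list[str]:
    """Return the reasons a check command looks unsafe (empty list if none)."""
    problems = []
    if any(ch in SHELL_METACHARACTERS for ch in command):
        problems.append("contains shell metacharacters")
    try:
        argv = shlex.split(command)
    except ValueError:
        # Unbalanced quotes and similar: the command cannot be split into arguments.
        return problems + ["cannot be parsed into arguments"]
    if not argv:
        problems.append("is empty")
    elif argv[0] not in ALLOWED_PROGRAMS:
        problems.append(f"invokes non-allowlisted program {argv[0]!r}")
    return problems


def audit_scenario(scenario: dict) -> dict[str, list[str]]:
    """Map each flagged check command in a scenario record to its problems."""
    findings = {}
    for vector in scenario.get("acceptance_vectors", []):
        command = vector.get("check", "")
        problems = audit_check(command)
        if problems:
            findings[command] = problems
    return findings
```

A check such as `pytest tests/test_login.py` passes this audit, while one that pipes a download into a shell is flagged twice: once for the pipe and once for the program it invokes. An allowlist of this kind narrows what a stored check can run, but it does not address the hidden, writable location of the scenario store itself.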

Issues (1)

CRITICAL E006: Malicious code pattern detected in skill scripts.

Audit Metadata

Risk Level: CRITICAL
Analyzed: Apr 7, 2026, 07:18 PM
Issues: 1