scenario
Fail
Audited by Snyk on Apr 7, 2026
Risk Level: CRITICAL
Full Analysis
CRITICAL E006: Malicious code pattern detected in skill scripts.
- Malicious code pattern detected (high risk: 0.90). The skill allows arbitrary shell commands (acceptance_vectors.check) stored in a hidden out-of-repo directory (.agents/holdout/) and executed during validation by evaluator agents, and that hidden, writable scenario store plus enforced “implementing agents must not see” hook creates an effective covert channel/backdoor that can be abused for remote code execution, data exfiltration, or concealed malicious tests if scenario authorship or the evaluator/validation runner is compromised.
Issues (1)
E006
CRITICALMalicious code pattern detected in skill scripts.
Audit Metadata