judge-with-debate

Fail

Audited by Snyk on Feb 16, 2026

Risk Level: HIGH
Full Analysis

HIGH W007: Insecure credential handling detected in skill instructions.

  • Insecure credential handling detected (high risk: 1.00). The prompt explicitly instructs judges to "quote exact text" from solution files and include those quotes in reports, which forces the LLM to reproduce any secrets present in those files verbatim and thus creates an exfiltration risk.
Audit Metadata
Risk Level
HIGH
Analyzed
Feb 16, 2026, 03:35 AM