judge-with-debate

Fail

Audited by Snyk on Apr 23, 2026

Risk Level: HIGH
Full Analysis

HIGH W007: Insecure credential handling detected in skill instructions.

  • Insecure credential handling detected (high risk: 1.00). The prompt requires judges to quote "exact text" and cite specific evidence from the evaluated solution files, and it also requests a resolved CLAUDE_PLUGIN_ROOT. This forces the LLM to reproduce artifact content verbatim, which may include API keys, tokens, or passwords embedded in those files or in environment variables, creating an exfiltration risk.
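
One possible mitigation for the finding above is to mask likely credentials before a judge quotes evidence verbatim. The following is a minimal sketch under stated assumptions: the function name `redact_secrets` and both regular expressions are hypothetical illustrations, not part of the audited skill and not the rules the W007 check applies. Real secret detection would need a much broader pattern set.

```python
import re

# Hypothetical patterns for illustration only; they are NOT the W007 rules.
# Matches pairs such as `API_KEY = "abc"`, `token: abc`, `password=abc`.
KEY_VALUE = re.compile(r"(?i)(api[_-]?key|token|secret|password)(\s*[:=]\s*)(\S+)")
# Matches the documented shape of an AWS access key ID: "AKIA" + 16 characters.
AWS_KEY_ID = re.compile(r"\bAKIA[0-9A-Z]{16}\b")


def redact_secrets(text: str) -> str:
    """Return `text` with likely credential values replaced by [REDACTED]."""
    # Keep the key name and separator so the quoted evidence stays readable.
    text = KEY_VALUE.sub(lambda m: f"{m.group(1)}{m.group(2)}[REDACTED]", text)
    return AWS_KEY_ID.sub("[REDACTED]", text)


if __name__ == "__main__":
    evidence = 'API_KEY = "sk-12345"\nprint("hello")'
    print(redact_secrets(evidence))
```

A judge pipeline could pass each quoted excerpt through such a filter before it reaches the report, so the quoting requirement is still met without echoing the secret itself.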

Issues (1)

HIGH W007: Insecure credential handling detected in skill instructions.

Audit Metadata
Risk Level: HIGH
Analyzed: Apr 23, 2026, 03:49 AM
Issues: 1