agent-evaluation
Fail
Audited by Socket on Mar 11, 2026
1 alert found:
Obfuscated File (HIGH)
SKILL.md
The skill describes a comprehensive evaluation framework for AI agents with well-scoped components (graders, rubrics, transcripts, CI/CD integration). It shows no security risks from credential access, external data exfiltration, or supply-chain behavior. Its footprint is consistent with its stated purpose as an evaluation tooling framework rather than an agent action surface. Some process-level risks remain (e.g., graders and transcripts could inadvertently leak sensitive data into shared logs), but nothing in the described artifacts indicates malicious or high-risk behavior.
Confidence: 98%
Audit Metadata