evaluation-framework

Pass

Audited by Gen Agent Trust Hub on May 9, 2026

Risk Level: SAFE
Full Analysis
  • [SAFE]: No security issues were detected. The skill consists of architectural patterns and best practices for evaluation frameworks, rubrics, and scoring systems.
  • [SAFE]: The skill does not attempt to execute code, access sensitive files, or perform network operations. The documentation links and command examples provided are benign and intended for instructional purposes.
  • [SAFE]: No obfuscation, prompt injection, or persistence mechanisms were found. The repetitive verification instructions in the SKILL.md file appear to be template artifacts and do not pose a security risk.
Audit Metadata
Risk Level
SAFE
Analyzed
May 9, 2026, 07:38 AM