eval-harness
Pass
Audited by Gen Agent Trust Hub on Mar 23, 2026
Risk Level: SAFENO_CODE
Full Analysis
- [SAFE]: No security issues detected. The skill is purely documentation-based and provides guidelines for setting up evaluation frameworks.
- [NO_CODE]: This skill does not include any executable scripts, binary files, or automated logic. It serves as a set of templates and best practices for developers to follow manually or implement in their own environments.
Audit Metadata