evaluation-suites
Pass
Audited by Gen Agent Trust Hub on Mar 30, 2026
Risk Level: SAFE
Full Analysis
- [SAFE]: The skill provides documentation for the 'opik' Python library, which is a legitimate tool provided by the author (Comet ML) for evaluating AI agents and models.
- [SAFE]: The included code example demonstrates standard usage of the evaluation framework, including defining testing criteria (assertions) and execution policies for CI/CD integration.
- [SAFE]: No evidence of hardcoded credentials, unauthorized data access, or malicious command execution was found. The skill adheres to best practices by using the official SDK for the specified platform.
Audit Metadata