addon-deterministic-eval-suite
Pass
Audited by Gen Agent Trust Hub on Mar 3, 2026
Risk Level: SAFE
Full Analysis
- [COMMAND_EXECUTION]: The skill is designed to execute local shell scripts (e.g., bash evals/deterministic/run.sh) to perform pass/fail evaluation gates. This execution is confined to the project's local directory and serves the primary purpose of the skill.
- [SAFE]: No remote code execution, data exfiltration, or obfuscation techniques were detected. The skill relies on local configuration files and standard shell commands for validation.
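To make the pass/fail gate described above concrete, here is a minimal sketch of how a local script's exit status could be mapped to a gate result. The function name run_gate and its return codes are hypothetical and are not taken from the skill itself. The only detail drawn from the audit is the example script path evals/deterministic/run.sh. The path check is an assumption about what "confined to the project's local directory" might look like in practice.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper (run_gate is an illustrative name, not part of the skill).
# Runs a project-local script and reports PASS or FAIL based on its exit status.
run_gate() {
  local script="$1"

  # Assumption: reject absolute paths and any path containing "..",
  # so the gate only runs scripts under the current project directory.
  case "$script" in
    /*|*..*)
      echo "FAIL: script path must be project-relative"
      return 2
      ;;
  esac

  # A missing script is treated as a gate failure, not a silent pass.
  if [ ! -f "$script" ]; then
    echo "FAIL: $script not found"
    return 2
  fi

  # Exit status 0 from the script means the gate passes; anything else fails.
  if bash "$script"; then
    echo "PASS"
    return 0
  else
    echo "FAIL"
    return 1
  fi
}

# Example usage, from the project root:
#   run_gate evals/deterministic/run.sh
```

Treating any non-zero exit status as a failure keeps the gate deterministic: the result depends only on what the local script returns, with no network calls involved.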
Audit Metadata