addon-deterministic-eval-suite

Pass

Audited by Gen Agent Trust Hub on Mar 3, 2026

Risk Level: SAFE
Full Analysis
  • [COMMAND_EXECUTION]: The skill is designed to execute local shell scripts (e.g., bash evals/deterministic/run.sh) to perform pass/fail evaluation gates. This execution is confined to the project's local directory and serves the primary purpose of the skill.
  • [SAFE]: No remote code execution, data exfiltration, or obfuscation techniques were detected. The skill relies on local configuration files and standard shell commands for validation.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 3, 2026, 11:14 PM