hypothesis-tester

Pass

Audited by Gen Agent Trust Hub on Apr 4, 2026

Risk Level: SAFE
Full Analysis
  • [SAFE]: The skill implements a restricted execution environment by limiting its tool access to 'Read', 'Glob', and 'Grep' via the allowed-tools frontmatter field. This follows the principle of least privilege, preventing unauthorized network or write operations.
  • [SAFE]: Data processing is limited to local project files for the purpose of experiment analysis. No mechanisms for external data exfiltration or unauthorized credential access were found.
  • [SAFE]: All external resource links point to reputable educational and statistical documentation (Evan Miller, Cambridge University Press, Wikipedia) used for legitimate experiment design purposes.
  • [SAFE]: The instructions and examples focus entirely on data science and product management best practices, with no evidence of prompt injection or behavioral overrides.
Audit Metadata
Risk Level
SAFE
Analyzed
Apr 4, 2026, 01:55 PM