hypothesis-tester
Pass
Audited by Gen Agent Trust Hub on Apr 4, 2026
Risk Level: SAFE
Full Analysis
- [SAFE]: The skill implements a restricted execution environment by limiting its tool access to 'Read', 'Glob', and 'Grep' via the allowed-tools frontmatter field. This follows the principle of least privilege, preventing unauthorized network or write operations.
- [SAFE]: Data processing is limited to local project files for the purpose of experiment analysis. No mechanisms for external data exfiltration or unauthorized credential access were found.
- [SAFE]: All external resource links point to reputable educational and statistical documentation (Evan Miller, Cambridge University Press, Wikipedia) used for legitimate experiment design purposes.
- [SAFE]: The instructions and examples focus entirely on data science and product management best practices, with no evidence of prompt injection or behavioral overrides.
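As an illustration of the least-privilege finding above, the following is a minimal sketch of how such a check could be reproduced. It is not the audit's actual tooling. It assumes the skill file begins with a frontmatter block delimited by `---` lines and that allowed-tools is written as a comma-separated list on one line. The function names, the READ_ONLY_TOOLS set, and the example file contents are hypothetical.

```python
# Hypothetical allow-list: the three read-only tools named in the audit finding.
READ_ONLY_TOOLS = {"Read", "Glob", "Grep"}


def parse_allowed_tools(skill_text: str) -> list[str]:
    """Return the tools listed under allowed-tools in the leading frontmatter.

    Assumes a simple 'allowed-tools: A, B, C' line; other layouts
    (for example, a multi-line list) are not handled by this sketch.
    """
    lines = skill_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no frontmatter block at the top of the file
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of frontmatter
        key, sep, value = line.partition(":")
        if sep and key.strip() == "allowed-tools":
            return [tool.strip() for tool in value.split(",") if tool.strip()]
    return []


def is_least_privilege(skill_text: str) -> bool:
    """True only if tools are declared and every one of them is read-only."""
    tools = parse_allowed_tools(skill_text)
    return bool(tools) and set(tools) <= READ_ONLY_TOOLS


# Hypothetical skill file contents, for illustration only.
example = """---
name: hypothesis-tester
allowed-tools: Read, Glob, Grep
---
Skill instructions go here.
"""

print(is_least_privilege(example))
```

A skill that omits the field entirely is treated as failing the check here, on the assumption that an undeclared tool list should not be read as a restricted one.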
Audit Metadata