autoresearch
Pass
Audited by Gen Agent Trust Hub on Mar 14, 2026
Risk Level: SAFE (COMMAND_EXECUTION, PROMPT_INJECTION)
Full Analysis
- [COMMAND_EXECUTION]: The skill facilitates the execution of shell commands through the `run_shell_command` function in `scripts/common.py`. This capability is used by `scripts/run_experiment.py` to perform benchmark trials and validation checks defined in the experiment configuration.
- [PROMPT_INJECTION]: The `SKILL.md` file contains instructions that direct the agent to adhere to a structured 'experiment loop,' including mandatory up-front Q&A and hypothesis documentation, which serves as a behavioral constraint for the agent's operation.
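To make the command-execution finding concrete, here is a minimal sketch of what a helper like `run_shell_command` might look like. The audit does not show the actual code in `scripts/common.py`, so the signature, the `CommandResult` type, and the `timeout` parameter are all assumptions made for illustration.

```python
import shlex
import subprocess
import sys
from dataclasses import dataclass


@dataclass
class CommandResult:
    """Hypothetical container for the outcome of one command."""
    returncode: int
    stdout: str
    stderr: str


def run_shell_command(command: str, timeout: float = 60.0) -> CommandResult:
    """Illustrative stand-in only; not the skill's real implementation."""
    # Split the string into argv and run without shell=True, so shell
    # metacharacters in the command are not interpreted by a shell.
    argv = shlex.split(command)
    completed = subprocess.run(
        argv,
        capture_output=True,  # collect stdout/stderr for later inspection
        text=True,            # decode output as str instead of bytes
        timeout=timeout,      # raises subprocess.TimeoutExpired on overrun
        check=False,          # report failures through returncode
    )
    return CommandResult(completed.returncode, completed.stdout, completed.stderr)


if __name__ == "__main__":
    # Example: run a trivial "benchmark trial" using the current interpreter.
    trial = shlex.join([sys.executable, "-c", "print('trial ok')"])
    result = run_shell_command(trial)
    print(result.returncode, result.stdout.strip())
```

A runner such as `scripts/run_experiment.py` could call a helper of this shape once per trial or validation check listed in the experiment configuration. Whether the real helper avoids `shell=True` or enforces a timeout is not stated in the audit.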
Audit Metadata