skill-creator

Pass

Audited by Gen Agent Trust Hub on Apr 18, 2026

Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill uses subprocess.run and subprocess.Popen in scripts/run_eval.py and scripts/improve_description.py to interact with the claude CLI for running skill evaluations and description optimization loops. This is a core part of the skill's functionality.
  • [COMMAND_EXECUTION]: Instructions in SKILL.md guide the agent to execute various shell commands including python, bash, nohup, and kill to aggregate benchmark results and manage the evaluation viewer server.
  • [EXTERNAL_DOWNLOADS]: The skill references a public repository on GitHub (anthropics/skills) for attribution; this is a trusted source and is handled according to established security guidelines.
Audit Metadata
Risk Level
SAFE
Analyzed
Apr 18, 2026, 03:41 PM