skill-creator
Pass
Audited by Gen Agent Trust Hub on Mar 27, 2026
Risk Level: SAFECOMMAND_EXECUTIONEXTERNAL_DOWNLOADSPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The script
scripts/run_eval.pyusessubprocess.Popento execute theclaudeCLI for running evaluation queries against skill drafts. Additionally,eval-viewer/generate_review.pyusessubprocess.runto calllsoffor managing local network ports when starting the evaluation viewer server. - [EXTERNAL_DOWNLOADS]: The skill uses the
anthropicPython client inscripts/improve_description.pyandscripts/run_loop.pyto communicate with Anthropic's API for generating and refining skill descriptions. Theeval-viewer/viewer.htmlfile also loads the SheetJS library from a public CDN (cdn.sheetjs.com) to render spreadsheet outputs. - [PROMPT_INJECTION]: The skill processes untrusted user-provided test prompts and execution transcripts through specialized agents like
grader.mdandanalyzer.md. This architecture presents an indirect prompt injection surface where malicious content in a test case could attempt to influence the grading logic. Theagents/grader.mdinstructions include specific defensive guidelines to ensure evidence for passing grades is substantive rather than superficial.
Audit Metadata