test-fixing

Pass

Audited by Gen Agent Trust Hub on Mar 10, 2026

Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill invokes local tools like make test, git diff, and uv run pytest to diagnose and verify code changes. This is necessary for the skill's purpose but involves running code from the environment.
  • [PROMPT_INJECTION]: The skill is susceptible to indirect prompt injection (Category 8) because it reads and analyzes untrusted data from test outputs and source files to determine its next actions.
  • Ingestion points: Test failure logs from make test and pytest, and content from source files read during the debugging process.
  • Boundary markers: The instructions do not define delimiters or provide warnings to the agent to disregard instructions found within the data it processes.
  • Capability inventory: The skill allows the agent to modify files via an Edit tool and execute shell commands to run tests.
  • Sanitization: There is no specified logic for sanitizing or validating the contents of the test logs or code before use.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 10, 2026, 01:15 AM