systematic-debugging

Pass

Audited by Gen Agent Trust Hub on Mar 21, 2026

Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill provides a bash script find-polluter.sh designed to identify which tests in a suite are creating unwanted side effects. The script uses the npm test command to execute discovered test files sequentially. This is a standard utility for debugging test environments.\n- [PROMPT_INJECTION]: Several files (test-pressure-1.md, test-pressure-2.md, test-pressure-3.md) contain high-pressure role-playing scenarios designed to evaluate the agent's adherence to the skill's debugging rules. These scenarios mimic prompt injection techniques to challenge the agent's decision-making in emergency situations.\n- [PROMPT_INJECTION]: The debugging process involves analyzing external data such as error logs and stack traces, which represents an indirect prompt injection surface.\n
  • Ingestion points: Phase 1 of the debugging process (SKILL.md) and root-cause-tracing.md instruct the agent to ingest and analyze error messages and logs.\n
  • Boundary markers: No specific delimiters or "ignore instructions" markers are provided for the analysis of log content.\n
  • Capability inventory: The skill provides command execution capabilities via find-polluter.sh and assumes standard agent access to file systems and development tools.\n
  • Sanitization: The skill does not specify sanitization or validation routines for error data before processing.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 21, 2026, 06:01 AM