autopilot

Pass

Audited by Gen Agent Trust Hub on Mar 16, 2026

Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill utilizes shell commands to manage a local development environment during the QA phase. It uses lsof to check for services on port 3000, bun dev:next to initiate a development server, and kill to terminate processes. These are standard operations for an autonomous development agent.\n- [PROMPT_INJECTION]: The skill exhibits a surface for indirect prompt injection as it processes untrusted data from GitHub issues via the gh issue view command. This content is used to inform code generation and project specifications.\n
  • Ingestion points: Untrusted issue content is retrieved using gh issue view in SKILL.md.\n
  • Boundary markers: The skill does not employ explicit delimiters or instructions to isolate issue descriptions from the agent's internal logic.\n
  • Capability inventory: The agent can execute shell commands, perform local network requests, and commit code to a repository via associated helper skills.\n
  • Sanitization: There is no evidence of input validation or sanitization applied to the retrieved GitHub data before it is processed by the AI models.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 16, 2026, 06:04 PM