skills/stablyai/orca/auto-review-fix/Gen Agent Trust Hub

auto-review-fix

Warn

Audited by Gen Agent Trust Hub on Apr 4, 2026

Risk Level: MEDIUMPROMPT_INJECTIONCOMMAND_EXECUTION
Full Analysis
  • [PROMPT_INJECTION]: The skill contains explicit instructions to override the agent's standard safety protocols regarding human confirmation. This suppresses user oversight for destructive actions like file modifications and shell command execution.
  • Evidence: "IMPORTANT: Execute this entire process autonomously without asking the user for confirmation at any step. Just do the iterations." (SKILL.md)
  • Evidence: "DO NOT ask for user confirmation
  • Execute autonomously" (SKILL.md)
  • [COMMAND_EXECUTION]: The skill executes several shell commands to manage repository state and verify fixes.
  • Evidence: git diff $(git merge-base origin/main HEAD) used to fetch changes.
  • Evidence: npm run typecheck used to verify code integrity after fixes.
  • Evidence: rm -f used during cleanup of context files.
  • [PROMPT_INJECTION]: Indirect Prompt Injection Surface. The skill processes untrusted code changes which could contain instructions designed to influence the behavior of the review or fix subagents.
  • Ingestion points: The skill reads git diff output and file contents in Phase 0 and Phase 1, which are then passed to subagents.
  • Boundary markers: Absent. The subagent prompts do not use delimiters or instructions to ignore instructions embedded within the code being reviewed.
  • Capability inventory: The skill utilizes shell command execution (git, npm, rm) and the Edit tool for file modifications.
  • Sanitization: None. Code content is interpolated directly into subagent prompts without escaping or validation.
Audit Metadata
Risk Level
MEDIUM
Analyzed
Apr 4, 2026, 05:30 PM