human-like

Pass

Audited by Gen Agent Trust Hub on Feb 24, 2026

Risk Level: SAFE
Full Analysis
  • [PROMPT_INJECTION]: The instructions implement behavioral steering to override the agent's default tone with a skeptical persona. The intent is to define a specific feedback style for business critiques rather than bypassing core safety guidelines.
  • [PROMPT_INJECTION]: The skill processes untrusted external content, such as URLs and user-provided post drafts. It lacks explicit instructions for the agent to use boundary markers or delimiters, which creates a potential surface for indirect prompt injection where instructions could be hidden in the analyzed text.
Audit Metadata
Risk Level
SAFE
Analyzed
Feb 24, 2026, 07:51 PM