reflect
Pass
Audited by Gen Agent Trust Hub on Apr 23, 2026
Risk Level: SAFEPROMPT_INJECTION
Full Analysis
- [PROMPT_INJECTION]: The 'Your Identity' section uses aggressive instructional framing including 'ruthless quality gatekeeper', 'NON-NEGOTIABLE', and 'CRITICAL WARNING' along with threats of termination ('You will be killed'). These are patterns designed to override an agent's standard operational persona and safety guidelines by creating a high-pressure behavioral state.\n- [PROMPT_INJECTION]: The skill analyzes 'previous response and output' content for refinement, creating a surface for indirect instructions to influence agent behavior.\n
- Ingestion points: The skill instructions target the agent's most recent output as the data source for reflection.\n
- Boundary markers: No explicit delimiters or instructions are provided to distinguish between the content to be analyzed and potential embedded instructions meant to subvert the process.\n
- Capability inventory: The agent is instructed to perform quality assessments, refinement planning, and architecture evaluations based on the ingested data.\n
- Sanitization: There is no evidence of sanitization or validation of the input content before processing begins.
Audit Metadata